deepfind-cli

A multi-agent research tool in both Web App and CLI, which can:

Search (no limit) in Google / Baidu / Xiaohongshu via opencli
Search BOSS Zhipin jobs via opencli
Follow up on BOSS Zhipin job chats via opencli
Watch video in Bilibili / YouTube and summarize via opencli
Customize any other cli as agent tool

Web App Usage

CLI Usage

uv run -m deepfind.cli "Help me do a research on Robotic Hands, give me a TLDR table" --num-agent 1

uv run -m deepfind.cli "Help me summarize nVidia press conference https://www.bilibili.com/video/BV1EhwmzsEqB - What's new?" --num-agent 1

uv run -m deepfind.cli "Help me summarize https://www.bilibili.com/video/BV1tew5zVEDf What Saining Xie point of view in the interview" --num-agent 1 --quiet

uv run -m deepfind.cli "Help me summarize this YouTube talk https://www.youtube.com/watch?v=dQw4w9WgXcQ" --num-agent 1

uv run -m deepfind.cli "How people think Elon Musk in Xiaohongshu?" --num-agent 2

uv run -m deepfind.cli "Help me search Agent jobs on Boss" --num-agent 1

uv run -m deepfind.cli --list-tools

Install

python3 -m pip install -e .

Install opencli as an optional dependency when you want broad web search through web_search, YouTube transcript support through youtube_transcribe, or BOSS Zhipin workflows through boss_search / boss_detail / boss_chatlist / boss_send:

npm install -g @jackwener/opencli

Install optional ASR dependencies only when you need Bilibili transcription:

python3 -m pip install -e ".[media]"

Pre-download the ASR model on Windows (PowerShell) to avoid first-run delay:

hf download Qwen/Qwen3-ASR-1.7B --repo-type model

uv tool install bilibili-cli
uv tool install xiaohongshu-cli
uv tool install twitter-cli
bili whoami
xhs whoami
twitter whoami

Env

The CLI auto-loads .env from the repo root.

cp .env.example .env

Minimal .env:

QWEN_API_KEY=...
QWEN_MODEL_NAME=qwen3-max

Optional:

QWEN_BASE_URL=https://dashscope.aliyuncs.com/compatible-mode/v1
OPENCLI_BIN=opencli
TWITTER_CLI_BIN=twitter
XHS_CLI_BIN=xhs
BILI_BIN=bili
ASR_MODEL=Qwen/Qwen3-ASR-1.7B
DEEPFIND_AUDIO_DIR=audio
DEEPFIND_TOOL_TIMEOUT=90
GOOGLE_NANO_BANANA_API_KEY=...
GOOGLE_NANO_BANANA_MODEL=gemini-3.1-flash-image-preview
DEEPFIND_IMAGE_DIR=tmp
DEEPFIND_IMAGE_SIZE=2K

Run

uv run -m deepfind.cli "What's new in Xiaohongshu?" --num-agent 2
uv run -m deepfind.cli "Help me summarize video https://www.bilibili.com/video/BV1tew5zVEDf" --num-agent 1
uv run -m deepfind.cli "Help me summarize video https://www.youtube.com/watch?v=dQw4w9WgXcQ" --num-agent 1
uv run -m deepfind.cli "same query" --num-agent 2 --quiet
uv run -m deepfind.cli "same query"

Flags:

query: required unless using --list-tools
--num-agent: 1..4
--max-iter-per-agent: default 50
--quiet: disable formatted progress output
--once: always exit after the first answer
--list-tools: print built-in tool names/descriptions and exit

Chat Mode

When deepfind runs in a real terminal, it now stays open after the first answer so you can continue the conversation with the same multi-agent/tool setup.

uv run -m deepfind.cli "first question"

Follow-up commands:

Press Enter on a blank line to skip it.
Type exit or quit to leave the session.
Use --once to force the old one-shot behavior.
If GOOGLE_NANO_BANANA_API_KEY is set, you can ask for an image after a summary and it will be saved under tmp/.
You can also ask for one or more standalone HTML slides after a summary and they will be saved under tmp/.

uv run -m deepfind.cli "first question" --once

Example follow-up in chat mode:

Generate a 16:9 cover image from that summary and save it under tmp/
Generate 3 HTML slides from that summary and save them under tmp/

Web App

The repo also includes a local web chat UI under web/ with saved chats, live run activity, an iPhone-friendly mobile drawer, and a mode switch for Fast (1 agent) and Expert (4 agents).

Backend:

uv run deepfind-web --reload

Frontend:

cd web
npm install
npm run dev

Open the Vite URL during development, or run npm run build so deepfind-web can serve web/dist directly.

The production shell now includes a web manifest, touch icon, and iPhone Safari safe-area handling so it works cleanly in Safari and when added to the Home Screen.

Deploy To Raspberry Pi

The repo includes deployment scripts for a Raspberry Pi 5 running Ubuntu at david@192.168.0.205.

One-time SSH key bootstrap from your local machine:

./scripts/bootstrap_rpi_ssh_key.sh

Notes:

The bootstrap script is now key-only. It verifies passwordless SSH access and, if access is missing, tells you which public key to install on the Pi.
It uses your existing ~/.ssh/id_ed25519.pub or ~/.ssh/id_rsa.pub by default.
Override PUBKEY_PATH, RPI_HOST, RPI_USER, or RPI_SSH_PORT if needed.

Deploy the app after SSH keys are working:

./scripts/deploy_rpi.sh

What the deploy script does:

Syncs the repo to /home/david/apps/deepfind-cli
Copies the local .env file to the Pi
Installs uv if it is missing
Runs uv sync --frozen, npm ci, and npm run build on the Pi
Installs and restarts a user-level deepfind-web systemd service with systemctl --user

No-password deploy requirements:

Passwordless SSH to the Pi must already work for david@<host>.
The Pi already needs python3 3.11+, curl, tar, and systemd.
deploy_rpi.sh will install a user-local Node.js 20.x automatically when node/npm are missing or too old.
deploy_rpi.sh will also install @jackwener/opencli under the same user so web_search works after deploy.
Boot persistence for the user service requires a one-time admin command on the Pi:

sudo loginctl enable-linger david

Default LAN URL after deployment:

http://192.168.0.205:8000

Useful checks on the Pi:

systemctl --user status deepfind-web
curl http://127.0.0.1:8000/api/health

How It Works

Lead agent splits the query into a few tasks.
Sub-agents call local tools such as web_search, boss_search, boss_detail, boss_chatlist, boss_send, xhs_search_user, xhs_user, xhs_user_posts, xhs_read, twitter_search, twitter_read, bili_transcribe, and youtube_transcribe.
Lead agent merges the results into one answer.

Qwen is used through the OpenAI-compatible chat.completions API.

Video Transcription Tools

bili_transcribe is available to sub-agents and accepts either a Bilibili video URL or a raw BV... ID. It returns transcript text only (no summary generation). If audio/transcripts/<BVID>.txt already exists, the tool reuses it and skips download + ASR transcription.

youtube_transcribe is available to sub-agents and accepts either a YouTube video URL or a raw video ID. It calls opencli youtube transcript and caches grouped transcript text under audio/transcripts/youtube/<VIDEO_ID>.txt.

Setup (WSL):

uv tool install bilibili-cli
bili status

Artifacts:

Segments: audio/<BVID>/seg_*
Transcript: audio/transcripts/<BVID>.txt

Test

python3 -m unittest discover -s tests -v

Name		Name	Last commit message	Last commit date
Latest commit History 38 Commits
deepfind		deepfind
deepfind_cli.egg-info		deepfind_cli.egg-info
doc		doc
scripts		scripts
tests		tests
web		web
.env.example		.env.example
.gitignore		.gitignore
README.md		README.md
pyproject.toml		pyproject.toml
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

deepfind-cli

Web App Usage

CLI Usage

Install

Env

Run

Chat Mode

Web App

Deploy To Raspberry Pi

How It Works

Video Transcription Tools

Test

About

Uh oh!

Releases

Packages

Uh oh!

Contributors 1

Languages

Folders and files

Latest commit

History

Repository files navigation

deepfind-cli

Web App Usage

CLI Usage

Install

Env

Run

Chat Mode

Web App

Deploy To Raspberry Pi

How It Works

Video Transcription Tools

Test

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors 1

Languages

Packages