arxiv-digest

A Claude Code-powered research assistant that fetches daily arXiv papers, scores them against your project, and generates standardized 1-page LaTeX summary cards + short podcast episodes.

You tell it what you're working on. It tells you what came out today that matters — and why.

How it works

There are three commands you can run inside Claude Code. Each one is a multi-phase workflow that orchestrates Python scripts and parallel subagents.

/arxiv-daily [context_file] [--output path]     Fetch today's papers, score, analyze the best ones
/arxiv-search [context] "query"                 Search arXiv for a specific topic
/paper-analysis <paper> [--output path]         Deep-dive on one or more specific papers

The daily workflow is the main one. Here's what happens when you run it:

┌─────────────────────────────────────────────────────────────────────┐
│  /arxiv-daily ./my-project/README.md                                │
└──────────────────────────┬──────────────────────────────────────────┘
                           │
                           ▼
                ┌─────────────────────┐
                │  1. Fetch papers    │  arxiv_tool.py daily --context ...
                │     from arXiv API  │  → cache/2026-03-02.json
                └─────────┬───────────┘
                          │
                          ▼
                ┌─────────────────────┐
                │  2. Read your       │  Understand your research project
                │     context file    │  so scoring is project-specific
                └─────────┬───────────┘
                          │
                          ▼
                ┌─────────────────────┐
                │  3. Score & rank    │  Each paper gets 1-10
                │     every paper     │  based on relevance to your work
                └─────────┬───────────┘
                          │
                          ▼
                ┌─────────────────────┐
                │  4. Show you the    │  Top Picks (≥ threshold)
                │     digest          │  Worth a Look (threshold-3 .. threshold)
                └─────────┬───────────┘  Quick Scan (rest)
                          │
                          ▼
          ┌───────────────┴──────────────────────┐
          │  5. For each Top Pick, launch in     │
          │     parallel:                        │
          │  • paper-analyst subagent            │
          │  • podcast-generator subagent        │
          └───────────────┬──────────────────────┘
                          │
              ┌───────────┴───────────┐
              ▼                       ▼
    ┌─────────────────┐     ┌─────────────────┐
    │  Analysis       │     │  Podcast        │
    │  subagent       │     │  subagent       │
    │                 │     │                 │
    │  • Fetch full   │     │  • NotebookLM   │
    │    text         │     │    audio gen    │
    │  • Deep analyze │     │  • Context-     │
    │  • Write .tex   │     │    aware: skips │
    │  • Compile PDF  │     │    basics you   │
    └────────┬────────┘     │    already know │
             │              └────────┬────────┘
             ▼                       ▼
      paper-card-*.pdf      podcast-*.mp3
                          │
                          ▼
                ┌─────────────────────┐
                │  6. Save digest     │  digests/2026-03-02/
                │     + PDF report    │  ├── digest.md
                └─────────────────────┘  ├── paper-card-*.tex/.pdf
                                         └── digest-report.pdf
                                         podcasts/2026-03-02/
                                         └── podcast-*.mp3
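
The tiering in step 4 can be sketched in a few lines of Python. This is an illustration only, not the code the slash command runs: the function name `bucket_papers`, the default threshold of 7, and the choice to treat the "Worth a Look" range as inclusive at `threshold - 3` and exclusive at `threshold` are all assumptions.

```python
from typing import Dict, List


def bucket_papers(scored: Dict[str, int], threshold: int = 7) -> Dict[str, List[str]]:
    """Split {arxiv_id: score} into the three digest tiers (hypothetical helper)."""
    tiers: Dict[str, List[str]] = {"top_picks": [], "worth_a_look": [], "quick_scan": []}
    # Highest scores first; ties are broken by ID so the output is stable.
    for paper_id, score in sorted(scored.items(), key=lambda kv: (-kv[1], kv[0])):
        if score >= threshold:
            tiers["top_picks"].append(paper_id)
        elif score >= threshold - 3:  # assumed lower bound of "Worth a Look"
            tiers["worth_a_look"].append(paper_id)
        else:
            tiers["quick_scan"].append(paper_id)
    return tiers
```

Under these assumptions, only the IDs in `top_picks` would be handed to the analysis and podcast subagents.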

Setup

Install dependencies:

cd arxiv-digest
pip install -r requirements.txt

Configure your interests:

Edit config.yaml to set your arXiv categories, keywords, and a description of what you're working on:

categories:
  - cs.AI
  - cs.CL
  - cs.LG
  - stat.ML

max_papers: 100
lookback_days: 1

interests: |
  I work on efficient attention mechanisms and transformer architectures.
  Interested in: sparse attention, linear attention, long-context models...

keywords:
  - attention mechanism
  - transformer efficiency
  - long context

interests and keywords are your default scoring context — they're used when you run /arxiv-daily or /arxiv-search with no context file. If you pass one (e.g., /arxiv-daily ./my-project/README.md), that file takes over and these fields are ignored.
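
The override rule above can be sketched as follows. This is a minimal illustration that assumes the config has already been parsed into a dict; `resolve_scoring_context` is a hypothetical name, and the way interests and keywords are joined into one string is an assumption rather than the tool's actual format.

```python
from pathlib import Path
from typing import Optional


def resolve_scoring_context(config: dict, context_file: Optional[str] = None) -> str:
    """Return the text papers are scored against (hypothetical helper)."""
    if context_file:
        # A context file takes over: the config fields are ignored entirely.
        return Path(context_file).expanduser().read_text(encoding="utf-8")
    interests = config.get("interests", "").strip()
    keywords = config.get("keywords", [])
    parts = [interests] if interests else []
    if keywords:
        # Assumed formatting: keywords appended as a single comma-separated line.
        parts.append("Keywords: " + ", ".join(keywords))
    return "\n".join(parts)
```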

Usage

The recommended way to use this is through Claude Code slash commands. Open Claude Code in the arxiv-digest directory. The main command is:

/arxiv-daily

This fetches today's papers, scores each one 1-10 against your config interests, shows you a ranked digest, and for every Top Pick generates a 1-page LaTeX summary card + a short podcast episode. Cards land in digests/YYYY-MM-DD/, podcasts in podcasts/YYYY-MM-DD/. The relevance threshold is configurable via config.yaml or --threshold.
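
As a rough sketch of the date-stamped layout described above, the two directories for a run could be derived like this. The helper name `dated_dirs` and its `base` parameter are hypothetical; the tool's real path handling may differ.

```python
from datetime import date
from pathlib import Path
from typing import Tuple


def dated_dirs(day: date, base: str = ".") -> Tuple[Path, Path]:
    """Return (digest_dir, podcast_dir) for one daily run (hypothetical helper)."""
    stamp = day.isoformat()  # formats as YYYY-MM-DD
    root = Path(base).expanduser()
    return root / "digests" / stamp, root / "podcasts" / stamp
```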

To score against a specific project instead of your general interests:

/arxiv-daily ./path/to/your/project/details

To send output to a custom directory (paper cards and digests will go into <path>/YYYY-MM-DD/):

/arxiv-daily --output ~/research/paper-cards
/arxiv-daily ./my-project/README.md --output ~/Dropbox/papers

For example, to score against your thesis notes:

/arxiv-daily ~/Desktop/my-thesis/notes.tex

You can also search arXiv for a specific topic with /arxiv-search:

/arxiv-search "transformer scaling laws"

The LaTeX summary cards

Every paper card follows an identical template, defined in .claude/commands/paper-analysis.md. This makes them scannable and easy to collect over time.

The current template has the following structure:

Header — title, authors, year, venue, URL, code repo
Contribution — 2-3 sentences on what's novel and what gap it fills
Method — 2-4 sentences on the approach
Core Equation(s) — 1-3 key equations with variable annotations (or "N/A" for empirical papers)
Relevance to My Project — 2-3 bullets, specific and actionable
Limitations — 2-3 bullets, both acknowledged and apparent

Hard 1-page limit. Cards are compiled to PDF with xelatex (falls back to pdflatex).
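
One way to implement the engine fallback is sketched below. It assumes "falls back" means pdflatex is used when xelatex is not on PATH; the project may instead retry after a failed xelatex run. `pick_engine` and `compile_card` are hypothetical names.

```python
import shutil
import subprocess
from pathlib import Path
from typing import Callable, Optional

ENGINES = ("xelatex", "pdflatex")  # preferred engine first


def pick_engine(which: Callable[[str], Optional[str]] = shutil.which) -> Optional[str]:
    """Return the first LaTeX engine found on PATH, or None."""
    for engine in ENGINES:
        if which(engine):
            return engine
    return None


def compile_card(tex_path: str) -> Optional[Path]:
    """Compile a card to PDF; return the PDF path, or None on failure."""
    tex = Path(tex_path)
    engine = pick_engine()
    if engine is None:
        return None  # no LaTeX installed; the .tex file stays on disk
    result = subprocess.run(
        [engine, "-interaction=nonstopmode", "-halt-on-error", tex.name],
        cwd=tex.parent,  # run next to the .tex so the PDF lands beside it
        capture_output=True,
    )
    pdf = tex.with_suffix(".pdf")
    return pdf if result.returncode == 0 and pdf.exists() else None
```

Injecting `which` keeps the engine selection testable without a LaTeX installation.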

The Python CLI

arxiv_tool.py can also be used standalone. It outputs JSON — the Claude commands are what add scoring, analysis, and LaTeX generation on top.

python3 arxiv_tool.py daily                                        # fetch today's papers from your configured categories
python3 arxiv_tool.py daily --context ./README.md                  # use this file as scoring context instead of config interests
python3 arxiv_tool.py daily --output ~/research/cards              # store results in ~/research/cards/YYYY-MM-DD/
python3 arxiv_tool.py search "attention" --max 20                  # search arXiv by query, return up to 20 results
python3 arxiv_tool.py history --last 7                             # show papers fetched in the last 7 days
python3 arxiv_tool.py save --date 2026-03-02 --output ~/papers     # save digest to a custom directory

Notes

Deduplication: Papers are tracked in .history.jsonl by arxiv ID. You won't see the same paper twice across daily runs.
Weekend handling: Monday fetches look back 3 days automatically (arXiv doesn't publish on weekends).
LaTeX is required for cards: You need xelatex or pdflatex installed. On macOS: brew install --cask mactex-no-gui. The .tex files are always saved even if compilation fails.
Podcasts require NotebookLM: First-time setup needs a browser for Google auth:
```
playwright install chromium
notebooklm login
```
This opens a browser window to authenticate with your Google account. Credentials are stored locally. When a research context is provided, podcasts adapt to your expertise.
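
The deduplication and weekend-handling notes above can be sketched in Python as follows. This is a minimal illustration: it assumes each line of .history.jsonl is a JSON object with an `id` field and that papers are dicts with the same key, neither of which is confirmed by this README. Both function names are hypothetical.

```python
import json
from datetime import date
from pathlib import Path
from typing import Dict, List


def lookback_days(today: date, default: int = 1) -> int:
    """Use a 3-day window on Mondays to cover the weekend gap."""
    return 3 if today.weekday() == 0 else default  # Monday == 0


def filter_unseen(papers: List[Dict], history_path: str = ".history.jsonl") -> List[Dict]:
    """Drop papers already in the history file, then record the new ones."""
    path = Path(history_path)
    seen = set()
    if path.exists():
        for line in path.read_text(encoding="utf-8").splitlines():
            if line.strip():
                seen.add(json.loads(line)["id"])  # assumed record shape
    fresh = []
    for paper in papers:
        if paper["id"] not in seen:
            seen.add(paper["id"])
            fresh.append(paper)
    # Append-only log: one JSON object per line.
    with path.open("a", encoding="utf-8") as fh:
        for paper in fresh:
            fh.write(json.dumps({"id": paper["id"]}) + "\n")
    return fresh
```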
