# hn-daily

Hacker News daily digest fetcher built on Jina Reader and crawl4ai. It fetches yesterday's top stories, crawls the article content and comments, and saves everything to markdown.

## Features
- Fetches top stories from Hacker News (yesterday) via Algolia API (default 10, configurable)
- For external URLs, fetches article markdown via Jina Reader first, falling back to local crawling
- Crawls story content and comments using crawl4ai
- Saves story markdown files to `drafts/` (configurable via `--output`)
- Daily digest posts are stored in `daily/` as `daily/YYYY/MM/YYYY-MM-DD.md` for the Hugo site
- Rich CLI output with progress tracking
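The story fetch above can be sketched against the public HN Algolia search API. The function and parameter names here (`fetch_top_stories`, `build_params`, `yesterday_range`) are illustrative, not the project's actual code:

```python
import json
from datetime import datetime, timedelta, timezone
from urllib.parse import urlencode
from urllib.request import urlopen

ALGOLIA_SEARCH_URL = "https://hn.algolia.com/api/v1/search"

def yesterday_range(today=None):
    """Return (start, end) unix timestamps covering yesterday, in UTC."""
    today = today or datetime.now(timezone.utc)
    start = (today - timedelta(days=1)).replace(hour=0, minute=0, second=0, microsecond=0)
    end = start + timedelta(days=1)
    return int(start.timestamp()), int(end.timestamp())

def build_params(limit=10, today=None):
    """Query params for yesterday's stories, ranked by Algolia relevance/points."""
    start, end = yesterday_range(today)
    return {
        "tags": "story",
        "numericFilters": f"created_at_i>={start},created_at_i<{end}",
        "hitsPerPage": limit,
    }

def fetch_top_stories(limit=10):
    """Fetch yesterday's top stories; each hit carries title, url, points, author."""
    url = f"{ALGOLIA_SEARCH_URL}?{urlencode(build_params(limit))}"
    with urlopen(url, timeout=30) as resp:
        return json.load(resp)["hits"]
```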
## Installation

```bash
pip install -r requirements.txt
python -m playwright install chromium
```
## Usage

```bash
# Optional: improve Reader throughput and limits
export JINA_API_KEY=your_api_key

# Run with defaults (yesterday's top stories)
python -m hn_daily

# With options
python -m hn_daily --date 2025-01-19 --limit 15 --output my_drafts
```

Markdown files are saved to `drafts/` with the format:

```
{story_title}_{YYYYMMDD}.md
```
Each file contains:
- Story metadata (author, points, URL, date)
- Crawled content
- Comments section
## Project Structure

```
hn-daily/
├── hn_daily/
│   ├── cli.py                  # CLI entry point
│   ├── models.py               # Story, Comment, CrawlResult
│   └── services/
│       ├── story_service.py    # Fetch from HN API
│       ├── comment_service.py  # Fetch comments
│       ├── crawler_service.py  # crawl4ai integration
│       └── storage_service.py  # Save to markdown
├── tests/
├── drafts/
├── requirements.txt
└── pyproject.toml
```
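The Reader-first, crawl-locally-on-failure flow in `crawler_service.py` might look like the sketch below. It assumes Jina Reader's `https://r.jina.ai/<url>` endpoint with an optional `Authorization: Bearer` header, and crawl4ai's `AsyncWebCrawler.arun`; the function names are illustrative:

```python
import os
import urllib.request

def jina_reader_request(url: str) -> urllib.request.Request:
    """Build a Jina Reader request; the API key header is optional."""
    headers = {}
    api_key = os.environ.get("JINA_API_KEY")
    if api_key:
        headers["Authorization"] = f"Bearer {api_key}"
    return urllib.request.Request(f"https://r.jina.ai/{url}", headers=headers)

def fetch_via_jina(url: str, timeout: int = 60) -> str:
    # Reader returns the page as plain-text markdown
    with urllib.request.urlopen(jina_reader_request(url), timeout=timeout) as resp:
        return resp.read().decode("utf-8")

async def fetch_markdown(url: str) -> str:
    """Try Jina Reader first, then fall back to crawling locally with crawl4ai."""
    try:
        return fetch_via_jina(url)
    except Exception:
        # Local fallback; requires the Playwright chromium install from above
        from crawl4ai import AsyncWebCrawler
        async with AsyncWebCrawler() as crawler:
            result = await crawler.arun(url=url)
            return result.markdown
```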
## Requirements

- Python 3.10+
- Playwright browsers (`python -m playwright install chromium`)