OCRs end credits from video files and renames them using Google’s Gemini models. RollCall samples the tail of each video, extracts and aggregates film credits, then uses Gemini to determine the title of the media.
Requirements
- Python ≥ 3.10
- FFmpeg (
ffmpeg,ffprobe) on yourPATH - A Gemini API key (
GEMINI_API_KEYpreferred;GOOGLE_API_KEYalso supported)
Steps
# from project root (where pyproject.toml lives)
python -m venv .venv
source .venv/bin/activate # Windows: .venv\Scripts\activate
python -m pip install -U pip wheel
python -m pip install -e .
# set your key (bash/zsh)
export GEMINI_API_KEY="your-key-here"
# PowerShell:
# $env:GEMINI_API_KEY="your-key-here"Dry run (no changes):
rlcl --dry-run /path/to/mediaRename for real:
rlcl /path/to/mediaUseful options:
# sample more frames (slower, higher recall)
rlcl --fps "1/2" /path/to/media
# add a small delay between OCR calls
rlcl --ocr-delay 0.2 /path/to/media
# disable grounded web search (local refine only)
rlcl --no-search /path/to/media
# override the model/API key for this run
rlcl --model gemini-2.5-flash --api-key "$GEMINI_API_KEY" /path/to/mediaNotes
- Supports
.mp4,.mkv,.avi,.mov. - Project is unreleased and subject to change.