docs: document convert command and GPU pre-convert workflow #20

Merged
michaelalber merged 1 commit into main from
docs/document-convert-command
Apr 27, 2026

@michaelalber
Owner

Summary

  • Adds convert command to architecture diagram as an optional GPU pre-processing step
  • Documents all six CLI options, including --collection, --force, --dry-run, --no-ocr, and the positional path argument
  • Explains sidecar workflow: convert once on GPU; ingest reads sidecars and skips Docling
  • Documents optional flash-attn / cuda_use_flash_attention2 acceleration for Ampere+ GPUs
  • Adds two troubleshooting rows: per-file crash isolation and slow conversion without flash-attn
  • Updates architecture prose from two-process to three-process description
  • Adds two feature bullets: GPU-accelerated pre-convert and crash-isolated batch conversion
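
For reviewers unfamiliar with the sidecar idea, the workflow above can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the project's implementation: the `.converted.md` suffix, the function names, and the `run_converter` callable (a stand-in for the Docling conversion step) are all hypothetical, and `force` only mirrors the spirit of the `--force` flag.

```python
from pathlib import Path
from typing import Callable

# Hypothetical sidecar suffix; the real project may use a different naming scheme.
SIDECAR_SUFFIX = ".converted.md"


def sidecar_path(source: Path) -> Path:
    """Return the sidecar location that sits next to the source file."""
    return source.with_name(source.name + SIDECAR_SUFFIX)


def convert(source: Path, run_converter: Callable[[Path], str], force: bool = False) -> bool:
    """Write a sidecar for `source`; return True if the converter actually ran.

    An existing sidecar is reused unless `force` is set.
    """
    target = sidecar_path(source)
    if target.exists() and not force:
        return False
    target.write_text(run_converter(source), encoding="utf-8")
    return True


def load_for_ingest(source: Path, run_converter: Callable[[Path], str]) -> str:
    """Prefer the sidecar at ingest time; fall back to converting on the spot."""
    target = sidecar_path(source)
    if target.exists():
        return target.read_text(encoding="utf-8")
    return run_converter(source)
```

In this sketch the expensive conversion runs once (for example on a GPU machine), and later ingest runs read the sidecar text instead of invoking the converter again.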

Test plan

  • Verify rendered Markdown looks correct on GitHub (architecture diagram ASCII art, code blocks)
  • Confirm all convert CLI flags match __main__.py implementation

🤖 Generated with Claude Code

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
@michaelalber michaelalber merged commit d892c8f into main Apr 27, 2026
10 checks passed
@michaelalber michaelalber deleted the docs/document-convert-command branch April 27, 2026 00:00

1 participant