P4ST4S/AutoScanlate-AI

🏯 Manga AI Translator

An automated, privacy-focused, GPU-accelerated pipeline to translate manga and comics locally.

This project aims to provide a full-stack solution (Frontend, Backend, and AI Worker) to detect text bubbles, perform OCR, translate contextually using LLMs, and typeset the result back into the original image—all without external APIs or recurring costs.

🏗️ Architecture

The project follows a microservices architecture so that heavy AI processing doesn't block the web server.

Architecture Diagram

🧩 Project Structure

| Module | Status | Description |
| --- | --- | --- |
| /ai-worker | ✅ v10.0 | The core Python engine. Handles computer vision, OCR, and LLM inference on GPU. |
| /backend-api | 🚧 Planned | High-performance API (Go/NestJS) to handle uploads, queues, and file serving. |
| /frontend | 🚧 Planned | Modern web UI (React) for drag-and-drop uploads and reading translated chapters. |

✨ Key Features (AI Worker V10)

The core engine is currently fully operational.

📊 Performance (RTX 2060 12 GB):

  • 29 pages/minute

  • ~1,700 pages/hour

  • Batch processing (.zip native)

  • ⚡ 100% Local & Uncensored: Powered by llama.cpp and Abliterated models. No moralizing, just translation.

  • 👁️ Smart Detection: Uses YOLOv8 fine-tuned on Manga109 to detect speech bubbles.

    • Smart Box Merging automatically consolidates fragmented vertical text bubbles.
  • 📖 Specialized OCR: Uses MangaOCR to handle vertical Japanese text and handwritten fonts.

  • 🧠 Context-Aware Translation:

    • Uses Qwen 2.5 7B (Instruction tuned).
    • Custom prompt engineering to handle "Subject-less" Japanese sentences.
    • "Anti-Thinking" regex filters to remove internal LLM monologues.
  • 🎨 Advanced Typesetting:

    • NEW (V10): Intelligent Masked Inpainting - Uses OpenCV threshold detection and cv2.inpaint to remove ONLY dark text pixels, preserving artwork and backgrounds even when bounding boxes overlap.
    • Pixel-Perfect Wrapping: Custom algorithm measuring exact pixel width of words to avoid overflow.
    • Sanitization: Filters out unsupported characters (emojis, math symbols) to prevent font rendering glitches.
  • 📦 Batch Processing: Native support for .zip archives (extract → translate → repack).

  • 🏗️ Modular Architecture: Clean, maintainable codebase with separation of concerns for easy customization and extension.
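The Smart Box Merging step above could be sketched like this. This is an illustrative heuristic, not the project's actual code: boxes are assumed to be `(x1, y1, x2, y2)` pixel tuples, and `merge_boxes`, `x_tol`, and `y_gap` are hypothetical names and tolerances.

```python
def same_column(a, b, x_tol=20, y_gap=30):
    """True if two boxes share a column and are close vertically."""
    ax1, ay1, ax2, ay2 = a
    bx1, by1, bx2, by2 = b
    # Require horizontal overlap (fragments of one vertical bubble)
    if ax2 + x_tol < bx1 or bx2 + x_tol < ax1:
        return False
    # Vertical gap between the boxes must be small (negative = overlapping)
    gap = max(ay1, by1) - min(ay2, by2)
    return gap <= y_gap

def merge_boxes(boxes, x_tol=20, y_gap=30):
    """Greedily union nearby vertical fragments until no merge applies."""
    boxes = list(boxes)
    merged = True
    while merged:
        merged = False
        out = []
        while boxes:
            cur = boxes.pop()
            for i, other in enumerate(boxes):
                if same_column(cur, other, x_tol, y_gap):
                    o = boxes.pop(i)
                    cur = (min(cur[0], o[0]), min(cur[1], o[1]),
                           max(cur[2], o[2]), max(cur[3], o[3]))
                    merged = True
                    break
            out.append(cur)
        boxes = out
    return boxes
```

Running this on two stacked fragments of one bubble plus one distant box collapses the fragments into a single union box and leaves the distant box alone.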
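The pixel-perfect wrapping and sanitization steps might look like the sketch below. `wrap_text` and `sanitize` are illustrative names, not the project's real API; in the actual pipeline the `measure` callable would be backed by Pillow's exact text measurement rather than the fake per-character width used in the example.

```python
import string

def wrap_text(text, max_width, measure):
    """Greedy word wrap using an exact pixel-width measure function.

    `measure(s)` returns the rendered width of `s` in pixels; a word
    that alone exceeds max_width still gets its own line.
    """
    lines, current = [], ""
    for word in text.split():
        candidate = word if not current else current + " " + word
        if measure(candidate) <= max_width or not current:
            current = candidate
        else:
            lines.append(current)
            current = word
    if current:
        lines.append(current)
    return lines

def sanitize(text, allowed=set(string.printable)):
    """Drop characters the target font cannot render (emojis, symbols)."""
    return "".join(ch for ch in text if ch in allowed)
```

For example, with a toy measure of 7 px per character and a 70 px bubble, `wrap_text("the quick brown fox jumps over", 70, lambda s: 7 * len(s))` yields three lines that each fit the width.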

📸 Examples

See the V10 intelligent masked inpainting in action! These examples showcase the ability to preserve artwork while cleanly removing text.

Example 1: Naruto

Original Naruto page

Original (Japanese)

Translated Naruto page

Translated (English)

Example 2: One Piece

Original One Piece page

Original (Japanese)

Translated One Piece page

Translated (English)

V10 Improvements Demonstrated:

  • Clean text removal without damaging background artwork
  • Preserved bubble borders and shading
  • Accurate text positioning and sizing
  • No artifacts in overlapping bubble regions

🚀 Getting Started (Worker Only)

Currently, you can run the worker as a CLI tool.

Prerequisites

  • NVIDIA GPU with 6GB+ VRAM (Recommended: 8GB+).
  • CUDA Toolkit 12.x installed.
  • Python 3.10+.

Setup

  1. Navigate to the worker directory:

     cd ai-worker

  2. Install dependencies (ensure CUDA support):

     pip install -r requirements.txt

     See the inner README for detailed llama-cpp-python compilation instructions.

  3. Run on an image or a zip file:

     python main.py ../my_manga_chapter.zip
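For reference, the extract → translate → repack flow behind the .zip mode can be sketched as below. `translate_archive` and `translate_page` are hypothetical stand-ins for the worker's real entry points; `translate_page(src, dst)` represents the full detection → OCR → translation → typesetting pipeline on one page.

```python
import tempfile
import zipfile
from pathlib import Path

def translate_archive(zip_path, translate_page, out_suffix="_translated"):
    """Extract a chapter .zip, run `translate_page` on each image, repack."""
    zip_path = Path(zip_path)
    out_zip = zip_path.with_name(zip_path.stem + out_suffix + ".zip")
    with tempfile.TemporaryDirectory() as tmp:
        src_dir = Path(tmp) / "src"
        dst_dir = Path(tmp) / "dst"
        dst_dir.mkdir()
        with zipfile.ZipFile(zip_path) as zf:
            zf.extractall(src_dir)
        with zipfile.ZipFile(out_zip, "w", zipfile.ZIP_DEFLATED) as zf:
            for page in sorted(src_dir.rglob("*")):
                # Only process image pages; skip metadata files
                if page.suffix.lower() not in {".png", ".jpg", ".jpeg", ".webp"}:
                    continue
                dst = dst_dir / page.name
                translate_page(page, dst)
                zf.write(dst, arcname=page.name)
    return out_zip
```

A chapter named `chap.zip` would come back as `chap_translated.zip` containing one translated image per input page.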

🗺️ Roadmap

  • ✅ Core AI Pipeline (Detection, OCR, Translation, Inpainting)
  • ✅ GPU Optimization (VRAM management, 4-bit quantization)
  • ✅ Smart Typesetting (Pixel wrapping, box merging)
  • ✅ Modular Code Architecture (Config, Services, Utils separation)
  • 🚧 Backend API (Go/NestJS setup, Redis integration)
  • 🚧 Frontend UI (React, File upload zone, Gallery)
  • 🚧 Docker Compose (One command deployment)

🤝 Credits

  • Models: Qwen (Alibaba Cloud), YOLOv8 (Ultralytics), MangaOCR (kha-white).
  • Tech: llama.cpp, PyTorch, Pillow.

Current Version: V10 (Stable) - Intelligent Masked Inpainting

See CHANGELOG for detailed version history.
