SavvyLingua is a Dockerized Flask web application that allows users to upload documents in underpreserved language (Jeju), extract text using Tesseract OCR, and store results in a database. The extracted text can be viewed and downloaded as JSON.
- Upload
.png,.jpg,.jpeg,.pdffiles - Extract text using OCR
- Store files and OCR results in database
- View public documents
- Download OCR results as
.json
git clone https://github.com/Nitophera/SavvyLingua
cd SavvyLinguadocker-compose up --build