-
Download Anaconda Installer
- Visit the Anaconda download page.
- Choose the version for your operating system (Windows, macOS, or Linux).
-
Run the Installer
- Windows: Double-click the
.exefile. - macOS: Open the
.pkgfile. - Linux: Run
bash <Anaconda-installer-script.sh>in the terminal.
- Windows: Double-click the
-
Complete the Installation
- Follow the prompts. Select "Add Anaconda to my PATH environment variable" for ease of use.
-
Open Anaconda Navigator
- Find Anaconda Navigator in your applications.
-
Launch Jupyter Notebook
- Click 'Launch' under Jupyter Notebook in Anaconda Navigator.
-
Open Anaconda Prompt/Terminal
- Windows: Use Anaconda Prompt.
- macOS/Linux: Use the terminal.
-
Create a New Environment
- Run
conda create -n myenv python=3.7, where myenv is the name of your new environment..
- Run
-
Activate the Environment
- Run
conda activate myenv.
- Run
- Install Packages from
requirements.txt- Activate your environment (
conda activate myenv). - Navigate to your
requirements.txtfile (cd /path/to/directory). - Run
pip install -r requirements.txt.
- Activate your environment (
- To use pre-trained models for detecting poems on newspaper pages, run the notebook
detect_poem_page.ipynb. We recommend to use the LeNet-9 model.
- Step 1: Start by segmenting newspaper pages into individual snippet images using
segment_page.ipynb. - Step 2: Categorize the segmented snippet images into two groups:
- Images containing poems.
- Images not containing poems.
- Step 3: Prepare lists of training and validation snippet images. It is advisable to use a ratio of 9:1 for this split.
- Step 4: Perform fine-tuning on the models using
fine_tune.ipynb. - Step 5: For poem detection using the fine-tuned models, run
detect_poem_page.ipynband modify the setting "finetuned" toTrue.
This project is part of the Aida project (projectaida.org) for which license information can be found on https://github.com/ProjectAida/aida.
- Liu, Yi, Leen-Kiat Soh, and Elizabeth Lorang. "Investigating coupling preprocessing with shallow and deep convolutional neural networks in document image classification." Journal of Electronic Imaging 30, no. 4 (2021): 043024-043024.
- Soh, Leen-Kiat, Elizabeth Lorang, and Yi Liu. "Aida: intelligent image analysis to automatically detect poems in digital archives of historic newspapers." In Proceedings of the AAAI Conference on Artificial Intelligence, vol. 32, no. 1. 2018.
Code in files, datagen_image.py and model_factory.py, is created by Yi Liu (email:yil at unl.edu) at Aida team at University of Nebraska-Lincoln. All rights are reserved.