Download the Flickr8k dataset and unzip it into a flickr8k directory, then run the following commands:
python3 -m venv venv
. venv/bin/activate
pip install -r requirements.txt
python -m spacy download en_core_web_sm
python main.py
We provide both LSTM-based and GRU-based models; see model.py and model_gru.py, respectively.
Per-epoch loss data is stored as CSV files in the results/ directory. We've included
a Jupyter notebook (.ipynb) for each model to analyze various metrics and run inference:
resnext_gru_eval_3_layer.ipynb
resnext_lstm_eval_single_layer.ipynb
resnext_lstm_eval_3_layer.ipynb
resnext_gru_eval_single_layer.ipynb
We also have a notebook comparing the epoch losses of each model in model_comparison_graphs.ipynb.
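For a quick look at the loss data outside the notebooks, something like the following might work. It is only a sketch: the helper name summarize_losses and the column name "loss" are assumptions made for illustration and are not part of this repository, so adjust them to match the actual CSV headers.

```python
import csv
from pathlib import Path


def summarize_losses(results_dir="results", loss_column="loss"):
    """Return {csv file name: (final loss, lowest loss)} for each CSV file.

    `loss_column` is an assumed header name; change it to match the real files.
    """
    summary = {}
    for path in sorted(Path(results_dir).glob("*.csv")):
        with path.open(newline="") as f:
            # Read the assumed loss column from every row as a float.
            losses = [float(row[loss_column]) for row in csv.DictReader(f)]
        if losses:  # skip files that contain a header but no data rows
            summary[path.name] = (losses[-1], min(losses))
    return summary
```

Calling summarize_losses() from the repository root and printing the returned dictionary gives the final and lowest loss for each file, which is a rough first comparison before opening the notebooks.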
eval.ipynb is provided as a reference template notebook for evaluating a model.
NOTE: The .pt model weight files are available upon request. They are NOT included
in this repository because of their size.