
Minor fixes to run some of your codes #3

Open

yoshitomo-matsubara wants to merge 6 commits into huminghao16:master from yoshitomo-matsubara:master

Conversation


yoshitomo-matsubara commented Jul 11, 2019

Hello,

First of all, thank you for sharing your code!

I tried to reproduce the results in your paper and found that there are two different "data" folders (one in the user's home directory, one at the same level as the src folder), which caused errors such as FileNotFoundError.
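
For anyone hitting the same FileNotFoundError before these fixes are merged, one workaround is to consolidate the two folders by hand. A minimal sketch of the idea (the directory names here are stand-ins, not the repo's exact layout):

```shell
# Stand-in layout: "repo/data" plays the role of the data folder next to
# src that the code should use; "home_data" plays the role of the stray
# copy in the user's home directory.
mkdir -p repo/src repo/data
mkdir -p home_data/squad
touch home_data/squad/train-v1.1.json

# Merge the stray copy into the single data folder next to src,
# then drop the stray directory so only one "data" folder remains.
cp -r home_data/. repo/data/
rm -r home_data

ls repo/data/squad
```

After the merge, only the data folder next to src exists, which matches the layout the fixed scripts below expect.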

With my minor fixes (including to README.md) and the following scripts, there is only one data folder (at the same level as the src folder), and the SQuAD-doc, TriviaQA-wiki, and TriviaQA-open tasks can be completed.

# Python 3.6 and torch==1.1.0 were used
export DATA_DIR=data/squad
export BERT_DIR=bert-base-uncased

python -m bert.run_squad_document_full_e2e \
  --vocab_file $BERT_DIR/vocab.txt \
  --bert_config_file $BERT_DIR/bert_config.json \
  --init_checkpoint $BERT_DIR/pytorch_model.bin \
  --do_train \
  --do_predict \
  --data_dir $DATA_DIR \
  --train_file train-v1.1.json \
  --predict_file dev-v1.1.json \
  --train_batch_size 16 \
  --learning_rate 3e-5 \
  --num_train_epochs 2.0 \
  --output_dir out/squad_doc/01

python -m bert.run_squad_document_full_e2e \
  --vocab_file $BERT_DIR/vocab.txt \
  --bert_config_file $BERT_DIR/bert_config.json \
  --do_predict_open \
  --data_dir $DATA_DIR \
  --output_dir out/squad_doc/01

python -m triviaqa.evidence_corpus --n_processes 8 --max_tokens 200
python -m triviaqa.build_span_corpus wiki --n_processes 8
python -m triviaqa.build_span_corpus unfiltered --n_processes 8

python -m triviaqa.ablate_triviaqa_wiki --n_processes 8 --n_para_train 12 --n_para_dev 14 --n_para_test 14 --do_train --do_dev --do_test
python -m triviaqa.ablate_triviaqa_unfiltered --n_processes 8 --n_para_train 12 --n_para_dev 14 --n_para_test 14 --do_train --do_dev --do_test
cp data/triviaqa/qa/wikipedia-dev.json data/triviaqa/wiki/
cp data/triviaqa-unfiltered/unfiltered-web-dev.json data/triviaqa/unfiltered/

export DATA_DIR=data/triviaqa/wiki
export BERT_DIR=bert-base-uncased

python -m bert.run_triviaqa_wiki_full_e2e  \
  --vocab_file $BERT_DIR/vocab.txt \
  --bert_config_file $BERT_DIR/bert_config.json \
  --init_checkpoint $BERT_DIR/pytorch_model.bin \
  --do_train \
  --do_dev \
  --data_dir $DATA_DIR \
  --train_batch_size 16 \
  --learning_rate 3e-5 \
  --num_train_epochs 2.0 \
  --output_dir out/triviaqa_wiki/01

export DATA_DIR=data/triviaqa/unfiltered
export BERT_DIR=bert-base-uncased

python -m bert.run_triviaqa_wiki_full_e2e  \
  --vocab_file $BERT_DIR/vocab.txt \
  --bert_config_file $BERT_DIR/bert_config.json \
  --init_checkpoint $BERT_DIR/pytorch_model.bin \
  --do_train \
  --do_dev \
  --data_dir $DATA_DIR \
  --dev_file unfiltered-web-dev.json \
  --train_batch_size 16 \
  --learning_rate 3e-5 \
  --num_train_epochs 2.0 \
  --output_dir out/triviaqa_unfiltered/01

Note: At first I set train_batch_size to 32 for both models as suggested, but ran into a CUDA out-of-memory error even with 4 GPUs (16 GB of video memory each), so I used 16 instead.
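
If the script happens to support a --gradient_accumulation_steps flag, as the run_squad.py in pytorch-pretrained-bert does (this is an assumption about this repo; check the script's argparse options first), the effective batch of 32 could be kept on the same hardware by accumulating gradients, e.g. for the SQuAD-doc run:

```shell
# Assumption: --gradient_accumulation_steps is supported, as in
# pytorch-pretrained-bert's run_squad.py, where train_batch_size is
# divided by the accumulation steps internally. With 2 steps the
# per-step batch is 16, roughly halving memory use, while the
# effective batch stays 32.
python -m bert.run_squad_document_full_e2e \
  --vocab_file $BERT_DIR/vocab.txt \
  --bert_config_file $BERT_DIR/bert_config.json \
  --init_checkpoint $BERT_DIR/pytorch_model.bin \
  --do_train \
  --do_predict \
  --data_dir $DATA_DIR \
  --train_file train-v1.1.json \
  --predict_file dev-v1.1.json \
  --train_batch_size 32 \
  --gradient_accumulation_steps 2 \
  --learning_rate 3e-5 \
  --num_train_epochs 2.0 \
  --output_dir out/squad_doc/01
```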

Regardless of my minor fixes, I ran into a different error for SQuAD-open that I couldn't fix quickly.
I will open issues with some additional questions and put the links here later.

Thank you!

@Arjunsankarlal

Hey there @yoshitomo-matsubara, that's great work! It would be very helpful if you could upload the trained models.

yoshitomo-matsubara (Author)

> Hey there @yoshitomo-matsubara, that's great work! It would be very helpful if you could upload the trained models.

Hi @Arjunsankarlal
I don't have trained models that reproduce the reported results, sorry.

Hi @huminghao16
#4 and #5 are my additional questions, thanks!
