Skip to content

How can I predict on my own dataset? #16

@p-null

Description

@p-null

Suppose I have a document and a question, I'd like to get the answer span and answer string.

What steps should I take to get what I want?

(I tried to format it as multiqa format, that is like

js_obj = [{"id": "HotpotQA_5a85ea095542994775f606a8",
"context": {
  "documents":[{"text": "passage_sentences"
   }
              ]
        },
"qas":["question_sentence?"]}]

and dump it to test.gz and use predict like
python predict.py --model https://multiqa.s3.amazonaws.com/models/BERTBase/SQuAD1-1.tar.gz --dataset test.gz --dataset_name SQuAD --cuda_device 0

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions