Make sure you have Python 3.11 installed.
Create a virtual environment, activate it, install the dependencies, and add the project to the Python path:
python3 -m venv venv
source venv/bin/activate
pip install -r requirements.txt
export PYTHONPATH=${PYTHONPATH}:./Reproducing the exact results from the paper requires the following artifacts:
llm_cache.zipthe LLM API requests and responses, which you must unpack intodata/llm_cacheAITQA.zipthe AIT-QA dataset, with de-contextualized tables, which you must unpack intodata/AITQANQTables.zipthe NQ-Tables dataset, split into easy and hard questions, which you must unpack intodata/NQTables
To reproduce the results from the paper, run:
bash reproduce.shThe results are:
- Table 1 (Table question answering results):
data/tqa_AITQA.csvanddata/tqa_NQTables.csv - Table 2 (Table retrieval results):
data/retrieval_AITQA.csvanddata/retrieval_NQTables.csv - Table 3 (Zoom retrieval ablations):
data/zoom_AITQA.csvanddata/zoom_NQTables.csv - Table 4 (Prompt template ablations):
data/template_AITQA.csvanddata/template_NQTables.csv - Table 5 (CSV linearization ablations):
data/linear_AITQA.csvanddata/linear_NQTables.csv - Table 6 (Metadata during retrieval, not in the paper):
data/metadata_AITQA.csvanddata/metadata_NQTables.csv