Make sure you have Python 3.11 installed.
Create a virtual environment, activate it, install the dependencies, and add the project to the Python path:
python3 -m venv venv
source venv/bin/activate
pip install -r requirements.txt
export PYTHONPATH=${PYTHONPATH}:./Reproducing the exact results from the paper requires the following artifacts:
- llm_cache.zipthe LLM API requests and responses, which you must unpack into- data/llm_cache
- AITQA.zipthe AIT-QA dataset, with de-contextualized tables, which you must unpack into- data/AITQA
- NQTables.zipthe NQ-Tables dataset, split into easy and hard questions, which you must unpack into- data/NQTables
To reproduce the results from the paper, run:
bash reproduce.shThe results are:
- Table 1 (Table question answering results): data/tqa_AITQA.csvanddata/tqa_NQTables.csv
- Table 2 (Table retrieval results): data/retrieval_AITQA.csvanddata/retrieval_NQTables.csv
- Table 3 (Zoom retrieval ablations): data/zoom_AITQA.csvanddata/zoom_NQTables.csv
- Table 4 (Prompt template ablations): data/template_AITQA.csvanddata/template_NQTables.csv
- Table 5 (CSV linearization ablations): data/linear_AITQA.csvanddata/linear_NQTables.csv
- Table 6 (Metadata during retrieval, not in the paper): data/metadata_AITQA.csvanddata/metadata_NQTables.csv