VANESSA - Verifying the Steps of Deductive Reasoning Chains

Main use

python main.py TASK DATASET_NAME NLI_MODEL DATASET_VERSION

Datasets / Versions:

Tasks:

NLI_MODEL:

None (for parsing and consistency tasks. In consistency, will perform string matching)
Symbolic
Deberta
LLaMa3
Mistral
GPT 3.5 Turbo

Results are saved in results//_-<nli_model>-.jsonl

You can find our reported results in results/reasoning (for validity) and results/consistency (for groundedness)

results/analysis provides scripts to get metrics

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
cache		cache
entailment		entailment
reasoner		reasoner
results		results
tregex_parsing		tregex_parsing
.gitignore		.gitignore
Readme.md		Readme.md
coref_utils.py		coref_utils.py
entailment_model.py		entailment_model.py
hf_key		hf_key
main.py		main.py
negate.py		negate.py
oai_key		oai_key
requirements.txt		requirements.txt
utils.py		utils.py