sharadbachani-oss

sharadbachani-oss

Popular repositories Loading

TruthfulQA TruthfulQA Public

Forked from sylinrl/TruthfulQA

TruthfulQA: Measuring How Models Imitate Human Falsehoods

Jupyter Notebook
s21mind s21mind Public

We split TruthfulQA into linguistically-detectable vs knowledge-required hallucinations. Result: 91.92% accuracy with ZERO parameters on the pattern-detectable subset. This establishes the first to…

Python
HexaMind HexaMind Public

We split TruthfulQA into linguistically-detectable vs knowledge-required hallucinations. Result: 91.92% accuracy with ZERO parameters on the pattern-detectable subset. This establishes the first to…

Python
alpaca_eval alpaca_eval Public

Forked from tatsu-lab/alpaca_eval

An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.

Jupyter Notebook
hexagene hexagene Public

A Gene Decoding Engine

Python
s21theory s21theory Public