From 4744cc079706b193426c72b21ac5c3f88935e4b8 Mon Sep 17 00:00:00 2001 From: Aryan Rangapur <154872205+aryanrangapur@users.noreply.github.com> Date: Sun, 5 Oct 2025 12:02:02 +0530 Subject: [PATCH] Update README.md --- README.md | 1 + 1 file changed, 1 insertion(+) diff --git a/README.md b/README.md index 528b28f..d9ce54f 100644 --- a/README.md +++ b/README.md @@ -2,6 +2,7 @@ Welcome to Battle of LLMs, a project that evaluates the responses of various language models (LLMs) including ChatGPT-4, ChatGPT, Gemini, and Mistral. This project leverages conversational QA datasets from CoQA, DialFact, FaVIQ, and CoDAH for testing these LLMs, comparing their performance, and providing insights into their capabilities. +**PAPER: https://arxiv.org/abs/2405.18344** ## Table of Contents - [Introduction](#introduction)