This repo records the evolution of LM-based dialogue systems. We list works from each stage and update the list regularly. Feel free to raise an issue to add new works!
- Task-oriented Dialogue System (TOD)
  - Natural Language Understanding (NLU)
  - Dialogue State Tracking (DST)
  - Dialogue Policy Learning (DPL)
  - Natural Language Generation (NLG)
  - End-to-End TOD (E2E TOD)
- Open-domain Dialogue System (ODD)
- Unified Dialogue System (UniDS)
- LLM-based Dialogue System (Conversational Agent)
- 8 Nov 2024: We are happy to see a renaissance of task-oriented dialogue systems, which has inspired a lot of recent work, such as $\tau$-Bench and AppBench.
- A Survey of Language Model-based Dialogue System `TOD` `ODD` `PLM` `LLM` 🔥🔥🔥🔥🔥 We wrote a blog post for better understanding: click here.
- End-to-end Task-oriented Dialogue: A Survey of Tasks, Methods, and Future Directions `E2E TOD` `EMNLP 2023` 🔥🔥🔥
- Recent advances in deep learning based dialogue systems: a systematic survey `Artificial Intelligence Review 2023` 🔥🔥🔥
- A Survey on Recent Advances and Challenges in Reinforcement Learning Methods for Task-oriented Dialogue Policy Learning `DPL` `Machine Intelligence Research 2023` 🔥
- A Survey on Proactive Dialogue Systems: Problems, Methods, and Prospects `ODD` `IJCAI 2023` 🔥🔥
- Let's Negotiate! A Survey of Negotiation Dialogue Systems `ODD` `Arxiv 2022`
- Recent advances and challenges in task-oriented dialog systems `TOD` `SCTC 2020`
- Challenges in Building Intelligent Open-domain Dialog Systems `ODD` `TOIS 2020`
- A Survey on Dialogue Systems: Recent Advances and New Frontiers `TOD` `ODD` `SIGKDD 2017`
- INSCIT: Information-Seeking Conversations with Mixed-Initiative Interactions `TACL 2022`
- DuLeMon: Long Time No See! Open-Domain Conversation with Long-Term Persona Memory `ODD` `ACL 2022`
- FoCus: Call for Customized Conversation: Customized Conversation Grounding Persona and Knowledge `ODD` `AAAI 2022`
- SIMMC 2.0: Situated Interactive Multimodal Conversational AI `multi-modal`
- KdConv: A Chinese Multi-domain Dialogue Dataset Towards Multi-turn Knowledge-driven Conversation `ODD` `ACL 2020` [code]
- Eliza, Alice, GUS
- End-to-End Learning of Task-Oriented Dialogs `E2E TOD` `NAACL 2018` first E2E TOD
- Assigning Personality/Profile to a Chatting Machine for Coherent Conversation Generation `ODD` `IJCAI 2018`
- Improving Factual Consistency for Knowledge-Grounded Dialogue Systems via Knowledge Enhancement and Alignment `ODD` `EMNLP 2023` 🔥🔥
- Re3Dial: Retrieve, Reorganize and Rescale Conversations for Long-Turn Open-Domain Dialogue Pre-training `ODD` `EMNLP 2023` 🔥🔥🔥
- DiSTRICT: Dialogue State Tracking with Retriever Driven In-Context Tuning `DST` `EMNLP 2023` T5 model
- Well Begun is Half Done: Generator-agnostic Knowledge Pre-Selection for Knowledge-Grounded Dialogue `ODD` `EMNLP 2023` BART, T5
- Turn-Level Active Learning for Dialogue State Tracking `RL` `EMNLP 2023`
- JoTR: A Joint Transformer and Reinforcement Learning Framework for Dialog Policy Learning `DPL` `TOD` `Arxiv 2023`
- Retrieval-free Knowledge Injection through Multi-Document Traversal for Dialogue Models `ODD` `ACL 2023`
- Learning to Generate Prompts for Dialogue Generation through Reinforcement Learning `ODD` `Arxiv 2022`
- Integrating Pretrained Language Model for Dialogue Policy Evaluation `DPL` `TOD` `ICASSP 2022` 🔥🔥🔥 first work of RLAIF in DPL
- Personalized Dialogue Generation with Persona-Adaptive Attention `ODD` `AAAI 2023`
- Modularized Pre-Training for End-to-End Task-Oriented Dialogue `E2E TOD` `TASLP 2023`
- PPTOD: Multi-Task Pre-Training for Plug-and-Play Task-Oriented Dialogue System `E2E TOD` `ACL 2022`
- Soloist: Building Task Bots at Scale with Transfer Learning and Machine Teaching `E2E TOD` `TACL 2021`
- MOSS: End-to-End Dialog System Framework with Modular Supervision `AAAI 2020` first work for modular E2E TOD
- Dual-Feedback Knowledge Retrieval for Task-Oriented Dialogue Systems `EMNLP 2023` T5 model, related to Q-TOD
- Continual Dialogue State Tracking via Example-Guided Question Answering `EMNLP 2023` T5 model
- Enabling Semi-Structured Knowledge Access via a Question-Answering Module in Task-oriented Dialogue Systems `QA -> TOD` `CUI 2023`
- Q-TOD: A Query-driven Task-oriented Dialogue System `TOD -> ODD` `EMNLP 2022`
- UniDS: A Unified Dialogue System for Chit-Chat and Task-oriented Dialogues `ODD -> TOD` `DialDoc 2022`
- GODEL: Large-Scale Pre-Training for Goal-Directed Dialog `TOD -> ODD` `Arxiv 2022` [Code]
- LLaMA2-Chat Llama 2: Open Foundation and Fine-Tuned Chat Models `Arxiv 2023`
- Parrot: Enhancing Multi-Turn Chat Models by Learning to Ask Questions `Arxiv 2023` multi-turn instruction-tuning data construction
- Enhancing Chat Language Models by Scaling High-quality Instructional Conversations `EMNLP 2023`
- BlenderBot 3: a deployed conversational agent that continually learns to responsibly engage `Arxiv 2022`
- Pangu-Bot: Efficient Generative Dialogue Pre-training from Pre-trained Language Model `Arxiv 2022`
- Improving the Robustness of Knowledge-Grounded Dialogue via Contrastive Learning `ODD` targets noise in the input, such as in the KG or misspellings in the query
- Investigating Content Planning for Navigating Trade-offs in Knowledge-Grounded Dialogue content planning, similar to TPE
- COOPER: Coordinating Specialized Agents towards a Complex Dialogue Goal `AAAI 2024` multi-agent Cue-CoT?
- Reverse Multi-Choice Dialogue Commonsense Inference with Graph-of-Thought `AAAI 2024`
- Cue-CoT: Chain-of-thought Prompting for Responding to In-depth Dialogue Questions with LLMs `ODD` `EMNLP 2023` 🔥🔥🔥 linguistic cues
- Symbolic Planning and Code Generation for Grounded Dialogue `TOD` `EMNLP 2023` [code] interesting
- Scalable-DSC: A Structural Template Prompt Approach to Scalable Dialogue State Correction `EMNLP 2023`
- Mirages: On Anthropomorphism in Dialogue Systems `ODD` `EMNLP 2023` linguistic cues
- Prompt-Based Monte-Carlo Tree Search for Goal-Oriented Dialogue Policy Planning `TOD` `DPL` `EMNLP 2023`
- Prompting and Evaluating Large Language Models for Proactive Dialogues: Clarification, Target-guided, and Non-collaboration `ODD` `EMNLP 2023`
- Target-oriented Proactive Dialogue Systems with Personalization: Problem Formulation and Dataset Curation `ODD` `EMNLP 2023`
- Rethinking Conversational Agents in the Era of LLMs: Proactivity, Non-collaborativity, and Beyond `ODD` `SIGIR-AP 2023`
- EmoBench: Evaluating the Emotional Intelligence of Large Language Models
- E-CORE: Emotion Correlation Enhanced Empathetic Dialogue Generation `EMNLP 2023`
- Commonsense-Aware Prompting for Controllable Empathetic Dialogue Generation `ODD` `EMNLP 2023`
- Harnessing the Power of Large Language Models for Empathetic Response Generation: Empirical Investigations and Improvements
- $\tau$-bench: A Benchmark for Tool-Agent-User Interaction in Real-World Domains `language agent` `task-oriented ds` 🔥🔥🔥
- Hello Again! LLM-powered Personalized Agent for Long-term Dialogue using memory/persona as external sources
- SAFARI: Large Language Models as Source Planner for Personalized Knowledge-grounded Dialogues `ODD` `EMNLP 2023` 🔥🔥🔥🔥🔥 related work: Self-RAG, ToolkenGPT, RAG; dependency between different sources
- ChatCoT: Tool-augmented Chain-of-Thought Reasoning on Chat-based Large Language Models `EMNLP 2023`
- Towards LLM-driven Dialogue State Tracking `DST` `EMNLP 2023` instruction-tuning
- Multi-Source Multi-Type Knowledge Exploration and Exploitation for Dialogue Generation `EMNLP 2023`
- Plug-and-Play Policy Planner for Large Language Model Powered Dialogue Agents `Arxiv 2023`
- Reinforcement Learning for Optimizing RAG for Domain Chatbots `AAAI 2024 Workshop` uses RL to decide whether or not to retrieve for domain chatbots
- Are LLMs All You Need for Task-Oriented Dialogue? `TOD` `SIGDIAL 2023` all sub-tasks
- MemoChat: Tuning LLMs to Use Memos for Consistent Long-Range Open-Domain Conversation `ODD` `Arxiv 2023` [Code]
- Prompted LLMs as Chatbot Modules for Long Open-domain Conversation `ODD` `ACL 2023`
- PK-ICR: Persona-Knowledge Interactive Multi-Context Retrieval for Grounded Dialogue `EMNLP 2023` dependency between different sources
- Large Language Models Meet Harry Potter: A Dataset for Aligning Dialogue Agents with Characters `Dataset` `EMNLP 2023`
- Building Persona Consistent Dialogue Agents with Offline Reinforcement Learning `EMNLP 2023` 🔥 offline reinforcement learning
- What, When, and How to ground: Designing User Persona-Aware Conversational Agents for Engaging Dialogue `ACL 2023 Industry`
- Partner Personas Generation for Dialogue Response Generation `NAACL 2022` reinforcement learning
- Babel: Open Multilingual Large Language Models Serving Over 90% of Global Speakers
- Aya Expanse: Combining Research Breakthroughs for a New Multilingual Frontier
- Pangea: A Fully Open Multilingual Multimodal LLM for 39 Languages [code]
- A Systematic Study of Performance Disparities in Multilingual Task-Oriented Dialogue Systems `EMNLP 2023`
- xDial-Eval: A Multilingual Open-Domain Dialogue Evaluation Benchmark `EMNLP 2023`
- TPE: Towards Better Compositional Reasoning over Conceptual Tools with Multi-persona Collaboration `ODD` `Arxiv 2023` 🔥🔥🔥🔥🔥 language agent, tool learning
- Learning Retrieval Augmentation for Personalized Dialogue Generation `EMNLP 2023`
- TOD-Flow: Modeling the Structure of Task-Oriented Dialogues `EMNLP 2023`
- Enhancing Task-oriented Dialogue Systems with Generative Post-processing Networks `EMNLP 2023`
- Retrieval-Generation Alignment for End-to-End Task-Oriented Dialogue System `EMNLP 2023` T5 and ChatGPT as generator, related to Q-TOD, Dual-Feedback?
- Bootstrapping LLM-based Task-Oriented Dialogue Agents via Self-Talk `TOD` data augmentation
- Multi-User Chat Assistant (MUCA): a Framework Using LLMs to Facilitate Group Conversations
- Self-Directed Synthetic Dialogues and Revisions Technical Report
- ToolFlow: Boosting LLM Tool-Calling Through Natural and Coherent Dialogue Synthesis
- WHEELS: A conversational system in the automobile classifieds domain
- The JDDC Corpus: A Large-Scale Multi-Turn Chinese Dialogue Dataset for E-commerce Customer Service
- Artificial intelligence application in e-commerce: Transforming customer service, personalization and marketing
- National strategies on Artificial Intelligence
- Understanding the Determinants of Using Government AI-Chatbots by Citizens in Saudi Arabia
- https://www.tech.gov.sg/media/technews/govtech-team-behind-ask-jamie-government-chatbot
- https://insidegovuk.blog.gov.uk/2024/01/18/the-findings-of-our-first-generative-ai-experiment-gov-uk-chat/
- https://research.wealthfront.com/whitepapers/investment-methodology/
- Artificial intelligence in banking and financial services
- [Medical] Toward expert-level medical question answering with large language models
- [Education] https://www.khanmigo.ai/
- [Law] Lawyer GPT: A legal large language model with enhanced domain knowledge and reasoning capabilities
- AppBench: Planning of Multiple APIs from Various APPs for Complex User Instruction [code] 🔥🔥🔥🔥🔥
- $\tau$-bench: A Benchmark for Tool-Agent-User Interaction in Real-World Domains 🔥🔥
- [Law] https://donotpay.com/
- [Finance] BloombergGPT: A large language model for finance
- [Finance] FinRobot: an open-source AI agent platform for financial applications using LLMs
- [Software Development] ChatDev: Communicative Agents for Software Development
- [Coding] https://copilot.microsoft.com/
- [GUI Agent] GUI agents with foundation models: A comprehensive survey
- [Replika] https://replika.com/
- [Woebot] https://woebothealth.com/
- [Wysa] https://www.wysa.com/
- Toward large language models as a therapeutic tool: Comparing prompting techniques to improve GPT-delivered problem-solving therapy
- AUTOREPLY: Detecting Nonsense in Dialogue Introspectively with Discriminative Replies `EMNLP 2023`
- Instruct Once, Chat Consistently in Multiple Rounds: An Efficient Tuning Framework for Dialogue `Tuning Method`
- https://www.promptingguide.ai/papers [prompt engineering papers]
- https://github.com/iwangjian/Paper-Reading#knowledge-grounded-dialogue
- Beyond Candidates: Adaptive Dialogue Agent Utilizing Persona and Knowledge
- Are Personalized Stochastic Parrots More Dangerous? Evaluating Persona Biases in Dialogue Systems

If you find this repo useful, you are welcome to cite our survey paper:
@misc{wang2023survey,
title={A Survey of the Evolution of Language Model-Based Dialogue Systems},
author={Hongru Wang and Lingzhi Wang and Yiming Du and Liang Chen and Jingyan Zhou and Yufei Wang and Kam-Fai Wong},
year={2023},
eprint={2311.16789},
archivePrefix={arXiv},
primaryClass={cs.CL}
}

