dialogue_train_data

基于知识库的对话系统，训练数据，已经初步处理，可以直接使用。

介绍

本项目提供了一个基于知识库的对话系统的训练数据。
该数据是已经经过清洗、处理和构造，能直接用于训练“主题模型”，“知识匹配模型”和“”对话生成模型。
上述模型详见论文 Key Factors of Knowledge-driven Conversation System。

样例

{'history_utt': ['知道重庆森林这部电影吗？'], 'response': '知道呀，是一部由王家卫导演的片子', 'knowledge': '群众类型剧情', 'label': 0, 'mention': -100}
{'history_utt': ['知道重庆森林这部电影吗？'], 'response': '知道呀，是一部由王家卫导演的片子', 'knowledge': 'Crash类型剧情', 'label': 0, 'mention': -100}
{'history_utt': ['知道重庆森林这部电影吗？', '知道呀，是一部由王家卫导演的片子'], 'response': '而主演里更是有王菲，一上映便受到追捧', 'knowledge': '重庆森林主演王菲', 'label': 1, 'mention': '重庆森林'}
{'history_utt': ['知道重庆森林这部电影吗？', '知道呀，是一部由王家卫导演的片子'], 'response': '而主演里更是有王菲，一上映便受到追捧', 'knowledge': '重庆森林类型剧情，文艺', 'label': 0, 'mention': -100}

说明

history_utt：为对话历史。

response：下一句的合适回复。

knowledge：候选知识。

label：0 表示该条候选知识为负样本；1 表示该候选知识为正样本。

mention：对话主题，当为 -100，表示候选知识不正确，故不需要主题。

原始数据来源。

原始数据为 KdConv，更多数据的详细信息请见该描述
若有侵权，请联系作者删除。

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
kb		kb
train		train
valid		valid
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

dialogue_train_data

介绍

样例

说明

原始数据来源。

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Folders and files

Latest commit

History

Repository files navigation

dialogue_train_data

介绍

样例

说明

原始数据来源。

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Packages