Task-Based Conversational Agent Using Seq2Seq and Retrieval

Implementation

A chatbot based on recurrent neural networks (RNN). Encoder-decoder long short-term memory (LSTM) model has been used. The Encoder-Decoder LSTM is a recurrent neural network designed to address sequence-to-sequence problems. The chatbot has been built with Keras (TensorFlow backened).

This work is advised by Prof. Ashwin Srinivasan, BITS Pilani. K. K. Birla Goa Campus.

Execution

Run the file preprocessing.py to preprocess the training data. This will also create three pickle files for training, testing and both.
Run train.py to train the model. This will also save the model in a .hdf5 format.
Run test.py for an instance of the chatbot.

Dependencies include:

numpy
keras
tensorflow

Dataset generation

There is no publicly available dataset for the task of a library-based conversational agent or for the case of book recomenndation systems either. Hence the dataset is generated from a template of variations of natural language which can be used when a student interacts with such a conversational agent.

Weights based on a random-normal distribution are applied to the options for a particular conversation to make sure that the dataset isnt balanced.

For example for greetings, the weights are divided between “Hello, how is it going”, “Good day. How can I assist you”, “Hi, what do you need help with”, etc. Similar process is followed for the user’s responses to the bot, such as variations of “May I have a book in networks”, “I am looking for some help in Machine Learning”, “Can you suggest something in Computer Architecture”, etc to model it as close to natural language as possible and cover all possibilities of how a user may interact with a bot.

The bot is currently trained on 14 topics from the ACM CS subjects which it is able to correctly predict.

An example iteration of the chatbot.

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
Images		Images
01_preprocessing.py		01_preprocessing.py
02_training.py		02_training.py
03_chatting.py		03_chatting.py
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Task-Based Conversational Agent Using Seq2Seq and Retrieval

Implementation

Execution

Dependencies include:

Dataset generation

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Task-Based Conversational Agent Using Seq2Seq and Retrieval

Implementation

Execution

Dependencies include:

Dataset generation

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages