This repository contains the code, models, and data accompanying our paper on adjusting text difficulty of messages generated by open-domain chatbots.
Tyen, G., Brenchley, M., Caines, A., & Buttery, P. (2022). Towards an open-domain chatbot for language practice. 17th Workshop on Innovative Use of NLP for Building Educational Applications.
- ParlAI 1.6.0
- PyTorch 1.10.2
- Huggingface Transformers 4.16.2
- NumPy 1.22.2
- SciPy 1.8.0
- Regex 2022.1.18
- Download and cd to project directory
git clone https://github.com/WHGTyen/ControllableComplexityChatbotcd ControllableComplexityChatbot - Install pip dependencies
pip install numpy scipy regex torch transformers - Clone ParlAI repository
git clone https://github.com/facebookresearch/ParlAI.git --branch 1.6.0 - Setup ParlAI
cd ParlAI; python setup.py develop; cd .. - To run the demo:
python demo.py - To adjust generation parameters, edit values in
demo.py
sample_wordlist.txtis the 5000 most frequent words from this listfilter.txtwas taken from this listcomplexity_modelwas trained on data from the Cambridge Exams readability dataset, found here
This paper reports on research supported by Cambridge University Press & Assessment. This work was performed using resources provided by the Cambridge Service for Data Driven Discovery (CSD3) operated by the University of Cambridge Research Computing Service, provided by Dell EMC and Intel using Tier-2 funding from the Engineering and Physical Sciences Research Council (capital grant EP/P020259/1), and DiRAC funding from the Science and Technology Facilities Council.