A decoder-only, autoregressive transformer language model based on the GPT-2 architecture. A CS 224N final project.
Poster:
- Set up a Conda environment:

```bash
# set up the environment
conda env create -f env.yml
conda activate cs224n_dfp

# install dependencies
pip install -r requirements.txt
```

- Featured use-cases:
- Sentiment analysis
- Paraphrase detection (via cloze-style classification; see the sketch after this list)
- Sonnet generation (via autoregressive LM)
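
Cloze-style classification turns paraphrase detection into next-token prediction: both sentences are packed into a natural-language prompt, and the model's logits for candidate answer tokens are compared. Below is a minimal sketch using the off-the-shelf Hugging Face GPT-2 as a stand-in for the from-scratch model; the prompt template and answer tokens are illustrative assumptions, not this repo's exact setup:

```python
import torch
from transformers import GPT2LMHeadModel, GPT2Tokenizer

tokenizer = GPT2Tokenizer.from_pretrained("gpt2")
model = GPT2LMHeadModel.from_pretrained("gpt2").eval()

def is_paraphrase(s1: str, s2: str) -> bool:
    # Cloze-style prompt: the model "fills in" the final answer token.
    # This template is a hypothetical example, not the project's actual prompt.
    prompt = (f'Question 1: "{s1}"\nQuestion 2: "{s2}"\n'
              'Are these questions asking the same thing? ')
    ids = tokenizer(prompt, return_tensors="pt").input_ids
    with torch.no_grad():
        logits = model(ids).logits[0, -1]  # next-token logits at the cloze position
    # Compare the scores of the two candidate answer tokens.
    yes_id = tokenizer.encode(" yes")[0]
    no_id = tokenizer.encode(" no")[0]
    return bool(logits[yes_id] > logits[no_id])

print(is_paraphrase("How do I learn Python?", "What is the best way to learn Python?"))
```
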
Run training and generation for each task:

```bash
# sentiment classification (full model)
python classifier.py --fine-tune-mode full-model --batch_size 128 --lr 1e-5 --hidden_dropout_prob 0.1 --epochs 10

# sentiment classification (last linear layer)
python classifier.py --fine-tune-mode last-linear-layer --batch_size 64 --lr 1e-3 --hidden_dropout_prob 0.1 --epochs 10

# paraphrase detection
python paraphrase_detection.py --use_gpu

# sonnet generation (dev)
python sonnet_generation.py --use_gpu --held_out_sonnet_path data/sonnets_held_out_dev.txt --sonnet_out predictions/generated_sonnets_dev.txt
```
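
Under the hood, sonnet generation is plain autoregressive decoding: sample a token, append it to the context, repeat. A minimal temperature-sampling loop, again with the Hugging Face GPT-2 as a stand-in (the decoding hyperparameters are illustrative, not the project's actual settings):

```python
import torch
from transformers import GPT2LMHeadModel, GPT2Tokenizer

tokenizer = GPT2Tokenizer.from_pretrained("gpt2")
model = GPT2LMHeadModel.from_pretrained("gpt2").eval()

def generate(prompt: str, max_new_tokens: int = 100, temperature: float = 0.9) -> str:
    ids = tokenizer(prompt, return_tensors="pt").input_ids
    for _ in range(max_new_tokens):
        with torch.no_grad():
            logits = model(ids).logits[0, -1]            # next-token logits
        probs = torch.softmax(logits / temperature, -1)  # temperature-scaled distribution
        next_id = torch.multinomial(probs, 1)            # sample one token
        ids = torch.cat([ids, next_id.view(1, 1)], dim=1)
    return tokenizer.decode(ids[0])

print(generate("Shall I compare thee to a summer's day?\n"))
```
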
```bash
# sonnet generation (with DPO)
python sonnet_generation.py --use_gpu --held_out_sonnet_path data/sonnets_held_out_dev.txt --sonnet_out predictions/generated_sonnets_dev.txt --dpo_mode
```
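
`--dpo_mode` refers to Direct Preference Optimization, which fine-tunes the policy directly on (preferred, dispreferred) pairs without training a separate reward model. A sketch of the standard DPO loss; the tensor interface here is an assumption for illustration, not this repo's actual API:

```python
import torch
import torch.nn.functional as F

def dpo_loss(policy_chosen_logps, policy_rejected_logps,
             ref_chosen_logps, ref_rejected_logps, beta=0.1):
    """Standard DPO objective: -log sigmoid(beta * (policy margin - reference margin)).

    Each argument is a tensor of summed token log-probs for a batch of
    (chosen, rejected) pairs under the policy / frozen reference model.
    """
    policy_margin = policy_chosen_logps - policy_rejected_logps
    ref_margin = ref_chosen_logps - ref_rejected_logps
    return -F.logsigmoid(beta * (policy_margin - ref_margin)).mean()

# toy usage with random log-probs for a batch of 4 pairs
print(dpo_loss(torch.randn(4), torch.randn(4), torch.randn(4), torch.randn(4)))
```
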
```bash
# sonnet generation (test)
python sonnet_generation.py --use_gpu
```

GPT-2 is a large, transformer-based language model that generates text by predicting the next word given the preceding context. We build a smaller version of GPT-2 from scratch, focusing on its core architecture: multi-head self-attention, position-wise feed-forward networks, and byte-pair encoding for tokenization. The model is designed for both generative and classification tasks.
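
For reference, the heart of that architecture is causal multi-head self-attention. A condensed PyTorch sketch of one attention block, using standard GPT-2 conventions (dimension names and the masking style are generic, not necessarily this repo's exact code):

```python
import math
import torch
import torch.nn as nn

class CausalSelfAttention(nn.Module):
    def __init__(self, d_model: int = 768, n_heads: int = 12, max_len: int = 1024):
        super().__init__()
        assert d_model % n_heads == 0
        self.n_heads, self.d_head = n_heads, d_model // n_heads
        self.qkv = nn.Linear(d_model, 3 * d_model)  # fused Q, K, V projection
        self.proj = nn.Linear(d_model, d_model)     # output projection
        # lower-triangular mask forbids attending to future positions
        mask = torch.tril(torch.ones(max_len, max_len)).view(1, 1, max_len, max_len)
        self.register_buffer("mask", mask)

    def forward(self, x):
        B, T, C = x.shape
        q, k, v = self.qkv(x).split(C, dim=2)
        # reshape to (B, n_heads, T, d_head) so each head attends independently
        q = q.view(B, T, self.n_heads, self.d_head).transpose(1, 2)
        k = k.view(B, T, self.n_heads, self.d_head).transpose(1, 2)
        v = v.view(B, T, self.n_heads, self.d_head).transpose(1, 2)
        att = (q @ k.transpose(-2, -1)) / math.sqrt(self.d_head)  # scaled dot-product
        att = att.masked_fill(self.mask[:, :, :T, :T] == 0, float("-inf"))
        out = torch.softmax(att, dim=-1) @ v
        out = out.transpose(1, 2).contiguous().view(B, T, C)  # merge heads
        return self.proj(out)

x = torch.randn(2, 16, 768)
print(CausalSelfAttention()(x).shape)  # torch.Size([2, 16, 768])
```
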
