GPT Training Tool

This repository contains a GPT Training Tool adapted from Andrej Karpathy's MiniGPT repository. The tool runs as an API and can build, train, test, store, and reload various GPT models. It will also generate a graph of your model architecture and provide a summary of its size. Note: Currently it is equipped to learn character encodings, but it can be easily updated to learn subword encodings.

Installation

Before using the GPT Training Tool, make sure to install the required packages:

pip install -r requirements.txt

Usage

To run the GPT Training Tool, follow these steps:

Start the Application

Run the main script:

python app.py

Endpoints

The application exposes the following endpoints:

\new_model: creates a new model using provided params
\load_model: loads the provided model from checkpoint
\save_model: saves the current model weights, hyperparameters, and logs
\get_model: gets the name of the currently loaded model
\view_model: shows a summary of the model size
\get_params: gets the list of the currently loaded params
\update_params: patches the params file, currently only accepts training params
\train: trains the currently loaded model
\evaluate: evaluates the current loss on the test data set
\generate: returns generation
\complete: returns completion from prompt

A postman collection with all of these endpoints set up is stored in configs.

Name		Name	Last commit message	Last commit date
Latest commit History 33 Commits
checkpoints		checkpoints
components		components
configs		configs
constants		constants
src		src
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
app.py		app.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

GPT Training Tool

Installation

Usage

Start the Application

Endpoints

About

Uh oh!

Releases

Packages

Uh oh!

Languages

License

grayboywilliams/gpt_training_tool

Folders and files

Latest commit

History

Repository files navigation

GPT Training Tool

Installation

Usage

Start the Application

Endpoints

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages