
MathBeaver Fine-tuning Setup Guide

Overview

This project fine-tunes the Phi-4-reasoning-plus model on SOS chain-of-thought training data (from Bohan) using LoRA via Axolotl.

Built with Axolotl

Colab notebook for QA testing with the trained model: Open In Colab

Prerequisites

  • RunPod instance with sufficient disk space
  • SSH access to RunPod
  • Python 3.11 or higher
  • Access to training data (Google Drive folder)

Setup Instructions

1. Initial Setup

SSH into your RunPod instance and navigate to the workspace:

# Connect to RunPod
ssh runpod-tcp

# Navigate to the workspace
cd /workspace

2. Clone Shivam's repo (Richard's branch)

# clone the repo and cd into it
git clone https://github.com/Shivamshaiv/mathbeaver-finetune.git
cd mathbeaver-finetune
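The heading refers to Richard's branch, but the branch name isn't recorded in these notes; the placeholder below stands in for it:

# check out Richard's branch (placeholder name; substitute the real branch)
git checkout <richards-branch>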

3. Download files from Bohan's drive

# install gdown and move into the "data" folder
pip install gdown
cd data

# download the training data into "data" in the "Data_SOS_Cot" folder;
# this pulls the first 50 folders of SOS training data from Bohan's Google Drive
gdown --folder https://drive.google.com/drive/folders/1E1tHwS7YQOajZcjWsMXpTaPdRZm9jYcC --remaining-ok
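A quick sanity check that the download landed (assuming the folder name Data_SOS_Cot from the comment above):

# list the first few downloaded items
ls Data_SOS_Cot | head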

4. Set up environment

conda create -n phi-tuning python=3.11
conda init
# restart the shell, then:
conda activate phi-tuning

# install Axolotl
pip install axolotl
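Before moving on, a minimal check that Axolotl installed into the new environment:

# verify the install
pip show axolotl
python -c "import axolotl"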

5. Preprocess data to ChatML format

python preprocess_data_chatml.py
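For reference, ChatML wraps each conversation turn in <|im_start|>/<|im_end|> markers. A converted example would look roughly like the sketch below; the exact roles, system prompt, and file layout depend on preprocess_data_chatml.py:

<|im_start|>user
Solve for x: 2x + 3 = 7<|im_end|>
<|im_start|>assistant
Subtract 3 from both sides: 2x = 4. Divide both sides by 2: x = 2.<|im_end|>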

6. Configure config file (with small number of examples)

Edit config_test.yaml to point at the preprocessed data (starting with a small number of examples to verify the pipeline end to end), then run the training script:

python run_training.py --config config_test.yaml
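For orientation, a minimal LoRA config for Axolotl might look like the sketch below. The field names are standard Axolotl options, but the dataset path and hyperparameter values are placeholders, not the contents of the project's actual config_test.yaml:

base_model: microsoft/Phi-4-reasoning-plus
load_in_4bit: true              # 4-bit loading to fit the base model in GPU memory

adapter: lora
lora_r: 16
lora_alpha: 32
lora_dropout: 0.05
lora_target_linear: true        # apply LoRA to all linear projection layers

datasets:
  - path: data/processed_chatml.jsonl   # placeholder path to the preprocessed data
    type: chat_template
chat_template: chatml

sequence_len: 2048
micro_batch_size: 1
gradient_accumulation_steps: 4
num_epochs: 1
learning_rate: 0.0002

output_dir: ./outputs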

7. Results

Training output is saved to the outputs directory.
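For a LoRA run, the adapter files (for PEFT-style adapters, typically adapter_config.json and adapter_model.safetensors) should appear there alongside the training logs:

# inspect the trained adapter and logs
ls outputs/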
