Add supervised fine-tuning baseline implementation #4

Open
Vicbi wants to merge 5 commits into main from add-sft_training

Conversation

@Vicbi Vicbi commented Sep 25, 2025

Add supervised fine-tuning baseline implementation

♻️ Current Situation & Problem

This PR adds supervised fine-tuning (SFT) functionality for causal language models:

  • Adds instruction-tuning capability to the pipeline, so models can be fine-tuned on instruction-following tasks.
  • Enables both full fine-tuning and parameter-efficient training via LoRA.

⚙️ Release Notes

Added comprehensive supervised fine-tuning implementation with flexible training options.

  • Training Options: Supports both full fine-tuning and LoRA parameter-efficient training.
  • Data Format: Expects JSONL files with instruction, optional input, and output fields.
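To make the expected data format concrete, here is a sketch of one JSONL record and an Alpaca-style template it could be rendered with. The template, field handling, and `build_prompt` helper are illustrative assumptions, not taken from this PR:

```python
import json

# Hypothetical example record in the expected JSONL format;
# the "input" field is optional and may be absent or empty.
record = {
    "instruction": "Summarize the following text.",
    "input": "Supervised fine-tuning adapts a pretrained LM to follow instructions.",
    "output": "SFT teaches a pretrained language model to follow instructions.",
}

def build_prompt(example: dict) -> str:
    """Render one JSONL record into a single training string (illustrative template)."""
    if example.get("input"):
        return (
            f"### Instruction:\n{example['instruction']}\n\n"
            f"### Input:\n{example['input']}\n\n"
            f"### Response:\n{example['output']}"
        )
    return (
        f"### Instruction:\n{example['instruction']}\n\n"
        f"### Response:\n{example['output']}"
    )

line = json.dumps(record)              # one record per line in train.jsonl
prompt = build_prompt(json.loads(line))
print(prompt.splitlines()[0])          # → ### Instruction:
```

Records without the optional input field would simply omit the "### Input:" section.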

Usage Examples:

# Full SFT
python sft_baseline.py --model gpt2 --train_file train.jsonl --eval_file dev.jsonl --out_dir ./sft_out

# LoRA training
python sft_baseline.py --model meta-llama/Llama-3.2-8B --train_file train.jsonl \
    --eval_file dev.jsonl --out_dir ./lora_out --use_lora \
    --lora_r 16 --lora_alpha 32 --lora_dropout 0.05
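For context on what the training loop typically does with these records: SFT implementations commonly compute the loss only on the response tokens by setting prompt-token labels to -100, the default ignore_index of PyTorch's cross-entropy loss. Whether this PR masks prompts is not stated; the sketch below shows the general technique with plain lists and made-up token IDs:

```python
IGNORE_INDEX = -100  # default ignore_index for PyTorch CrossEntropyLoss

def mask_prompt_labels(input_ids, prompt_len):
    """Copy input_ids into labels, masking the first prompt_len positions.

    The model still attends to the full sequence, but cross-entropy loss
    is computed only on the response tokens that follow the prompt.
    """
    labels = list(input_ids)
    for i in range(min(prompt_len, len(labels))):
        labels[i] = IGNORE_INDEX
    return labels

# Toy example: 4 prompt tokens followed by 3 response tokens (IDs are arbitrary).
input_ids = [101, 2023, 2003, 1037, 7099, 3433, 102]
labels = mask_prompt_labels(input_ids, prompt_len=4)
print(labels)  # → [-100, -100, -100, -100, 7099, 3433, 102]
```

Without this masking, the model also spends loss budget on reproducing the instruction text, which is usually undesirable for instruction tuning.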

🚩 Next Steps

  • Curate and preprocess dataset for SFT.
  • Select target models for initial experiments.
  • Run benchmarking to validate implementation.

📝 Code of Conduct & Contributing Guidelines

By submitting this pull request, you agree to follow our Coding Guidelines.

@Vicbi Vicbi requested a review from joannalin22 September 25, 2025 00:32
@Vicbi Vicbi closed this Feb 16, 2026
@Vicbi Vicbi reopened this Feb 16, 2026
