Efficient Low-Resource Medical Question Answering (NLP Project)
Quantized LoRA Fine-Tuned LLM for MedQA

This project implements a Quantization + Low-Rank Adaptation (QLoRA) pipeline for fine-tuning a lightweight large language model (LLM) on the MedQA dataset. The goal is to train a parameter-efficient, domain-adapted model capable of answering medical examination-style questions under strict hardware constraints (a single consumer GPU).
Objective: Fine-tune a compact, quantized LLM that maintains high accuracy on medical QA reasoning tasks while dramatically reducing GPU memory usage and training cost.
Key Features:
🔹 4-bit Quantization (bitsandbytes) — Compresses model weights to 4-bit precision without major performance loss.
🔹 LoRA Adapters (PEFT) — Injects small trainable low-rank matrices into the attention layers for efficient adaptation (see the setup sketch below the pipeline diagram).
🔹 Task-Specific Fine-Tuning (MedQA) — Adapts the base LLM to medical question answering, focusing on reasoning and terminology.
🔹 Low GPU Memory Usage — Fine-tunes 7B-class models on a single RTX 3060 / 3090 GPU.
🔹 Evaluation Metrics — Reports accuracy, loss, and a reasoning trace at each validation step.
    Base Model (4-bit) ---> LoRA-Injected Layers ---> Fine-Tuning on MedQA
            │                         │
            ▼                         ▼
    Quantization (bitsandbytes)   Adaptation (PEFT)
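The pipeline above maps onto a short setup script. The following is a minimal sketch assuming the Hugging Face transformers, peft, and bitsandbytes libraries; the base model name and the LoRA hyperparameters (rank, alpha, target modules) are illustrative placeholders, not the exact values used in this project.

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig
from peft import LoraConfig, get_peft_model, prepare_model_for_kbit_training

model_name = "meta-llama/Llama-2-7b-hf"  # assumed base model, illustrative

# 4-bit NF4 quantization config (bitsandbytes)
bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.bfloat16,
    bnb_4bit_use_double_quant=True,
)

tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(
    model_name,
    quantization_config=bnb_config,
    device_map="auto",
)
model = prepare_model_for_kbit_training(model)

# LoRA adapters injected into the attention projections
lora_config = LoraConfig(
    r=16,                     # rank of the low-rank update matrices (assumed)
    lora_alpha=32,            # scaling factor (assumed)
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],
    lora_dropout=0.05,
    bias="none",
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora_config)
model.print_trainable_parameters()  # only the LoRA matrices are trainable
```

With this configuration only the LoRA matrices receive gradients, typically well under 1% of the total parameter count, which is what keeps the fine-tuning footprint within a single consumer GPU.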
Dataset:
- MedQA (USMLE-format medical exam)
- Multiple-choice medical questions
- Domain: physiology, pathology, pharmacology, diagnosis
- Data split: train / validation / test
- Tokenized using SentencePiece or the LLaMA tokenizer
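As a concrete example, here is a minimal loading-and-formatting sketch. It assumes the GBaker/MedQA-USMLE-4-options mirror of MedQA on the Hugging Face Hub; other MedQA distributions use different field names and split layouts, so treat the keys below as assumptions.

```python
from datasets import load_dataset

dataset = load_dataset("GBaker/MedQA-USMLE-4-options")  # assumed Hub mirror

def format_example(example):
    """Render one multiple-choice question as a single prompt string."""
    # "options" is assumed to be a dict mapping letters ("A".."D") to choices
    options = "\n".join(
        f"{letter}. {text}" for letter, text in sorted(example["options"].items())
    )
    prompt = (
        f"Question: {example['question']}\n"
        f"{options}\n"
        f"Answer: {example['answer_idx']}"  # gold answer letter
    )
    return {"text": prompt}

dataset = dataset.map(format_example)
print(dataset["train"][0]["text"])
```

The resulting `text` field can then be fed to a standard causal-LM fine-tuning loop (for example, trl's `SFTTrainer`).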
Results:

| Model | Params | Precision | GPU | Accuracy (MedQA) | GPU Memory |
|---|---|---|---|---|---|
| Base LLaMA-2-7B | 7B | FP16 | RTX 3090 | 43% | ~28 GB |
| QLoRA Fine-Tuned | 7B | 4-bit + LoRA | RTX 3060 | 47% | <8 GB |
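For completeness, here is a minimal sketch of how the accuracy column could be computed: score each test question by generating a few tokens greedily and reading off the first predicted answer letter. The `model`, `tokenizer`, and `dataset` objects are assumed to come from the sketches above, and this is one plausible scoring scheme rather than the project's exact evaluation code.

```python
import torch

@torch.no_grad()
def predict_letter(model, tokenizer, prompt):
    """Greedily generate a short continuation and extract the answer letter."""
    inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
    out = model.generate(**inputs, max_new_tokens=4, do_sample=False)
    # Decode only the newly generated tokens
    completion = tokenizer.decode(
        out[0, inputs["input_ids"].shape[1]:], skip_special_tokens=True
    )
    for ch in completion:
        if ch in "ABCD":
            return ch
    return None

correct = total = 0
for example in dataset["test"]:
    # Strip the gold letter from the formatted prompt before scoring
    prompt = example["text"].rsplit("Answer:", 1)[0] + "Answer:"
    if predict_letter(model, tokenizer, prompt) == example["answer_idx"]:
        correct += 1
    total += 1
print(f"MedQA test accuracy: {correct / total:.2%}")
```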