kldivergence

Star

Here are 2 public repositories matching this topic...

BY571 / sft-kl-lora-trainer

Star

Custom trl.SFTTrainer that adds a KL divergence loss between a LoRA-adapted model and its base model.

fine-tuning sft fine-tuning-llm kldivergence lora-adapters

Updated Jul 25, 2025
Python

Abdelrahman-Amen / Active_Learning_with_different_Query_Strategies

Star

This project explores the implementation of active learning techniques, focusing on various query strategies to optimize the selection of informative data points for model training. It aims to reduce the amount of labeled data required while improving model performance, especially in scenarios with limited labeled data.

python entropy numpy cuda uncertainty margin activelearning pyto kldivergence

Updated Jan 23, 2025
Jupyter Notebook

Improve this page

Add a description, image, and links to the kldivergence topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the kldivergence topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

kldivergence

Here are 2 public repositories matching this topic...

BY571 / sft-kl-lora-trainer

Abdelrahman-Amen / Active_Learning_with_different_Query_Strategies

Improve this page

Add this topic to your repo