A user-friendly & efficient knowledge distillation framework for LLMs, supporting off-policy, on-policy (OPD), cross-tokenizer, multimodal, and on-policy self-distillation.
knowledge-distillation distillation large-languge-models on-policy-distillation cross-tokenizer-distillation
-
Updated
Mar 31, 2026 - Python