Skip to content
Open
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
5 changes: 3 additions & 2 deletions notebooks/en/_toctree.yml
Original file line number Diff line number Diff line change
Expand Up @@ -88,8 +88,9 @@
title: Hyperparameter Optimization with Optuna and Transformers
- local: function_calling_fine_tuning_llms_on_xlam
title: Fine-tuning LLMs for Function Calling with the xLAM Dataset


- local: grpo_vllm_online_training
title: Efficient Online Training with GRPO and vLLM in TRL


- title: Computer Vision Recipes
isExpanded: false
Expand Down
17,209 changes: 17,209 additions & 0 deletions notebooks/en/grpo_vllm_online_training.ipynb

Large diffs are not rendered by default.

2 changes: 1 addition & 1 deletion notebooks/en/index.md
Original file line number Diff line number Diff line change
Expand Up @@ -7,11 +7,11 @@ applications and solving various machine learning tasks using open-source tools

Check out the recently added notebooks:

- [Efficient Online Training with GRPO and vLLM in TRL](grpo_vllm_online_training)
- [Fine-tuning LLMs for Function Calling with the xLAM Dataset](function_calling_fine_tuning_llms_on_xlam)
- [Post training an VLM for reasoning with GRPO using TRL](fine_tuning_vlm_grpo_trl)
- [TRL GRPO Reasoning with Advanced Reward](trl_grpo_reasoning_advanced_reward)
- [Fine-Tuning a Vision Language Model with TRL using MPO](fine_tuning_vlm_mpo)
- [Fine tuning a VLM for Object Detection Grounding using TRL](fine_tuning_vlm_object_detection_grounding)

You can also check out the notebooks in the cookbook's [GitHub repo](https://github.com/huggingface/cookbook).

Expand Down