reinforcement-learning-from-ai-feedback

Here are 3 public repositories matching this topic...

martin-wey / R2Vul

R2Vul: Learning to Reason about Software Vulnerabilities with Reinforcement Learning and Structured Reasoning Distillation

vulnerability-detection knowledge-distillation reasoning large-language-models reinforcement-learning-from-ai-feedback

Updated Aug 5, 2025
Python

Chinmaya-Kausik / RLHF-comparison

Comparing various RLHF methods

reinforcement-learning transformers transformer ppo dpo llm llms rlhf reinforcement-learning-from-human-feedback reinforcement-learning-from-ai-feedback

Updated Sep 23, 2024
Jupyter Notebook

satyampurwar / large-language-models

Unlocking the Power of Generative AI: In-Context Learning, Instruction Fine-Tuning, Reinforcement Learning Fine-Tuning, Retrieval Augmented Generation and LangGraph Workflows for AI Agents.

Updated Jun 4, 2025
Jupyter Notebook
