Skip to content

pasta99/RewardingDoubt

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

7 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Rewarding Doubt

Code implementation of Rewarding Doubt: A Reinforcement Learning Approach to Confidence Calibration of Large Language Models (https://arxiv.org/abs/2503.02623).

See the documentation in the single and multiple answer setting folders for usage.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors