Commit 225347a
committed
fix: Correct some errors in 'Direct Preference Optimization: Your Language Model is Secretly a Reward Model'
1 parent ec61f46 commit 225347a
File tree
1 file changed
+58
-66
lines changed- _posts
1 file changed
+58
-66
lines changed
0 commit comments