Commit 465f264
committed
feat: Add paper review for 'Direct Preference Optimization: Your Language Model is Secretly a Reward Model'
1 parent 4e92145 commit 465f264
File tree
1 file changed
+775
-0
lines changed- _posts
1 file changed
+775
-0
lines changed
0 commit comments