eval scriptsMix both instead of separating validation and test- run evals on base model
- OPD script
- OPD run
GRPO script- GRPO run
- run evals on rl model
- OPSD script
- OPSD run
viratzzs/alignment-without-rewards
Folders and files
| Name | Name | Last commit date | ||
|---|---|---|---|---|