ZYN: Zero-Shot Reward Models with Yes-No Questions
Updated Aug 15, 2023 - Python
ZYN: Zero-Shot Reward Models with Yes-No Questions
Official code for "Unsupervised Text Style Transfer with Controllable Intensity" — a two-stage SFT + PPO framework for fine-grained control over text readability and sentiment transfer.