Prime-RL / verifiers TSP environment (10-city hard config, lenient parser, eval-ready)
Updated Nov 24, 2025 - Python
Hybrid CDD RL environment scaffold for Prime Intellect (verifiers + prime-rl).
A verifiers RL environment that trains models to propose novel, evidence-grounded, falsifiable hypotheses. Rewards novelty with accountability.
A verifiers RLM environment for testing whether adaptive recursive search outperforms brittle manual RAG choreography on long synthetic corpora.