mathematical-reasoning

Here are 23 public repositories matching this topic...

🚀 ReVisual-R1 is a 7B open-source multimodal language model trained with a three-stage curriculum: cold-start pre-training, multimodal reinforcement learning, and text-only reinforcement learning. It aims for faithful, concise, and self-reflective reasoning, with state-of-the-art performance in visual and textual reasoning.

  • Updated Jul 9, 2025
  • Python

🧠 Train your own DeepSeek-R1-style reasoning model on a Mac. Billed as the first MLX implementation of GRPO, the technique behind R1's o1-matching performance, it lets you build mathematical reasoning models without expensive RLHF and is optimized for Apple Silicon. 🚀 A batteries-included training and inference framework for MLX-based language models on Apple Silicon.

  • Updated Sep 8, 2025
  • Python
