V3 → V4: What Changed and Where We're Going #11
2imi9
announced in
Announcements
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
V3 → V4 Transition
We've upgraded the autonomous research runner from V3 to V4, adapting agent design patterns from Claude Code (Anthropic). This post covers what changed, what broke, what we learned, and what's next.
What V3 Achieved
What V4 Adds (Claude Code Patterns)
FlexAttention Upgrade
We discovered V3 was using SDPA without sliding window (the SSSL pattern was ignored). V4 now uses PyTorch FlexAttention which supports sliding window natively:
Flash Attention Status
The dual-repo FA3 logic (
varunnealfor Hopper,kernels-communityfor others) was accidentally lost during the V4 rewrite and has been restored.Proposal Pipeline Fix
V4's first test produced 0 LLM-generated proposals because:
<think>...</think>) broke the regex parserAll fixed with multi-strategy parsing, few-shot examples, and diagnostic logging.
Open Issues
PR
What's Next
64 unit tests passing. All code on
experiments/baselinebranch.Beta Was this translation helpful? Give feedback.
All reactions