Not a single route, not a single block, and not only a board package.
The public innovation object is a coupled regime:
- frozen authoritative parent
- residual-family local repairs
- route-gated trunk recurrent blocks
- coupled promotion gates
- derived bounded board artifact
See:
The runtime is table-driven at execution time, and local adaptation has clear prior art. The narrower claim here is that a family-local repair only counts if it remains anchored to the frozen parent, survives the promotion contract, and can be crystallized into a bounded artifact.
So the public claim is not "we invented PEFT" and not "this is a general LLM on MCU."
Yes.
The public board proof is a real ESP32-C3 run with raw JSON read back from the board report partition.
The public research line is backed by saved authoritative and external replay reports, exact ablation reports, paired replay statistics, and published audit JSON.
Because this repo separates:
host-side accepted surfacederived fixed-batch board proof
Current host-side line:
- official
IFEval = 0.780037 - official
LogiQA = 0.392523 external_dev = 0.308908external_blind = 0.425072
Current board proof:
249 / 642 = 0.3878504672897196host_full_match = 642
They are related, but they are not the same layer.
Because the board artifact is a constrained compiled fixed-batch proof, while the host-side line is the broader current host surface.
The board result proves:
- real MCU execution
- exact host-full alignment on the published batches
It does not claim unrestricted open-input board inference.
Publicly, a residual family is a bounded error slice defined by:
- the parent competition shape
- the local repair target pair
- the routed activation surface that decides where the repair may fire
The repo publishes accepted family-local repair evidence. It does not publish the full family-mining frontier.
See:
No, not by itself.
What is public here:
- accepted mechanism contract
- replayable public evidence
- representative locality probes
- exact no-trunk ablation
What remains private:
- family discovery pipeline
- support and carrier mining
- rejected frontier
- curriculum construction rules
- full training and materialization pipeline
Yes.
The repo now also publishes:
- a published-eval route-disabled proxy
- a no-topology current-surface control
- a depth-one current-surface control
- two target-only diagnostic controls
- a retrieval-only classic baseline
- a lexical-only classic baseline
- a same-parent trained linear-readout control
- a same-parent trained BitFit option-bias control
- a same-parent trained LoRA-style hash-delta control
- a same-parent trained low-rank adapter baseline
- a same-parent trainable-budget-matched LoRA-style baseline
- a same-parent trainable-budget-matched low-rank adapter baseline
- two same-parent low-rank adapter-style family controls
See:
Yes, but with a clear boundary.
The public bundle now includes:
- a same-parent retrieval-only baseline
- a same-parent lexical-only baseline
- a same-parent trained linear-readout control
- a same-parent trained BitFit option-bias control
- a same-parent trained LoRA-style hash-delta control
- a same-parent trained low-rank adapter baseline
- a same-parent trainable-budget-matched LoRA-style baseline
- a same-parent trainable-budget-matched low-rank adapter baseline
It also includes:
- route-disabled and topology-removal controls
- depth-one recurrence control
- target-only diagnostic controls
- same-parent low-rank adapter-style controls
These give useful rigor, and the repo now publishes trainable-budget-matched same-parent PEFT-style controls for the current public surface.
Comparator coverage is stronger, but still incomplete.
This repo therefore does not claim blanket dominance over nearby LoRA, BitFit, adapter, or local-editing baselines. The strongest public causal evidence today is:
- frozen parent
- exact no-trunk ablation
- route-disabled, topology-removal, and depth-one controls
- retrieval-only, lexical-only, trained linear, trained BitFit, trained LoRA-style, and trained low-rank adapter classic baselines
- trainable-budget-matched same-parent LoRA-style and low-rank adapter baselines
- target-only diagnostic controls
- architecture-near low-rank adapter controls
- replayable same-parent slices
- paired replay stats
- representative locality probes
What is still not claimed:
- blanket superiority over every nearby PEFT design
- transformer-module canonical LoRA superiority, because this parent is not a standard transformer PEFT target
- broad unseen-family generalization
See:
No.
The current public evidence supports:
- no obvious external collapse
- a real auditable board proof
- a replayable gain over the frozen parent
It does not support:
- broad unseen-family generalization
- proof of broad reasoning transfer
Additional release-boundary audit artifacts remain available in raw form under ../results/audit.
Yes, but it is deliberately scoped.
The repo now publishes:
- a host-side open-input structured JSON demo
That demo is real open input, but narrow:
- free-text prompt in
- structured JSON out
- exact-match and unsafe-guard evaluation
It is not the same thing as:
- open-input MCU deployment
See:
Because for constrained edge deployments:
- determinism matters
- postmortem analysis matters
- bounded behavior matters
- honest negative evidence matters
That is why this repo keeps the trust boundary visible instead of hiding it behind one headline score.