Skip to content

feat(nqx-core): pre-silicon emulator and chip development kit#1

Merged
ORTODOX1 merged 1 commit intomainfrom
feat/nqx-core-emulator
Apr 30, 2026
Merged

feat(nqx-core): pre-silicon emulator and chip development kit#1
ORTODOX1 merged 1 commit intomainfrom
feat/nqx-core-emulator

Conversation

@ORTODOX1
Copy link
Copy Markdown
Owner

NQX-Core is a software model of the domain-specific dataflow processor that
the NautilusQuant pipeline maps to 1:1. Lives at nqx-core/ inside this repo.

What's added under nqx-core/:

  • 95 Python modules (emulator, ISA, assembler+disassembler, 7 functional
    units, cycle-accurate pipeline, energy model, coverage, JTAG TAP model,
    perf counters)
  • 7 SystemVerilog modules + Verilator testbench + ROM .mem + Yosys synth
    flow + OpenLane2 config (Skywater 130nm MPW path) + SymbiYosys formal
    verification harness
  • 50 mm² TSMC 7nm floor-plan + 1 GHz timing closure analysis +
    9-section pre-tape-out checklist
  • FastAPI HTTP service (auto CPU/GPU backend, middleware logging,
    Prometheus metrics, structured errors, /health/deep with LUT hash)
  • Demo: TurboQuant baseline emulation, end-to-end attention, Llama-3-70B
    scaling projection, side-by-side comparison, pitch deck (10 slides),
    ASCII visualizations, demo runner, Jupyter notebooks
  • 6 proof artifacts: angular uniformity, linear-vs-Lloyd-Max,
    phi-vs-random, determinism witness, LUT budget, energy delta vs random
  • Pre-silicon SDK: random instruction generator, coverage tracking,
    libnqx C ABI, Linux driver skeleton, boot ROM, programming guide,
    errata, install.sh, SDK README
  • 16 CLI launchers (nqx-claude, nqx-deepseek, nqx-trio, nqx-audit,
    nqx-demo, nqx-debug, nqx-launch-all, nqx-publish, nqx-asm, nqx-disasm,
    nqx-sim, nqx-rig, nqx-status, nqx-stats, nqx-doctor, nqx-debug-jtag) +
    KDE Plasma desktop shortcut
  • Multi-arch Docker (amd64+arm64) + GPU image + vast.ai automation
    (deploy/quickstart-vastai.sh) + smoke test + load test
  • 46 test files, 241 tests passing in <20s

Top-level changes in this PR:

  • README.md: added pointer section to nqx-core/
  • .github/workflows/nqx-core-ci.yml: CI for the subdirectory only

Acceptance criteria measured:

  • Orthogonality T^Tᵀ·T = I error: 1.6e-7 (target < 1e-6)
  • Roundtrip without quantization RMSE: 9.6e-8 (target < 1e-6)
  • Bit-exact match against this repo's nautilus_triton.py: max diff < 1e-4
  • Compression ratio: exactly 4.00x
  • Throughput: 1 vec/cycle steady-state (18-cycle pipeline depth)
  • ROM-LUT (dim=128): 1910 bytes (vs 32 KB random rotation matrix)
  • Determinism: 100/100 runs bit-identical

Built using parallel AI agents (Claude + DeepSeek-V4) coordinated through
structured task lists in nqx-core/audits/prompts/. Full development log
preserved for transparency.

NQX-Core is a software model of the domain-specific dataflow processor that
the NautilusQuant pipeline maps to 1:1. Lives at nqx-core/ inside this repo.

What's added under nqx-core/:
- 95 Python modules (emulator, ISA, assembler+disassembler, 7 functional
  units, cycle-accurate pipeline, energy model, coverage, JTAG TAP model,
  perf counters)
- 7 SystemVerilog modules + Verilator testbench + ROM .mem + Yosys synth
  flow + OpenLane2 config (Skywater 130nm MPW path) + SymbiYosys formal
  verification harness
- 50 mm² TSMC 7nm floor-plan + 1 GHz timing closure analysis +
  9-section pre-tape-out checklist
- FastAPI HTTP service (auto CPU/GPU backend, middleware logging,
  Prometheus metrics, structured errors, /health/deep with LUT hash)
- Demo: TurboQuant baseline emulation, end-to-end attention, Llama-3-70B
  scaling projection, side-by-side comparison, pitch deck (10 slides),
  ASCII visualizations, demo runner, Jupyter notebooks
- 6 proof artifacts: angular uniformity, linear-vs-Lloyd-Max,
  phi-vs-random, determinism witness, LUT budget, energy delta vs random
- Pre-silicon SDK: random instruction generator, coverage tracking,
  libnqx C ABI, Linux driver skeleton, boot ROM, programming guide,
  errata, install.sh, SDK README
- 16 CLI launchers (nqx-claude, nqx-deepseek, nqx-trio, nqx-audit,
  nqx-demo, nqx-debug, nqx-launch-all, nqx-publish, nqx-asm, nqx-disasm,
  nqx-sim, nqx-rig, nqx-status, nqx-stats, nqx-doctor, nqx-debug-jtag) +
  KDE Plasma desktop shortcut
- Multi-arch Docker (amd64+arm64) + GPU image + vast.ai automation
  (deploy/quickstart-vastai.sh) + smoke test + load test
- 46 test files, 241 tests passing in <20s

Top-level changes in this PR:
- README.md: added pointer section to nqx-core/
- .github/workflows/nqx-core-ci.yml: CI for the subdirectory only

Acceptance criteria measured:
- Orthogonality T^Tᵀ·T = I error: 1.6e-7 (target < 1e-6)
- Roundtrip without quantization RMSE: 9.6e-8 (target < 1e-6)
- Bit-exact match against this repo's nautilus_triton.py: max diff < 1e-4
- Compression ratio: exactly 4.00x
- Throughput: 1 vec/cycle steady-state (18-cycle pipeline depth)
- ROM-LUT (dim=128): 1910 bytes (vs 32 KB random rotation matrix)
- Determinism: 100/100 runs bit-identical

Built using parallel AI agents (Claude + DeepSeek-V4) coordinated through
structured task lists in nqx-core/audits/prompts/. Full development log
preserved for transparency.
@ORTODOX1 ORTODOX1 merged commit 06a9131 into main Apr 30, 2026
0 of 3 checks passed
@ORTODOX1 ORTODOX1 deleted the feat/nqx-core-emulator branch April 30, 2026 07:56
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant