feat(nqx-core): pre-silicon emulator and chip development kit#1
Merged
feat(nqx-core): pre-silicon emulator and chip development kit#1
Conversation
NQX-Core is a software model of the domain-specific dataflow processor that the NautilusQuant pipeline maps to 1:1. Lives at nqx-core/ inside this repo. What's added under nqx-core/: - 95 Python modules (emulator, ISA, assembler+disassembler, 7 functional units, cycle-accurate pipeline, energy model, coverage, JTAG TAP model, perf counters) - 7 SystemVerilog modules + Verilator testbench + ROM .mem + Yosys synth flow + OpenLane2 config (Skywater 130nm MPW path) + SymbiYosys formal verification harness - 50 mm² TSMC 7nm floor-plan + 1 GHz timing closure analysis + 9-section pre-tape-out checklist - FastAPI HTTP service (auto CPU/GPU backend, middleware logging, Prometheus metrics, structured errors, /health/deep with LUT hash) - Demo: TurboQuant baseline emulation, end-to-end attention, Llama-3-70B scaling projection, side-by-side comparison, pitch deck (10 slides), ASCII visualizations, demo runner, Jupyter notebooks - 6 proof artifacts: angular uniformity, linear-vs-Lloyd-Max, phi-vs-random, determinism witness, LUT budget, energy delta vs random - Pre-silicon SDK: random instruction generator, coverage tracking, libnqx C ABI, Linux driver skeleton, boot ROM, programming guide, errata, install.sh, SDK README - 16 CLI launchers (nqx-claude, nqx-deepseek, nqx-trio, nqx-audit, nqx-demo, nqx-debug, nqx-launch-all, nqx-publish, nqx-asm, nqx-disasm, nqx-sim, nqx-rig, nqx-status, nqx-stats, nqx-doctor, nqx-debug-jtag) + KDE Plasma desktop shortcut - Multi-arch Docker (amd64+arm64) + GPU image + vast.ai automation (deploy/quickstart-vastai.sh) + smoke test + load test - 46 test files, 241 tests passing in <20s Top-level changes in this PR: - README.md: added pointer section to nqx-core/ - .github/workflows/nqx-core-ci.yml: CI for the subdirectory only Acceptance criteria measured: - Orthogonality T^Tᵀ·T = I error: 1.6e-7 (target < 1e-6) - Roundtrip without quantization RMSE: 9.6e-8 (target < 1e-6) - Bit-exact match against this repo's nautilus_triton.py: max diff < 1e-4 - Compression ratio: exactly 4.00x - Throughput: 1 vec/cycle steady-state (18-cycle pipeline depth) - ROM-LUT (dim=128): 1910 bytes (vs 32 KB random rotation matrix) - Determinism: 100/100 runs bit-identical Built using parallel AI agents (Claude + DeepSeek-V4) coordinated through structured task lists in nqx-core/audits/prompts/. Full development log preserved for transparency.
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
NQX-Core is a software model of the domain-specific dataflow processor that
the NautilusQuant pipeline maps to 1:1. Lives at nqx-core/ inside this repo.
What's added under nqx-core/:
units, cycle-accurate pipeline, energy model, coverage, JTAG TAP model,
perf counters)
flow + OpenLane2 config (Skywater 130nm MPW path) + SymbiYosys formal
verification harness
9-section pre-tape-out checklist
Prometheus metrics, structured errors, /health/deep with LUT hash)
scaling projection, side-by-side comparison, pitch deck (10 slides),
ASCII visualizations, demo runner, Jupyter notebooks
phi-vs-random, determinism witness, LUT budget, energy delta vs random
libnqx C ABI, Linux driver skeleton, boot ROM, programming guide,
errata, install.sh, SDK README
nqx-demo, nqx-debug, nqx-launch-all, nqx-publish, nqx-asm, nqx-disasm,
nqx-sim, nqx-rig, nqx-status, nqx-stats, nqx-doctor, nqx-debug-jtag) +
KDE Plasma desktop shortcut
(deploy/quickstart-vastai.sh) + smoke test + load test
Top-level changes in this PR:
Acceptance criteria measured:
Built using parallel AI agents (Claude + DeepSeek-V4) coordinated through
structured task lists in nqx-core/audits/prompts/. Full development log
preserved for transparency.