Anchor application and evidence repo for the Indus lane. Useful now, improving without overclaim.
This is a live window into the Zer0pa lab. None of the work in this repo is a final release. Zer0pa GitHub repos are open as a matter of course for visibility and community participation. Always-in-beta is the posture: useful now, improving continuously. Open gates and honest blockers are surfaced in plain text below — see ## What We Don't Claim and ## Upcoming Workstreams.
This repository is part of the Zer0pa Gnosis Portfolio.
Code in this repository is licensed under the Apache License 2.0. See
LICENSE for the full text. SPDX identifier: Apache-2.0.
Documentation, reports, and written materials are licensed under Creative
Commons Attribution 4.0 International. SPDX identifier: CC-BY-4.0. Canonical
terms: https://creativecommons.org/licenses/by/4.0/.
Data and fixtures are handled per dataset and artifact family. See
DATA_POLICY.md for this repository's data boundary. The code license does not
license raw corpora, image-bearing cultural-heritage assets, private HF
artifacts, model weights, endpoint logs, or operational transcripts.
Trademarks - "Gnosis", "Zer0pa Gnosis", and distinctive sub-marks are
trademarks of Zer0pa. Apache-2.0 and CC-BY-4.0 do not grant trademark rights.
See TRADEMARKS.md.
Public visibility is a separate repository-setting action. The license files in this repo define the intended open-source/open-documentation terms for released Gnosis code and written materials; they do not publish rights-gated data.
This repository works with material originating from the Indus Valley Civilization, including the undeciphered Indus script. Zer0pa claims no proprietary right over the Indus script, its imagery, or the archaeological record.
Zer0pa's work is the computational analysis, lattice methods, and falsification discipline applied to that material. This repository does not claim a definitive decipherment of any Indus sign or sequence.
Good-faith inquiries from identified communities, governmental antiquity authorities, or institutional bodies may be sent to architects@zer0pa.ai.
Gnosis applied-research repo for the Indus-script lane. It packages the carried-forward evidence chain around the Indus morphological catalogue (admitted at k=70 with a stability caveat), the Phase 5 falsification work (linguistic structure confirmed, substrate identification not), the Paper 1 / Paper 2 verdict stack, and a search-without-decode application surface implemented in this repo as the first runtime slice. It is not a decipherment repo, not a generic search product, and not the portfolio's lead thesis.
Headline metric: pytest -q → 14 passed (≈0.03 s local, ≈0.3 s
on RunPod from fresh clone). The clean-room search-without-decode
runtime (src/gnosis_indus/search_surface/) reproduces 6
authority-doc query records on the demo fixture
(artifacts/phase4/indus_catalogue_demo_fixture.json), with
sequence_search median latency well under the authority-doc 100 ms
gate (authority/review_pack/search_demo_summary.md). Per-phase
verifications: .gpd/phases/0{1,2,3}-*/VERIFICATION.md — all PASS,
10/10 confidence.
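
As an illustration of what a median-latency gate check of this kind can look like, here is a minimal sketch. It is not the runtime in src/gnosis_indus/search_surface/: the toy inscription data, the function signatures, and the `median_latency_ms` helper are all assumptions made for this example. Only the 100 ms gate value comes from the text above.

```python
import statistics
import time

# Hypothetical toy data: inscription id -> sequence of sign-cluster ids.
# These values are placeholders, not rows from the real fixture.
TOY_INSCRIPTIONS = {
    "toy-001": [3, 17, 42, 8],
    "toy-002": [17, 42, 5],
    "toy-003": [9, 3, 17, 42],
}

LATENCY_GATE_MS = 100.0  # gate value quoted in this README


def sequence_search(inscriptions, pattern):
    """Return sorted ids of inscriptions containing `pattern` as a contiguous run."""
    n = len(pattern)
    hits = []
    for ins_id, seq in inscriptions.items():
        # Slide a window of length n over the sequence and compare slices.
        if any(seq[i:i + n] == pattern for i in range(len(seq) - n + 1)):
            hits.append(ins_id)
    return sorted(hits)


def median_latency_ms(fn, *args, repeats=200):
    """Time `fn(*args)` repeatedly and return the median wall time in ms."""
    samples = []
    for _ in range(repeats):
        start = time.perf_counter()
        fn(*args)
        samples.append((time.perf_counter() - start) * 1000.0)
    return statistics.median(samples)


hits = sequence_search(TOY_INSCRIPTIONS, [17, 42])
latency = median_latency_ms(sequence_search, TOY_INSCRIPTIONS, [17, 42])
print(hits, latency < LATENCY_GATE_MS)
```

The median is used rather than the mean so that a single slow sample (for example, a cold cache on the first call) does not decide whether the gate holds.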
Honest blocker: Image-bearing sign rights remain BLOCKED_RIGHTS
per DATA_POLICY.md; the full k=70 catalogue (412 signs / 70 clusters
/ 179 inscriptions) stays FETCH_EXTERNAL. The bundled fixture is
small and authority-anchored to
authority/review_pack/search_demo_summary.md.
- The lane is migrated out of the original monorepo as a standalone, truthful scaffold without losing the Phase 4 stability caveat or the Phase 5 non-decipherment posture.
- A clean-room search-without-decode runtime (`src/gnosis_indus/search_surface/`) anchored to `authority/review_pack/search_demo_summary.md` reproduces six authority-doc query records on a small authority-anchored demo fixture, with `sequence_search` median latency well under the authority-doc 100 ms gate.
- A clean-machine replay path exists: any Python 3.11 host can reproduce the test surface via `pip install -e ".[test]" && pytest`.
- Off-repo custody (private HF dataset + model repos) is provisioned for future heavy-artifact promotion under `DATA_POLICY.md` classification, with the register documented in `HF_CUSTODY_REGISTER.md`.
- We do not claim decipherment of the Indus script.
- We do not claim proven substrate identification.
- We do not claim all rights gates are cleared. Image rights for sign-bearing releases remain open; this is an open lab, not a finished product.
- We do not claim unrestricted public redistribution rights for any image-bearing or rights-gated corpus referenced in the original monorepo work; sign images stay `BLOCKED_RIGHTS` in `DATA_POLICY.md`.
- We do not claim the bundled fixture is the real full catalogue. The full k=70 catalogue (412 signs, 70 clusters, 179 inscriptions) stays `FETCH_EXTERNAL` per `DATA_POLICY.md`.
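
The data classes named above can be read as a fail-closed gate on what may ship inside the repository. The sketch below is one hypothetical way to express that idea. The `VENDORED` label, the `REGISTER` mapping, and the `may_bundle` helper are assumptions for illustration; DATA_POLICY.md remains the authority on the actual classes and their meaning.

```python
from enum import Enum


class DataClass(Enum):
    VENDORED = "VENDORED"              # hypothetical label for in-repo artifacts
    FETCH_EXTERNAL = "FETCH_EXTERNAL"  # named in this README
    BLOCKED_RIGHTS = "BLOCKED_RIGHTS"  # named in this README


# Hypothetical register: artifact family -> data class.
REGISTER = {
    "demo_fixture": DataClass.VENDORED,
    "full_k70_catalogue": DataClass.FETCH_EXTERNAL,
    "sign_images": DataClass.BLOCKED_RIGHTS,
}


def may_bundle(family):
    """Allow bundling only for explicitly vendored families.

    Unknown families are refused, so a missing register entry
    cannot silently publish rights-gated material.
    """
    return REGISTER.get(family) is DataClass.VENDORED


for family in ("demo_fixture", "full_k70_catalogue", "sign_images", "unlisted"):
    print(family, may_bundle(family))
```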
| Surface | Status |
|---|---|
| `pytest -q` (Python 3.11) | 14 passed (≈0.03 s local, ≈0.3 s on RunPod from fresh clone) |
| `python -m compileall src` | passes |
| Per-phase verification reports | .gpd/phases/0{1,2,3}-*/VERIFICATION.md (all PASS, latest 10/10 confidence) |
| GitHub Actions CI | wired in .github/workflows/ci.yml (boring CI: install + pytest, no external fetches) |
| Operational endpoint leak scan | 0 matches across all tracked surfaces |
| Falsification battery (PRD §1.3) | all four conditions hold (k=70 caveat preserved; no decipherment language; search not reframed as sovereign repo; no image-rights inflation) |
| Claim | Evidence |
|---|---|
| Phase 4 conditional catalogue at k=70 with stability caveat | authority/review_pack/phase4_governing_verdict.md, authority/review_pack/indus_catalogue_summary.md |
| Phase 5 linguistic-structure-without-decipherment | authority/review_pack/phase5_governing_verdict.md |
| Paper 1 (DSH-ready) and Paper 2 (held until Paper 1 submission) verdicts | authority/papers/paper1_governing_verdict_v2.md, authority/papers/paper2_governing_verdict.md |
| Search-without-decode functional spec (API + latency/compression gates + 10 query ground-truth records) | authority/review_pack/search_demo_summary.md |
| Clean-room runtime reproducing the spec | src/gnosis_indus/search_surface/{__init__,catalogue,engine,_fixture}.py |
| Authority-anchored demo fixture | artifacts/phase4/indus_catalogue_demo_fixture.json (every row traced verbatim to a line in the authority doc; see artifacts/phase4/README.md) |
| Phase 02 verification (10/10) | .gpd/phases/02-extraction-and-minimal-replay-surface/VERIFICATION.md |
| Phase 03 verification (10/10, PRD complete) | .gpd/phases/03-truth-preserving-packaging/VERIFICATION.md |
| Decision log (six rows, each with rollback trigger) | .gpd/DECISIONS.md |
| Off-repo custody register | HF_CUSTODY_REGISTER.md |
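
The "traced verbatim" property claimed for the demo fixture can be checked mechanically. The following is a minimal sketch under stated assumptions: the `source_line` field name, the row shape, and the placeholder authority text are hypothetical and do not describe the real fixture schema, which is documented in artifacts/phase4/README.md.

```python
def untraced_rows(rows, authority_text):
    """Return fixture rows whose quoted source line is absent from the authority doc.

    Each row is assumed (hypothetically) to carry a `source_line` field
    holding the exact line it was copied from.
    """
    authority_lines = {line.strip() for line in authority_text.splitlines()}
    return [row for row in rows if row["source_line"].strip() not in authority_lines]


# Placeholder authority text and rows, invented purely for this example.
AUTHORITY_TEXT = """\
placeholder line A
placeholder line B
"""

ROWS = [
    {"id": "row-1", "source_line": "placeholder line A"},
    {"id": "row-2", "source_line": "placeholder line C"},  # not in the doc
]

missing = untraced_rows(ROWS, AUTHORITY_TEXT)
print([row["id"] for row in missing])
```

An empty result would mean every row is anchored; any returned row identifies a fixture entry that cannot be traced back to the authority document.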
```
.
├── NOTICE # Apache-2.0 code and CC-BY-4.0 docs posture
├── README.md # this file
├── CHANGELOG # release history
├── CITATION.cff # citation metadata
├── CODE_OF_CONDUCT.md # community conduct standards
├── PRD_GNOSIS_INDUS_ANCHOR_APPLICATION.md # sovereign brief
├── AGENTS.md # autonomous-agent rules
├── AUTHORITY_SNAPSHOT.md # carried-forward truth
├── AUDITOR_PLAYBOOK.md # outsider-audit fast path (pre-Phase-02 vintage; refresh pending)
├── PUBLIC_AUDIT_LIMITS.md # what audit can and cannot conclude
├── HF_CUSTODY_REGISTER.md # off-repo storage truth
├── SOURCE_BOUNDARY.md # source families included / deferred / excluded
├── DATA_POLICY.md # data classes and rights posture
├── ROADMAP.md # milestone tracker
├── _internal/ # scaffolding and internal orchestration docs
│   ├── MIGRATION_PLAN.md # extraction roadmap and waves (internal)
│   ├── STATUS_REPORT_2026-04-24.md # orchestrator execution narrative (internal)
│   ├── AUTONOMOUS_EXECUTION_POLICY.md
│   ├── GPD_BOOTSTRAP_GUIDE.md
│   ├── STARTUP_PROMPT.md
│   ├── UNIVERSAL_STARTUP_PROMPT.md
│   ├── WORKSTREAM_GPD_INIT_CHECKLIST.md
│   └── TEMPLATE_USAGE.md
├── docs/
│   ├── ARCHITECTURE.md
│   ├── LEGAL_BOUNDARIES.md
│   ├── FAQ.md
│   ├── SUPPORT.md
│   └── family/INDUS_EXPORT_CONTRACT.md
├── authority/ # exact-source-only verdict copies
│   ├── review_pack/ # Phase 4/5 governing verdicts and PRDs
│   └── papers/ # Paper 1/2 governing verdicts
├── src/gnosis_indus/
│   └── search_surface/ # Phase 02 first runtime slice (clean-room)
├── artifacts/phase4/
│   └── indus_catalogue_demo_fixture.json # authority-anchored demo fixture
├── tests/
│   └── test_search_surface.py # 14 tests reproducing authority queries
├── .gpd/ # GPD control plane (PROJECT, STATE, ROADMAP, REQUIREMENTS, DECISIONS, CONVENTIONS, phase plans + verifications)
└── .github/
    ├── ISSUE_TEMPLATE/
    ├── PULL_REQUEST_TEMPLATE.md
    └── workflows/ci.yml # boring CI: install + pytest
```
Reproduce the Phase 02 stronger smoke path on any clean Python 3.11 host:
```bash
git clone https://github.com/Zer0pa/Indus-Valley.git gnosis-indus
cd gnosis-indus
python3.11 -m venv .venv && source .venv/bin/activate
pip install -e ".[test,numerics]"
pytest -q
```

Expected: 14 passed. The pytest suite reproduces the authority-doc
query records from authority/review_pack/search_demo_summary.md
against the bundled artifacts/phase4/indus_catalogue_demo_fixture.json.
The fixture is small and authority-anchored; the real full catalogue
stays FETCH_EXTERNAL per DATA_POLICY.md. The Phase 4 stability
caveat (k=70 conditional) remains visible in the package and fixture
surfaces.
- Image-rights and provenance for any public image-bearing release remain unresolved; sign images stay `BLOCKED_RIGHTS` in `DATA_POLICY.md`.
- License text has landed as Apache-2.0 for code and CC-BY-4.0 for docs. Public visibility remains separate and is blocked on rights, provenance, and release wording review.
- Phase 02 landed only the search-without-decode slice. The Phase 4 catalogue and Phase 5 falsification slices are sequenced as later extraction waves (`MIGRATION_PLAN.md`, `SOURCE_BOUNDARY.md`).
- Full k=70 catalogue (412 signs, 70 clusters, 179 inscriptions) is not vendored; the bundled demo fixture reproduces only what the authority doc enumerates.
- `AUDITOR_PLAYBOOK.md` is pre-Phase-02 vintage. It is functionally superseded by the README Quick Start and the Phase 03 verification reports (`.gpd/phases/03-truth-preserving-packaging/VERIFICATION.md`), but the playbook itself awaits a maintenance refresh.
| Field | Value |
|---|---|
| Verdict | STAGED |
| Posture | rights_gated_data_classes_image_blocked_text_fetch_external |
The runtime and test surface are functional and reproducible from a fresh clone.
Image-rights and full-catalogue redistribution gates remain open; those are
explicit, not hidden. STAGED reflects: production-quality scaffold, not yet
cleared for all data releases.
- AUDITOR_PLAYBOOK.md maintenance refresh: currently self-flagged "pre-Phase-02 vintage"; should be updated to reference the Phase 02 / 03 verification reports and the Quick Start replay path. (`AUDITOR_PLAYBOOK.md`)
- Phase 4 catalogue extraction wave: extract first clean runtime slice into `src/gnosis_indus/catalogue/` per `_internal/MIGRATION_PLAN.md` wave 2.
- Phase 5 falsification extraction wave: extract first clean runtime slice into `src/gnosis_indus/falsification/` per `_internal/MIGRATION_PLAN.md` wave 3.
- Image-rights and provenance review for any image-bearing public release; sign images remain `BLOCKED_RIGHTS` in `DATA_POLICY.md` until a rights review is completed by the relevant authority.
- Full k=70 catalogue redistribution rights review: the 412-sign / 70-cluster / 179-inscription catalogue stays `FETCH_EXTERNAL` until cleared.
(none currently; Phase 5 posture — linguistic structure confirmed, substrate identification not — is stable and not under active re-investigation.)
(none currently.)