
experiment: incremental compilation #6003

Draft
Kamirus wants to merge 4 commits into master from kamil/incremental-compilation

Conversation

@Kamirus
Contributor

@Kamirus Kamirus commented Apr 10, 2026

Experiment: Incremental Compilation

Experimental incremental compilation support for moc. This PR explores two caching strategies that reduce compile times when dependencies (packages like mo:base, mo:core) stay the same. Both use the --moi-cache <dir> flag.


Experiment 1: Scope caching (.moi) — --check mode

Goal: Speed up repeated type-checking by caching each library's type-checked scope.

Approach: Serialize Scope.t per library to .moi files using a custom binary format, handling cyclic type constructors. Cache key is a Merkle hash of the library's source content and its transitive dependencies. On cache hit, type-checking for that library is skipped entirely.
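The Merkle fingerprinting described above can be sketched roughly as follows. This is an illustrative sketch in Python (the actual implementation is OCaml); the function names are hypothetical, but the scheme matches the description: a library's cache key is a SHA-256 hash over its own source content combined with the sorted fingerprints of its transitive dependencies, so any change anywhere in the dependency tree invalidates the cached .moi entry.

```python
import hashlib

def source_hash(source: bytes) -> str:
    """Hash of a single library's source content."""
    return hashlib.sha256(source).hexdigest()

def fingerprint(source: bytes, dep_fingerprints: list[str]) -> str:
    """Merkle fingerprint: own source hash + sorted transitive dep fingerprints.

    Sorting makes the fingerprint independent of import order; hashing
    dep *fingerprints* (not dep sources) makes invalidation transitive.
    """
    h = hashlib.sha256()
    h.update(source_hash(source).encode())
    for dep in sorted(dep_fingerprints):
        h.update(dep.encode())
    return h.hexdigest()
```

On lookup, the compiler would compare this fingerprint against the one stored in the .moi header; a match means the cached Scope.t can be loaded and type-checking skipped.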

Results (Map.test.mo, motoko-core, $(mops sources), 78 dependency files):

Scenario                      Time
--check (no cache)            ~287ms
--check + .moi cache (cold)   ~287ms
--check + .moi cache (warm)   ~229ms

The scope cache saves ~58ms in check mode (~20% of check work).

Conclusion: Modest speedup. Type-checking is already fast, so caching it saves little in absolute terms.

Profiling: where does compile time go?

This experiment also revealed the full compile-time breakdown, motivating experiment 2:

Phase                          Time       %
Parse + check (per-lib)        167ms      11%
Lowering                       61ms       4%
7 IR passes (whole-program)    1,076ms    73%
Codegen                        164ms      11%
Total (-c)                     ~1,760ms

The seven whole-program IR passes (erase_typ_field, show, eq, await, async, tailcall, const) dominate compile time at 73%.


Experiment 2: IR caching (.moic) — -c compile mode

Goal: Speed up repeated compilation by caching post-IR-pass library declarations.

Approach: After lowering and running all seven IR passes on library code, serialize the resulting Ir.dec list to .moic files using Marshal. On warm cache, the compiler skips lowering and all IR passes for unchanged libraries — only the main program goes through the full pipeline, then gets linked with cached library IR before codegen.
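The cache read/write described above can be sketched like this. Again an illustrative Python sketch of the scheme, not the actual OCaml code: pickle stands in for OCaml's Marshal, and COMPILER_VERSION stands in for the Source_id.id embedded in cache headers; the function names are hypothetical. It shows the three safeguards the PR relies on: a compiler-version check, a hash over the serialized payload, and an atomic write-then-rename so a crash can never leave a partial cache file.

```python
import hashlib, os, pickle, tempfile

COMPILER_VERSION = "moc-example"  # assumption: stands in for Source_id.id

def write_cache(path: str, decs) -> None:
    """Serialize cached library declarations with a version header, atomically."""
    payload = pickle.dumps(decs)  # pickle stands in for OCaml's Marshal
    blob = pickle.dumps({
        "version": COMPILER_VERSION,
        "digest": hashlib.sha256(payload).hexdigest(),
        "payload": payload,
    })
    fd, tmp = tempfile.mkstemp(dir=os.path.dirname(path) or ".", suffix=".tmp")
    with os.fdopen(fd, "wb") as f:
        f.write(blob)
    os.replace(tmp, path)  # atomic rename: readers never observe a partial file

def read_cache(path: str):
    """Return cached declarations, or None on any mismatch (treated as a miss)."""
    try:
        with open(path, "rb") as f:
            rec = pickle.loads(f.read())
        if rec["version"] != COMPILER_VERSION:
            return None  # reject caches from a different compiler build
        if hashlib.sha256(rec["payload"]).hexdigest() != rec["digest"]:
            return None  # corrupted payload
        return pickle.loads(rec["payload"])
    except (OSError, pickle.UnpicklingError, KeyError):
        return None
```

Treating every failure mode as a cache miss keeps the compiler correct even with stale or corrupt cache files; the worst case is simply a full recompile.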

Results (Map.test.mo, motoko-core, $(mops sources)):

Build           CPU time   Speedup
Baseline (-c)   1.56s      1.0x
Cold cache      1.74s      0.9x (write overhead)
Warm cache      0.68s      2.3x

Cold and warm produce byte-identical Wasm. Cache invalidation works correctly.

Where the remaining time goes (warm, 0.68s):

  • ~0.20s — parse + typecheck all files (re-done every time)
  • ~0.48s — lower main program + codegen on the full combined program

Codegen is now the bottleneck — it still walks all declarations (cached + main). Eliminating this would require separate Wasm compilation + linking, which is a fundamentally larger change.


Overall conclusion

~350 lines of new code for a 2.3x compile-mode speedup. The IR caching approach works but hits diminishing returns: the next meaningful improvement requires Wasm-level separate compilation, not more caching. This POC uses Marshal (tied to exact compiler binary) and has no cache eviction — not production-ready as-is.

Test plan

  • test/run/moi-cache.mo — roundtrip: counter, types with generics, recursive tree types
  • test/fail/moi-cache-error.mo — type errors still reported correctly with cache
  • Compile-mode IR caching — byte-identical Wasm on Map.test.mo, cache invalidation verified

Kamirus added 2 commits April 10, 2026 15:06
Add `--moi-cache <dir>` flag to cache pre-compiled dependency scopes
to disk, enabling incremental type-checking in `--check` mode.

When a dependency's source hash and transitive dependency fingerprints
match the cached entry, the type-checker loads the cached Scope.t
directly instead of re-parsing and re-checking the file.

Implementation:
- Custom binary serialization for Scope.t and Type.t, handling cyclic
  type constructors via a create-then-fixup pattern
- Merkle fingerprinting (SHA-256 of source hash + sorted dep fingerprints)
  for transitive cache invalidation
- Compiler version (Source_id.id) embedded in cache headers to reject
  stale caches from different compiler builds
- Atomic file writes (write to .tmp, rename) for crash safety
- Mixin libraries correctly skipped (not cacheable)
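The create-then-fixup pattern for cyclic type constructors can be sketched as below. This is a hypothetical Python illustration of the general two-pass idea (the real code deserializes OCaml Type.t constructors), with invented names: first allocate a placeholder node for every constructor, then patch the links once all nodes exist, so cycles never require a node that has not been created yet.

```python
class Con:
    """Placeholder for a type constructor; body is patched in pass 2."""
    def __init__(self, name):
        self.name = name
        self.body = None

def deserialize(records):
    """records: list of (name, referenced_names) describing a possibly cyclic graph."""
    # Pass 1 (create): allocate every node before touching any edge.
    nodes = {name: Con(name) for name, _ in records}
    # Pass 2 (fixup): resolve references; cycles are fine since all nodes exist.
    for name, refs in records:
        nodes[name].body = [nodes[r] for r in refs]
    return nodes
```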

Currently restricted to `--check` mode. Compile-mode IR caching is
planned as a follow-up (see .cursor/plans/).

Made-with: Cursor
@Kamirus Kamirus changed the title feat: incremental compilation experiments experiment: incremental compilation experiments Apr 10, 2026
@Kamirus Kamirus changed the title experiment: incremental compilation experiments experiment: incremental compilation Apr 10, 2026
Kamirus added 2 commits April 10, 2026 15:54
Add detailed plan for caching post-IR-pass library decs to skip
parsing, type-checking, lowering, and all 7 IR passes for unchanged
dependencies during compilation. Expected ~74% faster compiles.

Made-with: Cursor
Cache post-IR-pass library declarations to skip lowering, IR passes,
and partial codegen work for unchanged dependencies. On warm cache,
only the main program is lowered and passed through IR transforms,
then linked with cached library IR before codegen.

Benchmarked on motoko-core Map.test.mo:
- Baseline: 1.56s → Warm cache: 0.68s (2.3x speedup)

Key changes:
- ir_cache.ml: Marshal-based serialization of Ir.dec list + id_stamps
  with binary hash validation and atomic writes
- pipeline.ml: split compile path into cached/uncached, with
  compile_combined_prog handling link + codegen
- const.ml: accept known_const parameter for fragment analysis
- cons.ml: bump_stamps_past to prevent stamp collisions after deser
- construct.ml: get/set_id_stamps for fresh name counter continuity

Made-with: Cursor
@ggreif
Contributor

ggreif commented Apr 11, 2026

Do you use https://github.com/ocaml-ppx/ppx_deriving_protobuf#usage ?

@Kamirus
Contributor Author

Kamirus commented Apr 13, 2026

Do you use https://github.com/ocaml-ppx/ppx_deriving_protobuf#usage ?

No, it was a quick POC measuring the speedup potential of caching the typing environment and the IR.
It was easier to do the de/serialization manually than to refactor our current codebase to fit into the 'deriving' requirements.

@crusso
Contributor

crusso commented Apr 20, 2026

Something is a bit fishy since I don't think we currently run IR-passes on library code at all - we just do them on the combined IR.

Won't this produce more and redundant code, like multiple versions of the same eq and show functions?

@Kamirus
Contributor Author

Kamirus commented Apr 24, 2026

Something is a bit fishy since I don't think we currently run IR-passes on library code at all - we just do them on the combined IR.

Won't this produce more and redundant code, like multiple versions of the same eq and show functions?

Yes, it would. I was just experimenting with this idea first, trying to measure how much faster compilation would get if all library IR was cached.
