Summary
Track the first end-to-end release of MailPlus Intelligence using the recommended medium architecture.
Phase 1 target
full metadata/thread index
incremental sync
selective semantic extraction
wiki/entity promotion
raw email remains only in MailPlus
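To make the "index everything, keep raw mail in MailPlus" target concrete, here is a minimal sketch of what a metadata-only index entry could look like. The class and field names (`MessageLocator`, `IndexRecord`, `lane`, `selected_for_extraction`) are illustrative assumptions, not a decided schema; the actual backend and schema are open questions tracked in the checklist.

```python
from dataclasses import dataclass
from datetime import datetime
from typing import Optional


@dataclass(frozen=True)
class MessageLocator:
    """Hypothetical pointer back to the raw message in MailPlus."""
    mailbox: str
    folder: str
    message_id: str  # Message-ID header value


@dataclass
class IndexRecord:
    """Metadata-only index entry; the raw body is deliberately absent."""
    locator: MessageLocator
    thread_id: str
    sender: str
    subject: str
    sent_at: datetime
    lane: Optional[str] = None            # assumed to be set later by classification
    selected_for_extraction: bool = False


def group_by_thread(records):
    """Build a thread index: thread_id -> records ordered by send time."""
    threads = {}
    for rec in records:
        threads.setdefault(rec.thread_id, []).append(rec)
    for recs in threads.values():
        recs.sort(key=lambda r: r.sent_at)
    return threads
```

Because the record carries only a locator and headers, fetching the body stays an explicit, on-demand call against MailPlus rather than a read from the index.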
Prioritized implementation checklist
Roadmap: MailPlus remains canonical raw archive; Pheidon becomes the intelligence layer (#1): Establish and document the canonical-store boundary
Decide metadata/thread storage backend and schema strategy (#9): Choose the storage backend and schema direction
Build metadata and thread index for all MailPlus mail (#2): Build the metadata/thread index
Define MailPlus locator and export contract (#10): Define the MailPlus locator and export contract
Implement incremental MailPlus exporter/sync job (#3): Implement incremental exporter/sync behavior
Add observability, checkpointing, and auditability for sync/extraction jobs (#15): Add operational safety rails: checkpointing, logs, and auditability
Add search and on-demand raw message fetch workflows (#4): Deliver search and on-demand raw message fetch
Define classification heuristics and evaluation set for important-vs-noise mail (#12): Define measurable classification heuristics and noise suppression
Classify mail into VIP, project, admin, financial, travel, legal, and ignore/noise lanes (#5): Implement classification lanes
Design selected-message text cache policy (#11): Decide selected-message text cache boundaries
Define semantic output schema for summaries, entities, obligations, and events (#13): Define semantic output schema
Generate thread summaries, entity updates, and obligation candidates from selected mail (#6): Generate summaries, entities, obligations, and events
Design promotion/review workflow for wiki, memory, and reminders (#14): Define promotion/review workflow
Promote distilled email intelligence into wiki and memory with review gates (#7): Promote distilled intelligence into wiki/memory with review gates
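The incremental sync and checkpointing items above could fit together roughly as follows. This is a sketch under stated assumptions: `fetch_since` stands in for a MailPlus export call that does not exist yet, and the `seq` and `message_id` keys are hypothetical fields of whatever the export contract ends up returning.

```python
import json
from pathlib import Path


def load_checkpoint(path):
    """Return the last processed cursor, or 0 when no checkpoint exists yet."""
    p = Path(path)
    if not p.exists():
        return 0
    return json.loads(p.read_text())["cursor"]


def save_checkpoint(path, cursor):
    """Write to a temp file, then rename, so a crash cannot leave a partial file."""
    p = Path(path)
    tmp = p.with_suffix(".tmp")
    tmp.write_text(json.dumps({"cursor": cursor}))
    tmp.replace(p)


def sync_once(fetch_since, index, checkpoint_path, audit_log):
    """Pull messages newer than the checkpoint and upsert their metadata.

    fetch_since(cursor) is a hypothetical exporter callable returning dicts
    with at least "seq" (monotonic position) and "message_id".
    """
    cursor = load_checkpoint(checkpoint_path)
    batch = fetch_since(cursor)
    for item in batch:
        index[item["message_id"]] = item  # upsert keeps re-runs idempotent
        cursor = max(cursor, item["seq"])
    # The checkpoint advances only after the whole batch is indexed.
    save_checkpoint(checkpoint_path, cursor)
    audit_log.append({"fetched": len(batch), "cursor": cursor})
    return len(batch)
```

Keying the index on the message identifier means a job that crashes before saving its checkpoint can simply run again: the same batch is re-fetched and overwritten rather than duplicated.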
Questions this phase should answer well
What’s my history with this person?
Did I already commit to this?
What admin, travel, financial, or legal follow-ups are pending?
Out of scope for phase 1
full semantic chunking on most mail
broad entity-graph magic before the basics are reliable
overbuilt proactivity before recall quality is proven
Definition of done
End-to-end flow exists from MailPlus source → index → retrieval → selected extraction → approved promotion.
Raw mail remains canonical in MailPlus.
Memory surfaces stay distilled and high-signal.
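One way to read "approved promotion" is as a hard gate between extraction output and memory. The sketch below assumes a hypothetical `Candidate` type and a three-state review field; the real workflow is still to be defined under the promotion/review checklist items.

```python
from dataclasses import dataclass
from enum import Enum


class ReviewState(Enum):
    PENDING = "pending"
    APPROVED = "approved"
    REJECTED = "rejected"


@dataclass
class Candidate:
    """A distilled item (summary, entity update, obligation) awaiting review."""
    kind: str
    text: str
    source_message_id: str  # pointer back to MailPlus, not a copy of the mail
    state: ReviewState = ReviewState.PENDING


def promote(candidates, memory):
    """Append only approved candidates to memory; return how many were promoted."""
    promoted = 0
    for c in candidates:
        if c.state is ReviewState.APPROVED:
            memory.append(
                {"kind": c.kind, "text": c.text, "source": c.source_message_id}
            )
            promoted += 1
    return promoted
```

Pending and rejected candidates never reach memory, and every promoted entry keeps a reference to its source message so the raw mail can still be fetched from MailPlus on demand.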