
Commit cf4612e

Chore/issue 368 architecture (#372)
1 parent 6aad3f1 commit cf4612e

File tree

2 files changed (+67, -16 lines)


docs/architecture.md

Lines changed: 66 additions & 15 deletions
@@ -2,35 +2,86 @@

## High Level Architecture

-The following diagram depicts the high level architecture of Timeplus SQL engine, starting from a single node deployment.
+The following diagram depicts the high-level components of the Timeplus core engine.

![Architecture](/img/proton-high-level-arch.gif)

-All of the components / functionalities are built into one single binary.
+### The Flow of Data

-## Data Storage
+#### Ingest

-Users can create a stream by using `CREATE STREAM ...` [DDL SQL](/sql-create-stream). Every stream has 2 parts at storage layer by default:
+When data is ingested into Timeplus, it first lands in the NativeLog. As soon as the log commit completes, the data becomes immediately available for streaming queries.

-1. the real-time streaming data part, backed by Timeplus NativeLog
-2. the historical data part, backed by ClickHouse historical data store.
+In the background, dedicated threads continuously tail new entries from the NativeLog and flush them to the Historical Store in larger, optimized batches.
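A minimal sketch of this ingest path in Timeplus SQL, using a hypothetical `device_metrics` stream (names and types are illustrative):

```sql
-- A stream is backed by the NativeLog (streaming part) and the Historical Store.
CREATE STREAM device_metrics (
    device_id string,
    temperature float64
);

-- Each INSERT is first committed to the NativeLog.
INSERT INTO device_metrics (device_id, temperature) VALUES ('sensor-1', 21.5);

-- An unbounded streaming query sees the row as soon as the log commit completes;
-- background threads later flush the same data to the Historical Store.
SELECT device_id, temperature FROM device_metrics;
```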

-Fundamentally, a stream in Proton is a regular database table with a replicated Write-Ahead-Log (WAL) in front but is streaming queryable.
+#### Query

-## Data Ingestion
+Timeplus supports three query modes: **historical**, **streaming**, and **hybrid (streaming + historical)**; a sketch of each appears after this list.

-When users `INSERT INTO ...` data to Proton, the data always first lands in NativeLog which is immediately queryable. Since NativeLog is in essence a replicated Write-Ahead-Log (WAL) and is append-only, it can support high frequent, low latency and large concurrent data ingestion work loads.
+- **Historical Query (a.k.a. Table Query)**

-In background, there is a separate thread tailing the delta data from NativeLog and commits the data in bigger batch to the historical data store. Since Proton leverages ClickHouse for the historical part, its historical query processing is blazing fast as well.
+  Works like a traditional database query. Data is fetched directly from the **Historical Store**, and standard database optimizations such as the following apply, accelerating large-scale scans and point lookups:
+  - Primary index
+  - Skipping index
+  - Secondary index
+  - Bloom filter
+  - Partition pruning

-## External Stream
+- **Streaming Query**

-In quite lots of scenarios, data is already in Kafka / Redpanda or other streaming data hubs, users can create [external streams](/external-stream) to point to the streaming data hub and do streaming query processing directly and then either materialize them in Proton or send the query results to external systems.
+  Operates on the **NativeLog**, which stores records in sequence. Queries run incrementally, enabling real-time processing patterns such as **incremental ETL**, **joins**, and **aggregations**.

+- **Hybrid Query**

+  Streaming queries can automatically **backfill** from the Historical Store when:
+  1. Data no longer exists in the NativeLog (due to retention policies).
+  2. Pulling from the Historical Store is faster than rewinding the NativeLog to replay old events.

-## Learn More
+  This allows seamless handling of scenarios such as **fast backfill** and **mixed real-time + historical analysis** without breaking query continuity, and without needing yet another external batch system to load historical data, which usually introduces extra latency, inconsistency, and cost.
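As a rough illustration of the three modes, again assuming the hypothetical `device_metrics` stream (whether backfill is triggered depends on retention settings and engine version):

```sql
-- Historical (table) query: scans the Historical Store only and then terminates.
SELECT count() FROM table(device_metrics);

-- Streaming query: tails the NativeLog and emits incremental aggregation updates.
SELECT device_id, avg(temperature)
FROM device_metrics
GROUP BY device_id;

-- Hybrid query: a time filter reaching into the past lets the engine backfill
-- older events from the Historical Store before continuing with live data.
SELECT device_id, count()
FROM device_metrics
WHERE _tp_time > now() - 1h
GROUP BY device_id;
```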

-Interested users can refer [How Timeplus Unifies Streaming and Historical Data Processing](https://www.timeplus.com/post/unify-streaming-and-historical-data-processing) blog for more details regarding its academic foundation and latest industry developments. You can also check the video below from [Kris Jenkins's Developer Voices podcast](https://www.youtube.com/watch?v=TBcWABm8Cro). Jove shared our key decision choices, how Timeplus manages data and state, and how Timeplus achieves high performance with single binary.
+### The Dual Storage

-<iframe width="560" height="315" src="https://www.youtube.com/embed/QZ0le2WiJiY?si=eF45uwlXvFBpMR14" title="YouTube video player" frameborder="0" allow="accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture; web-share" referrerpolicy="strict-origin-when-cross-origin" allowfullscreen></iframe>
+#### NativeLog
+
+The **Timeplus NativeLog** is the system’s write-ahead log (WAL) or journal: an append-only, high-throughput store optimized for low-latency, highly concurrent data ingestion. In a cluster deployment, it is replicated using **Multi-Raft** for fault tolerance. By enforcing a strict ordering of records, NativeLog forms the backbone of stream processing in **Timeplus Core**.
+
+NativeLog uses its own record format, consisting of two high-level types:
+
+- **Control records** (a.k.a. meta records) – store metadata and operational information.
+- **Data records** – columnar-encoded for fast serialization/deserialization and efficient vectorized streaming execution.
+
+Each record is assigned a monotonically increasing sequence number—similar to a Kafka offset—which guarantees ordering.
+
+Lightweight indexes are maintained to support rapid rewind and replay operations by **timestamp** or **sequence number** in streaming queries.
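For example, a streaming query can be rewound to an earlier position before tailing new records; the sketch below uses the `_tp_time` event-time column on the hypothetical `device_metrics` stream (sequence-number based replay relies on the same log indexes, with syntax depending on the engine version):

```sql
-- Replay by timestamp: start from events of the last 15 minutes,
-- then keep tailing the NativeLog as new records are appended.
SELECT device_id, temperature
FROM device_metrics
WHERE _tp_time >= now() - 15m;
```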
+
+#### Historical Store
+
+The **Historical Store** in Timeplus stores data **derived** from the **NativeLog**. It powers use cases such as:
+
+- **Historical queries** (a.k.a. *table queries* in Timeplus)
+- **Fast backfill** into streaming queries
+- Acting as a **serving layer** for downstream applications
+
+Timeplus supports two storage encodings for the Historical Store: **columnar** and **row**.
+
+##### 1. Columnar Encoding (*Append Stream*)
+Optimized for **append-mostly workloads** with minimal data mutation, such as telemetry or event logs. Benefits include:
+
+- High data compression ratios
+- Blazing-fast scans for analytical workloads
+- Backed by the **ClickHouse MergeTree** engine
+
+This format is ideal when the dataset is largely immutable and query speed over large volumes is a priority.
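A plain `CREATE STREAM` yields an append stream whose historical part uses the columnar encoding; the example below is illustrative (stream and column names are hypothetical):

```sql
-- Append stream: columnar Historical Store, well suited to immutable event data.
CREATE STREAM web_events (
    user_id string,
    url string,
    status_code uint16
);

-- Analytical scan over the accumulated history via a table query.
SELECT status_code, count()
FROM table(web_events)
GROUP BY status_code;
```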
+
+##### 2. Row Encoding (*Mutable Stream*)
+Designed for **frequently updated datasets** where `UPSERT` and `DELETE` operations are common. Features include:
+
+- Per-row **primary indexes**
+- **Secondary indexes** for flexible lookups
+- Faster and more efficient **point queries** compared to columnar storage
+
+Row encoding is the better choice when low-latency, high-frequency updates are required.
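A hedged sketch of a mutable stream, assuming the `CREATE MUTABLE STREAM ... PRIMARY KEY` DDL available in Timeplus Enterprise (check the SQL reference for the exact syntax in your version; names are hypothetical):

```sql
-- Mutable stream: rows that share a primary key are upserted in place.
CREATE MUTABLE STREAM device_state (
    device_id string,
    temperature float64,
    updated_at datetime64(3)
)
PRIMARY KEY (device_id);

-- A later insert with the same key overwrites the previous value;
-- point lookups by key are served by the per-row primary index.
INSERT INTO device_state (device_id, temperature, updated_at)
VALUES ('sensor-1', 22.8, now64(3));

SELECT * FROM table(device_state) WHERE device_id = 'sensor-1';
```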
+
+## References
+
+[How Timeplus Unifies Streaming and Historical Data Processing](https://www.timeplus.com/post/unify-streaming-and-historical-data-processing)

docs/why-timeplus.md

Lines changed: 1 addition & 1 deletion
@@ -75,4 +75,4 @@ SQL-based rules can be used to trigger or resolve alerts in systems such as Page

### Scalability and Elasticity

-Timeplus supports both MPP architecture for pure on-prem deployments—ideal when ultra-low latency is critical and storage/compute separation for elastic, cloud-native setups. In the latter mode, S3 is used for the NativeLog, Historical Store, and Query State Checkpoints. Combined with Kubernetes HPA or AWS Auto Scaling Groups, this enables highly concurrent continuous queries on clusters that scale automatically with demand.
+Timeplus supports three deployment models: **MPP (shared-nothing)** for on-premises setups where ultra-low latency is critical, **storage/compute separation** for elastic, cloud-native environments that use S3 (or similar object storage) to hold the NativeLog, Historical Store, and Query State Checkpoints with zero replication overhead, and a **hybrid mode** that combines both approaches. In storage/compute separation deployments, clusters integrate seamlessly with Kubernetes HPA or AWS Auto Scaling Groups, enabling highly concurrent continuous queries while scaling automatically with demand. Please refer to [Timeplus Architecture](/architecture) for more details.
