Skip to content

Commit 95dba4d

Browse files
committed
Sharding: Implement suggestions by CodeRabbit
1 parent c28811f commit 95dba4d

File tree

2 files changed

+6
-6
lines changed

2 files changed

+6
-6
lines changed

docs/admin/sharding-partitioning.md

Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -103,9 +103,9 @@ for a benchmark.
103103
- Identify the record size
104104
- Calculate the throughput
105105

106-
Then, to calculate the number of shards, you should consider that the size of each
107-
shard should roughly be between 10 - 50 GB, and that each node can only manage
108-
up to 1000 shards.
106+
Then, to calculate the number of shards, consider that each shard should
107+
roughly be between 10 – 50 GB, and that each node can manage
108+
up to 1_000 shards by default.
109109

110110
:::{caution}
111111
**Over-sharding and over-partitioning**

docs/performance/sharding.md

Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -29,7 +29,7 @@ If you are looking for an intro to sharding, see also the
2929
To avoid running your clusters with too many shards or too large shards,
3030
implement the following guidelines as a rule of thumb:
3131

32-
- Use shard sizes between 10GB and 50GB.
32+
- Use shard sizes between 10 GB and 50 GB.
3333

3434
- Keep the number of records on each shard below 200 million.
3535

@@ -50,7 +50,7 @@ the type of hardware you are using.
5050

5151
## Sizing considerations
5252

53-
General principles requires careful consideration for cluster
53+
General principles require careful consideration of cluster
5454
sizing and architecture.
5555
Keep the following things in mind when building your sharding strategy.
5656
Each shard incurs overhead in terms of open files, RAM allocation, and CPU cycles
@@ -59,7 +59,7 @@ for maintenance operations.
5959
### Shard size vs. number of shards
6060

6161
The optimal approach balances shard count with shard size. Individual shards should
62-
typically contain 10-50 GB of data, being the sweet spot for most
62+
typically contain 10-50GB of data, being the sweet spot for most
6363
workloads. In large clusters, this often means fewer shards than total CPU cores,
6464
as larger shards can still be processed efficiently by multiple CPU cores during
6565
query execution.

0 commit comments

Comments
 (0)