You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: latest/ug/clusters/eks-provisioned-control-plane-introduction.adoc
+36-64Lines changed: 36 additions & 64 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -14,38 +14,31 @@ Learn how to use Amazon EKS Provisioned Control Plane for predictable and high p
14
14
15
15
Amazon EKS Provisioned Control Plane is a feature that enables cluster
16
16
administrators to select from a set of scaling tiers and
17
-
designate their chosen tier for high, predictale performance from the cluster's control plane. This enables cluster administrators to ensure that the
18
-
cluster's control plane is always provisioned with the specified
19
-
capacity, regardless of actual utilization. Each control plane scaling
20
-
tier offers significantly increased compute capacity and performance
21
-
compared to what was available previously.
22
-
23
-
By default, Amazon EKS cluster control planes scale up and down
24
-
automatically in response to a signals such as
17
+
designate their chosen tier for high, predictale performance from the cluster's control plane. This enables cluster administrators to ensure that the cluster's control plane is always provisioned with the specified
18
+
capacity, regardless of actual utilization.
19
+
20
+
By default, Amazon EKS cluster's control planes scales up and down
21
+
automatically in response to signals such as
25
22
control plane utilization. This is the
26
-
*standard mode* of EKS Control Plane operation and works well for a wide
27
-
range of use cases by dynamically allocating sufficient capacity as
28
-
needed. However, for workloads characterized by unpredictable traffic
29
-
spikes, extreme bursts, or those requiring very high amounts of control
30
-
plane capacity, standard mode may not be ideal. EKS Provisioned Control
31
-
Plane is the right solution for workloads that require predictable, high performance from the control plane that is instantly available.
23
+
*standard mode* of EKS Control Plane operation and works well for the vast majority of use cases by dynamically allocating sufficient capacity as
24
+
needed. However, for workloads characterized by extreme bursts of traffic, or those requiring very high amounts of control
25
+
plane capacity, *Provisioned Control mode* is the right solution.
32
26
33
27
With EKS Provisioned Control Plane, cluster administrators can
34
28
pre-provision desired control plane capacity ahead of time, providing
35
29
predictable and high performance from the cluster's control plane even in
36
-
scenarios with bursty or erratic traffic patterns. Additionally, EKS
37
-
Provisioned Control Plane enables cluster administrators to provision
38
-
the same control plane capacity across environments—from development and
39
-
staging to production and disaster recovery sites. This is crucial for
30
+
scenarios with bursty or erratic traffic patterns. EKS
31
+
Provisioned Control Plane also enables cluster administrators to provision
32
+
the same control plane capacity across environments, from development and
33
+
staging to production and disaster recovery sites. This is important for
40
34
ensuring that the control plane performance obtained across environments
41
35
is consistent and predictable. Finally, EKS Provisioned Control Plane
42
-
gives you access to highly scalable and performant control plane tiers
43
-
that were not previously available, enabling the running of massively
36
+
gives you access to very high levels of control plane performance, enabling the running of massively
44
37
scalable AI workloads, high-performance computing, and large-scale data
45
38
processing workloads on Kubernetes.
46
39
47
40
All existing and new Amazon EKS clusters operate in standard mode by
48
-
default. For clusters requiring predictable and high
41
+
default. For clusters requiring high, predictable
49
42
performance from the control plane, you can opt in to use the EKS
50
43
Provisioned Control Plane feature. You will be billed at the hourly rate
51
44
for the particular control plane scaling tier in addition to the
@@ -58,45 +51,40 @@ image::images/control-plane-modes.png[Amazon EKS Control Plane Modes,scaledwidth
58
51
== Use cases
59
52
60
53
EKS Provisioned Control Plane is designed to address specific scenarios
61
-
where predictable and high control plane performance are
54
+
where high and predictable control plane performance are
62
55
critical to your operations. Understanding these use cases can help you
63
56
determine whether EKS Provisioned Control Plane is the right solution
64
57
for your workloads.
65
58
66
59
*Performance-critical workloads* – For workloads that demand minimal
67
60
latency and maximum performance from the Kubernetes control plane, EKS
68
61
Provisioned Control Plane provides capacity that eliminates
69
-
the variability inherent in reactive scaling. This is particularly
70
-
important for applications where control plane responsiveness directly
71
-
impacts user experience or business outcomes.
62
+
performance variability with control plane scaling.
72
63
73
64
*Massively scalable workloads* – If you run highly scalable workloads
74
65
such as AI training and inference, high-performance computing, or
75
66
large-scale data processing that require large number of nodes running
76
-
in the cluster, EKS Provisioned Control Plane provides the necessary
77
-
control plane capacity to support these demanding workloads. The
78
-
higher-tier scaling options ensure your control plane can handle the
79
-
volume of API requests and scheduling operations required by large-scale
80
-
distributed computing.
67
+
in the cluster, Provisioned Control Plane provides the necessary
68
+
control plane capacity to support these demanding workloads.
81
69
82
70
*Anticipated high-demand events* – When you expect a sudden surge in
83
71
control plane requests due to an upcoming event such as e-commerce sales
84
72
or promotions, product launches, holiday shopping seasons, or major
85
-
sporting or entertainment events, EKS Provisioned Control Plane allows
73
+
sporting or entertainment events, Provisioned Control Plane allows
86
74
you to scale your control plane capacity in advance. This proactive
87
75
approach ensures your control plane is ready to handle the increased
88
76
load without waiting for automatic scaling to respond to demand.
89
77
90
-
*Environment consistency* – EKS Provisioned Control Plane enables you to
91
-
match control plane capacity and performance across development and
78
+
*Environment consistency* – Provisioned Control Plane enables you to
79
+
match control plane capacity and performance across staging and
92
80
production environments, helping you identify potential issues early
93
81
before deployment to production. By maintaining the same control plane
94
82
tier across environments, you can ensure that testing results accurately
95
83
reflect production behavior, reducing the risk of performance-related
96
84
surprises during rollout.
97
85
98
86
*Disaster recovery and business continuity* – For disaster recovery
99
-
scenarios, EKS Provisioned Control Plane allows you to provision
87
+
scenarios, Provisioned Control Plane allows you to provision
100
88
failover environments with the same level of capacity as your primary
101
89
environment. This ensures minimal disruption and quick recovery during
102
90
failover events, as your disaster recovery cluster will have identical
@@ -107,28 +95,19 @@ from the moment it's activated.
107
95
108
96
EKS Provisioned Control Plane offers scaling tiers named using t-shirt
109
97
sizes (XL, 2XL, 4XL). Each tier defines its capacity through three key
110
-
attributes that determine the performance characteristics of your
98
+
Kubernetes attributes that determine the performance characteristics of your
111
99
cluster's control plane. Understanding these attributes helps you select
112
100
the appropriate tier for your workload requirements.
113
101
114
102
*API request concurrency* measures the number of requests that the
115
-
Kubernetes control plane's API server can process concurrently. This
116
-
attribute determines how many simultaneous API operations your cluster
117
-
can handle, which is critical for high throughput workloads.
103
+
Kubernetes control plane's API server can process concurrently, which is critical for high throughput workloads.
118
104
119
105
*Pod scheduling rate* indicates how quickly the default Kubernetes
120
-
scheduler can schedule pods on nodes, measured in pods per second. This
121
-
attribute is particularly important for workloads that frequently create
122
-
and destroy pods, such as batch processing jobs or auto-scaling
123
-
applications.
124
-
125
-
*Cluster database size* indicates the total amount of storage space
126
-
available in the etcd database used to store cluster state, measured in
127
-
gigabytes. This attribute determines how many Kubernetes objects (pods,
128
-
services, deployments, etc.) your cluster can store.
129
-
130
-
When you provision your cluster's control plane on a certain scaling tier using
131
-
EKS Provisioned Control Plane, Amazon EKS ensures your cluster's control
106
+
scheduler can schedule pods on nodes, measured in pods per second.
107
+
108
+
*Cluster database size* indicates the storage space allocated to etcd, the database that holds the cluster state/metadata.
109
+
110
+
When you provision your cluster's control plane on a certain scaling tier using Provisioned Control Plane, AEKS ensures your cluster's control
132
111
plane maintains the limits corresponding to that tier. The
133
112
limits of control plane scaling tiers vary by Kubernetes version,
134
113
as shown in the following tables.
@@ -208,25 +187,20 @@ are scrapable from your EKS cluster's Prometheus endpoint (see <<prometheus,here
NOTE: The etcd_mvcc_db_total_size_in_use_in_bytes metric is currently being rolled out gradually across all EKS clusters. The metric is expected to be available in all clusters by the end of Q2 2026
217
-
218
194
=== Understanding Tier capacity versus actual performance
219
195
220
-
When you select a Provisioned Control Plane scaling tier, the tier attributes represent the underlying configurations that Amazon EKS applies to your control plane. However, the actual performance you achieve depends on your specific workload patterns, configurations, and adherence to Kubernetes best practices. For example, while a 4XL tier configures API Priority and Fairness (APF) with 6,800 concurrent request seats, the actual request throughput you obtain from the control plane depends on the types of operations being performed. For example, list operations consume more resources than get operations, and cluster-wide queries are more expensive than namespace-scoped queries (see https://docs.aws.amazon.com/eks/latest/best-practices/scale-control-plane.html#_api_priority_and_fairness[here]). Similarly, although the default scheduler QPS is set to 400 for a 4XL tier, your actual pod scheduling rate depends on factors like node readiness and health. To achieve optimal performance, ensure your applications follow Kubernetes best practices and are properly configured for your workload characteristics.
196
+
When you select a Provisioned Control Plane scaling tier, the tier attributes represent the underlying configurations that Amazon EKS applies to your control plane. However, the actual performance you achieve depends on your specific workload patterns, configurations, and adherence to Kubernetes best practices. For example, while a 4XL tier configures API Priority and Fairness (APF) with 6,800 concurrent request seats, the actual request throughput you obtain from the control plane depends on the types of operations being performed. For example, Kuberentes penalizes list requests more than get, and hence the effective number of list requests processed concurrently by control plane is lower than get requests (see https://docs.aws.amazon.com/eks/latest/best-practices/scale-control-plane.html#_api_priority_and_fairness[here]). Similarly, although the default scheduler QPS is set to 400 for a 4XL tier, your actual pod scheduling rate depends on factors like nodes being ready and health for scheduling. To achieve optimal performance, ensure your applications follow Kubernetes best practices and are properly configured for your workload characteristics.
221
197
222
198
== Considerations
223
199
224
200
* *Standard control plane capacity* – EKS standard control plane mode
225
-
uses a flexible fleet of instances in the backend to offer the best
226
-
price to performance ratio. However this means that the compute
227
-
capabilities delivered can vary, so the actual performance limits are
228
-
not fixed and can fluctuate. While these values can occasionally scale
229
-
to levels comparable to the lower EKS Provisioned Control Plane tiers, they may not be available in all clusters. If your workload requires predictable, high
201
+
offers the best
202
+
price to performance ratio for the vast majority of use cases. However the performance vary as control plane scales to meet demand.
203
+
If your workload requires predictable, high
230
204
control plane performance with clearly defined minimum
231
205
capabilities, you should consider upgrading to an EKS Provisioned
232
206
Control Plane tier.
@@ -237,7 +211,7 @@ Provisioned Control Plane scaling tiers.
237
211
* *Exiting restriction* – You cannot exit an EKS Provisioned Control
238
212
Plane scaling tier if your cluster exceeds the cluster database size
239
213
limit supported in standard mode. For example, if you are using the XL
240
-
scaling tier with 16 GB of cluster database storage (etcd database
214
+
scaling tier with 14 GB of cluster database storage (etcd database
241
215
size), you cannot exit this tier until you lower the database
242
216
utilization to less than 8 GB. You must first reduce your storage
243
217
consumption below 8 GB before you can exit the XL tier.
@@ -275,6 +249,4 @@ provisioning capacity.
275
249
* *Kuberentes version and region support* – EKS Provisioned Control Plane is supported in all Amazon Web Services commercial, GovCloud, and China regions. Provisioned Control Plane works on EKS v1.28 and higher.
0 commit comments