This topic describes the cluster, capacity, and concurrency limits of Alibaba Cloud Container Compute Service (ACS) and how to request a quota increase.
Overview of limits
Before you use Alibaba Cloud Container Compute Service (ACS), note the following limits:
After you create an ACS cluster, the following changes are not supported:
Change the VPC of the cluster.
Change the container network plugin.
Change the storage plugin.
Migrate applications between different namespaces.
The following traffic limits apply when you access cluster management components:
When you access cluster management components, such as API Server and etcd, through an API or the command line, the access bandwidth is restricted. If you attempt to read many cluster events at once, your request may be throttled and may fail. To avoid this issue, you can query cluster events in the ACS console by choosing Operations Management > Event Center. You can also add a paging parameter, such as
--chunk-size=500, to your API call or command to reduce the request size.
Quotas
Type | Quota limit | Default quota | How to increase the quota |
Total number of ACS clusters that can be created by a single account | 20 | 3 | Go to Quota Center to request an increase in the default quota. Select the quota named Total number of ACS clusters. Note If the quota limit is still insufficient, you can submit a ticket to contact technical support to increase your quota. Select Container Service For Kubernetes as the product category. |
Maximum total number of pay-as-you-go elastic vCPUs per region | 30,000 vCPUs | 100 vCPUs | Go to Quota Center and select the target Region to request an increase in the default quota. Note If the quota limit is still insufficient, you can submit a ticket to contact technical support to increase your quota. Select Container Compute Service as the product category. |
Capacity limits
Type | Quota limit | How to increase the quota |
etcd storage capacity | 8 GB | Cannot be increased |
Total size of each type of etcd object | 800 MB | Cannot be increased |
Pod | 50,000 | By default, requests that exceed 50,000 are rejected. You can submit a ticket to request technical support to increase your quota. |
Services | 10,000 | If you create more than 10,000 items, the request is rejected. You can submit a ticket to request technical support to increase your quota. Note If a single namespace contains many services, kubelet may inject many environment variables. This can cause pods to start slowly or fail to start. To resolve these performance bottlenecks, you can disable this injection by setting |
ConfigMap | 30,000 | You can submit a ticket to request technical support to increase your quota. |
Secret | 10,000 units | |
PVC | 10,000 | |
Persistent Volume (PV) | 10,000 | |
Total number of CRs of a single type | 100,000 | The default upper limit of custom resources that belong to the same type is 10,000. You can submit a ticket to increase the quota. |
Total number of CRDs | 100,000 | Cannot be increased |
Concurrency limits
Cluster management
Queries Per Second (QPS) is the number of requests received per second.
Type | Concurrency limit | Default concurrency | How to increase concurrency |
Pod full list QPS | 1 | 1 | Frequently pulling all pod core data places significant pressure on the management components. To reduce the amount of data pulled in a single request, optimize your code by adding a |
Pod create QPS | 300 | 300 | If the QPS exceeds 300, requests are rejected by default and return HTTP status code 429. You can submit a ticket to request technical support to increase the concurrency limit. |
Pod delete QPS | 300 | 300 | |
Pod patch QPS | 300 | 300 | |
Pod update QPS | 300 | 300 | |
Services create QPS | 20 | 20 | If the QPS exceeds 20, requests are rejected by default and an HTTP status code of 429 is returned. You can submit a ticket to increase the concurrency. |
Services delete QPS | 30 | 30 | An HTTP 429 status code is returned if the QPS exceeds 30. You can submit a ticket to request an increase in the concurrency limit. |
Services patch QPS | 30 | 30 | |
Services update QPS | 30 | 30 | |
Lease update QPS | 2000 | 2000 | If the number of queries per second (QPS) exceeds 2000, requests are rejected by default and return HTTP status code 429. You can submit a ticket to request technical support to increase your concurrency limit. |
Configmap update QPS | 200 | 200 | If the QPS exceeds 200, requests are rejected by default and return an HTTP status code of 429. You can submit a ticket to request technical support to increase the concurrency. |
Endpoint update QPS | 200 | 200 | |
Other create QPS | 500 | 500 | If the QPS exceeds 500, requests are rejected by default and return HTTP status code 429. You can submit a ticket to request technical support to increase your concurrency limit. |
Other delete QPS | 500 | 500 | |
Other patch QPS | 500 | 500 | |
Other update QPS | 500 | 500 |
Image cache
The following table lists the maximum number of times each API operation can be called per minute in a single region.
API operation | Maximum calls per minute |
CreateImageCache | 300 |
DeleteImageCache | 300 |
ListImageCaches | 600 |
GetImageCache | 6000 |
Instance delivery throughput
The following table lists the maximum throughput for instance-related operations per minute in a single region. This includes internal retries.
Instance-related capability | Maximum pod delivery throughput per minute |
Instance creation | 1000 |
Instance deletion | 1000 |
Quota limits of dependent cloud products
Limit category | Limit item | Default quota | How to increase the quota |
The maximum number of custom route entries per route table (excluding dynamically propagated route entries) | 200 | Go to the quota management page or Quota Center to request a quota increase. | |
The number of routes that are dynamically propagated to a route table. | 500 | ||
The number of vSwitches that can be created in a VPC. | 150 | ||
The number of VPCs that can be created in a region. | 10 | ||
The number of private IP addresses that can be used by cloud resources in a VPC. | 300,000 1. If an ECS instance has only one private IP address, the ECS instance uses only one network address. 2. If an ECS instance is associated with multiple ENIs or multiple IP addresses are configured for the ENIs, the number of network addresses used by the ECS instance is the sum of the IP addresses that are assigned to the ENIs associated with the ECS instance. | Cannot be increased. | |
Number of private IP addresses that a single VPC-type security group can contain for a single Alibaba Cloud account in a specific region |
| Cannot be increased. | |
Maximum number of ENIs (secondary ENIs) that a single Alibaba Cloud account can create in a specific region | View in Quota Center. | Go to Quota Center to request a quota increase. | |
Maximum number of EIPs that each Alibaba Cloud account can apply for | 20 | Go to Quota Center to request a quota increase. | |
Maximum number of CLB instances that can be created by each Alibaba Cloud account | 30 | You can request a quota increase by using one of the following methods:
| |
Maximum number of backend servers that can be added to a CLB instance | 200 | You can request a quota increase by using one of the following methods:
| |
Maximum number of listeners that can be added to a CLB instance | 50 | You can request a quota increase by using one of the following methods:
| |
Quota for the number of pay-as-you-go cloud disks for a single account across all regions | View in Quota Center. | You can submit a ticket to increase your quota. | |
Quota for the capacity of pay-as-you-go cloud disks used as data disks for a single account | The quota depends on the Elastic Compute Service usage, region, and cloud disk type. You can view the quota in Quota Center. For more information, see View or increase block storage quotas. | You can submit a ticket to increase your quota. |