A quota sets the maximum usage of a resource or the maximum QPS of a service within a time period. Quotas are commonly used to manage resource allocation and consumption. The method to calculate an Application Load Balancer (ALB) quota varies based on the resource type and resource usage. This topic describes the methods to calculate the ALB quotas related to standard ALB instances, backend server groups, listeners, and forwarding rules.
Scenario (see the preceding figure)
ALB instances use Ingresses to manage and forward external requests. Ingresses define rules that are used to forward requests to backend server groups (Service:port pairs). Then, the requests are sent to and processed by backend applications, which run in a group of pods. The mappings between ALB instances, Ingresses, backend server groups (Service:port pairs), and pods together comprise a routing system for request forwarding and load balancing.
The following table describes the methods to calculate the ALB quotas related to standard ALB instances, backend server groups, listeners, and forwarding rules.
ALB quotas related to standard ALB instances
Quota description | Name/ID | Calculation method | Scenario (see the preceding figure) |
Maximum number of additional certificates that can be added to an ALB instance (excluding default certificates) | alb_quota_loadbalancer_certificates_num_standard_edition | The maximum number of additional certificates that can be added to an ALB instance equals the total number of additional certificates that can be added to all listeners of the ALB instance. The number of additional certificates that can be added to an ALB Ingress varies based on how the certificates are configured:
|
|
Maximum number of forwarding rules that can be configured for an ALB instance (excluding default forwarding rules) | alb_quota_loadbalancer_rules_num_standard_edition | The maximum number of forwarding rules that can be configured for an ALB instance equals the total number of forwarding rules of the ALB Ingresses associated with all listeners of the ALB instance.
The number of forwarding rules of an ALB Ingress equals the number of entries listed in the |
|
Maximum number of backend servers that can be added to an ALB instance | alb_quota_loadbalancer_servers_num_standard_edition | The maximum number of backend servers that can be added to an ALB instance equals the total number of backend servers of the ALB Ingresses associated with all listeners of the ALB instance.
The number of backend servers of an ALB Ingress equals the total number of the backend pods specified in all forwarding rules of the ALB Ingress. If the ALB Ingress is associated with multiple listeners, the backend pods of the ALB Ingress are counted multiple times based on the number of listeners. |
|
Maximum number of listeners that can be added to an ALB instance | alb_quota_loadbalancer_listeners_num_standard_edition | The number of listeners that are added to an ALB instance equals the number of The number of listeners that are associated with an ALB Ingress depends on the value of the |
|
Quotas related to server groups
Quota description | Name/ID | Calculation method | Scenario (see the preceding figure) |
Maximum number of ALB server groups in which a backend server (IP address) can be specified | alb_quota_server_added_num | If a pod IP address is specified as the backend server of a Service:port pair and the Service:port pair is specified in multiple forwarding rules, the pod IP address is counted multiple times based on the number of forwarding rules. In this case, if each of the preceding forwarding rules is associated with multiple listeners, the pod IP address is also counted multiple times based on the number of listeners. |
|
Maximum number of times that an ALB server group can be associated with listeners and forwarding rules | alb_quota_servergroup_attached_num | The maximum number of times that an ALB server group (Service:port pair) can be associated with listeners and forwarding rules depends on the forwarding rules in which the ALB server group (Service:port pair) is specified. If a forwarding rule in which the ALB server group (Service:port pair) is specified is associated with multiple listeners, the ALB server group (Service:port pair) is counted multiple times based on the number of listeners. |
|
Maximum number of backend servers (IP addresses and ports) that can be added to a server group (Service:port pair) | alb_quota_servergroup_servers_num | The maximum number of backend servers (IP addresses and ports) that can be added to a server group equals the number of pod:port pairs of the server group (Service:port pair). |
|
Quotas related to listeners
Quota description | Name/ID | Calculation method | Scenario (see the preceding figure) |
Maximum number of network access control lists (ACLs) that can be associated with a listener | - |
The maximum number of network ACLs that can be associated with a listener depends on the total number of entries in the non-empty aclConfig fields of all |
|
Maximum number of network ACL entries that can be associated with a listener | - |
The maximum number of network ACL entries that can be associated with a listener depends on the total number of entries in the non-empty aclConfig fields of all |
|
Quotas related to forwarding rules
Quota description | Name/ID | Calculation method | Scenario (see the preceding figure) |
Maximum number of actions that can be specified in a forwarding rule | -- |
|
|
Maximum number of match conditions that can be specified in a forwarding rule | alb_quota_rule_matchevaluations_num | When you create or update a forwarding rule, the maximum number of match conditions that can be specified in the forwarding rule equals the sum of the number of non-empty hosts in the forwarding rule, the number of path match conditions, and the number of match conditions for the custom forwarding conditions specified by using annotations. If you set |
|
Maximum number of wildcard characters that can be used in a forwarding rule | - | When you create or update a forwarding rule, the maximum number of wildcard characters that can be used in the forwarding rule equals the total number of wildcard characters contained in the actions and match conditions specified in the forwarding rule. | ALB Ingress 2 has one forwarding rule and the host match condition in the forwarding rule has only one wildcard character, which is an asterisk (*). In this case, the maximum number of wildcard characters that can be used in the forwarding rule is one. |
Create an alert rule for a quota item
You can create alert rules for some quota items by specifying a threshold for quota usage or available quota. If the usage of a quota reaches the specified threshold, the system sends an alert notification to the callback URL that you specified in the alert rule through an HTTP POST request. We recommend that you take the alerts into consideration and apply for a quota increase in advance to avoid reconciliation failures due to quota exceedance. Such failures can result in forwarding rules or backend nodes failing to mount to the ALB. Additionally, you can monitor the status of reconciliation by using the kubectl describe
and kubectl get event
commands to view details and events related to resources such as AlbConfig, Ingresses, and Services in the cluster.
Procedure (Quota Center)
Log on to the Quota Center console.
Use one of the following methods to create an alert rule:
Method 1: On the Products with General Quotas page, click Application Load Balancer in the Networking section, and then click the General Quota tab.
Method 2: In the left-side navigation pane, click Quota Alerts. On the Quota Alerts page, click Create Quota Alert Rule. On the General Quotas page, select Application Load Balancer from the Product Name drop-down list.
On the General Quotas page, find the quota that you want to manage and click Create Alert Rule in the Actions column.
In the Create Quota Alert Rule panel, configure the parameters and click OK. The following table describes the parameters.
Parameter
Description
Alarm Rule Name
The name of the alert rule.
Alarm Metric
The metric used by the alert rule. You can select Used Quotas, Percentage of Used Quotas, Available Quotas, or Percentage of Available Quotas.
Threshold
The alert threshold. Configure this parameter based on the Alarm Metric parameter.
When the Alarm Metric parameter is set to Used Quotas, alert notifications are sent to the current Alibaba Cloud account if the quota usage of the specified quota item reaches or exceeds the threshold.
When the Alarm Metric parameter is set to Percentage of Used Quotas, alert notifications are sent to the current Alibaba Cloud account if the quota usage of the specified quota item in percentage reaches or exceeds the threshold. Valid values: [50%,100%].
When the Alarm Metric parameter is set to Available Quotas, alert notifications are sent to the current Alibaba Cloud account if the remaining quota of the specified quota item is less than or equal to the threshold.
When the Alarm Metric parameter is set to Percentage of Available Quotas, alert notifications are sent to the current Alibaba Cloud account if the percentage of the remaining quota of the specified quota item is less than or equal to the threshold. Valid values: (0%,50%].
Alarm Callback
The webhook URL to which Quota Center sends an alert notification by using an HTTP POST request.
For more information about the examples of alert callback requests and parameters in an alert callback request, see What is the request content of an alert callback?
NoteBy default, Quota Center sends an alert notification about 15 minutes after the specified alert threshold is reached.
If your alert callbacks are integrated with the webhook of a DingTalk chatbot, you must set the custom keyword of the DingTalk chatbot to Alert and then copy the webhook URL as the URL to receive alert callbacks.
In the left-side navigation pane, click Quota Alerts. On the Quota Alerts page, view the details about the alert rules.
On the Quota Alerts page, you can manage alert rules. For example, you can view, modify, and delete alert rules.
Optional. View the result of an alert callback.
If you configure the Alert Callback parameter, you can view the alert callback records and applications that are automatically submitted to increase the quota after the alert callback succeeds.
Procedure (SLB console)
Log on to the SLB console. In the left-side navigation pane, click Quota Center.
On the Quota Center page, click the ALB tab.
In the Quota Type section, click the General Quota tab, find the quota that you want to manage, and then click Create Alert Rule in the Actions column.
In the Create Quota Alert Rule panel, configure the parameters and click OK. The following table describes the parameters.
Parameter
Description
Alarm Rule Name
The name of the alert rule.
Alarm Metric
The metric used by the alert rule. You can select Used Quotas, Percentage of Used Quotas, Available Quotas, or Percentage of Available Quotas.
Threshold
The alert threshold. Configure this parameter based on the Alarm Metric parameter.
When the Alarm Metric parameter is set to Used Quotas, alert notifications are sent to the current Alibaba Cloud account if the quota usage of the specified quota item reaches or exceeds the threshold.
When the Alarm Metric parameter is set to Percentage of Used Quotas, alert notifications are sent to the current Alibaba Cloud account if the quota usage of the specified quota item in percentage reaches or exceeds the threshold. Valid values: [50%,100%].
When the Alarm Metric parameter is set to Available Quotas, alert notifications are sent to the current Alibaba Cloud account if the remaining quota of the specified quota item is less than or equal to the threshold.
When the Alarm Metric parameter is set to Percentage of Available Quotas, alert notifications are sent to the current Alibaba Cloud account if the percentage of the remaining quota of the specified quota item is less than or equal to the threshold. Valid values: (0%,50%].
Alarm Callback
The webhook URL to which Quota Center sends an alert notification by using an HTTP POST request.
For more information about the examples of alert callback requests and parameters in an alert callback request, see What is the request content of an alert callback?
NoteBy default, Quota Center sends an alert notification about 15 minutes after the specified alert threshold is reached.
If your alert callbacks are integrated with the webhook of a DingTalk chatbot, you must set the custom keyword of the DingTalk chatbot to Alert and then copy the webhook URL as the URL to receive alert callbacks.
Find the quota that you want to manage and choose
in the Actions column.In the Alerts dialog box, you can view the alert rules.