Use ALB Ingresses to enable automatic application scaling based on QPS

0.0.201

Application Load Balancer (ALB) Ingresses support automatic application scaling based on the QPS values collected by the ALB instances. This ensures the stability of the application and controls resource costs. This topic describes how to use ALB Ingresses to enable automatic application scaling based on QPS.

Prerequisites

alibaba-cloud-metrics-adapter 2.3.0 or later is installed. For more information, see Implement horizontal auto scaling based on Alibaba Cloud metrics.
The ALB Ingress controller is installed. For more information, see Manage the ALB Ingress controller.
The stress testing tool Apache Benchmark is installed. For more information, see Apache Benchmark.
A Log Service project is created. For more information, see Manage a project.
Two vSwitches that are deployed in different zones of the virtual private cloud (VPC) where your cluster resides. For more information, see Create and manage a vSwitch.

Procedure

Create an application and a Service.
Create an ALB Ingress.
Create a Horizontal Pod Autoscaler (HPA).
Verify that the application can be automatically scaled based on the QPS value.

Step 1: Create an application and a Service

Create a file named tea.yaml and copy the following content to the file:

apiVersion: apps/v1
kind: Deployment
metadata:
  name: nginx-deployment-basic
  labels:
    app: tea
spec:
  replicas: 2
  selector:
    matchLabels:
      app: tea
  template:
    metadata:
      labels:
        app: tea
    spec:
      containers:
      - name: tea
        image: nginx:1.7.9
        ports:
        - containerPort: 80
---
apiVersion: v1
kind: Service
metadata:
  name: tea-svc
  namespace: default
spec:
  ports:
    - port: 80
      protocol: TCP
      targetPort: 80
  selector:
    app: tea
  type: NodePort

Run the following command to create an application and a Service:
```
kubectl apply -f tea.yaml
```

Step 2: Create an ALB Ingress

Create an AlbConfig object.
1. Create a file named alb-test-test.yaml and copy the following content to the file:
```
apiVersion: alibabacloud.com/v1
kind: AlbConfig
metadata:
  name: alb-demo
spec:
  config:
    name: alb-test
    addressType: Internet
    zoneMappings:
    - vSwitchId: vsw-uf6ccg2a9g71hx8go****
    - vSwitchId: vsw-uf6nun9tql5t8nh15****
    accessLogConfig:
      logProject: "****"
      logStore: "alb_****"
```
  - zoneMappings: Specify at least two vSwitch IDs for the ALB Ingress. The vSwitches that you specify must be deployed in different zones of the VPC where your cluster resides.
  - logProject: Specify the name of the Log Service project that you created.
  - logStore: Specify the name of the Logstore that you want to use. The value of logStore must start with alb_. If the Logstore that you specify does not exist, the system automatically creates one with the name that you specified.
2. Run the following command to create an AlbConfig object:
```
kubectl apply -f alb-test.yaml
```

Create an IngressClass.

Create a file named alb.yaml and copy the following content to the file:

apiVersion: networking.k8s.io/v1
kind: IngressClass
metadata:
  name: alb
spec:
  controller: ingress.k8s.alibabacloud/alb
  parameters:
    apiGroup: alibabacloud.com
    kind: AlbConfig
    name: alb-demo

Run the following command to create an IngressClass:
```
kubectl apply -f alb.yaml
```

Create an ALB Ingress.

Create a file named tea-ingress.yaml and copy the following content to the file:

apiVersion: networking.k8s.io/v1
kind: Ingress
metadata:
  name: tea-ingress
spec:
  ingressClassName: alb
  rules:
   - host: demo.ingress.top
     http:
      paths:
      - path: /tea
        pathType: Prefix
        backend:
          service:
            name: tea-svc
            port:
              number: 80

Run the following command to create an ALB Ingress:
```
kubectl apply -f tea-ingress.yaml
```

Run the following command to query the ADDRESS parameter of the ALB Ingress:

kubectl get ingress

Expected output:

NAME                    CLASS   HOSTS                     ADDRESS                                              PORTS   AGE
tea-ingress             alb     demo.ingress.top          alb-110zvs5nhsvfv*****.cn-chengdu.alb.aliyuncs.com   80      7m5s

Step 3: Create an HPA

Create a file named hpa.yaml and copy the following content to the file:

apiVersion: autoscaling/v2beta2
kind: HorizontalPodAutoscaler
metadata:
  name: ingress-hpa
spec:
  scaleTargetRef:
    apiVersion: apps/v1
    kind: Deployment
    name: nginx-deployment-basic
  minReplicas: 2
  maxReplicas: 10
  metrics:
    - type: External
      external:
        metric:
          name: sls_alb_ingress_qps
          # The sls_alb_ingress_qps metric is used to configure auto scaling based on QPS values. 
          selector:
            matchLabels:
              sls.project: "****"    # Replace the value of sls.project with the Log Service project that you want to use. 
              sls.logstore: "alb_****"     # Replace the value with the Logstore that you want to use. 
              sls.ingress.route: "default-tea-svc-80"
              # Specify the value of the sls.ingress.route parameter in the <namespace>-<svc>-<port> format. Example: default-nginx-80. 
        target:
          type: AverageValue
          # The type parameter is set to AverageValue, which indicates that the average QPS value of each pod is used to determine whether to perform scaling activities. 
          averageValue: 2

Run the following command to create an HPA:
```
kubectl apply -f hpa.yaml
```

Run the following command to query information about the HPA:

kubectl get hpa

Expected output:

NAME          REFERENCE                           TARGETS     MINPODS   MAXPODS   REPLICAS   AGE
ingress-hpa   Deployment/nginx-deployment-basic   0/2 (avg)   2         10        2          4h34m

Run the following command to query information about the HPA:

kubectl describe hpa ingress-hpa

Expected output:

Name:                                            ingress-hpa
Namespace:                                       default
Labels:                                          <none>
Annotations:                                     <none>
CreationTimestamp:                               Tue, 31 Jan 2023 11:35:01 +0800
Reference:                                       Deployment/nginx-deployment-basic
Metrics:                                         ( current / target )
"sls_alb_ingress_qps" (target average value):    0 / 2
Min replicas:                                    2
Max replicas:                                    10
Deployment pods:                                 2 current / 2 desired

Step 4: Verify that the application can be automatically scaled based on the QPS value

Verify that the application can be automatically scaled out based on the QPS value.
1. Run the following commands to perform stress tests on the application:
```
ab -c 5 -n 5000 -H Host:demo.ingress.top http://alb-110zvs5nhsvfv*****.cn-chengdu.alb.aliyuncs.com/tea
```
2. After the stress test is complete, run the following command to check whether the application is scaled out:
```
kubectl get hpa
```
  Expected output:
```
NAME          REFERENCE                           TARGETS           MINPODS   MAXPODS   REPLICAS   AGE
ingress-hpa   Deployment/nginx-deployment-basic   12500m/2 (avg)   2         10        10         15m
```
  The value of the REPLICAS parameter is 10. This indicates that the number of pods created for the application is increased to 10 as the QPS value increases.
Verify that the application can be automatically scaled in based on the QPS value.
After the stress test is complete, the QPS value is decreased to 0, which is below the scale-in threshold. HPA automatically scales in the application.
After the stress test is complete, run the following command to check whether the application is scaled in:
```
kubectl get hpa
```
Expected output:
```
NAME          REFERENCE                           TARGETS           MINPODS   MAXPODS   REPLICAS   AGE
ingress-hpa   Deployment/nginx-deployment-basic   0/2 (avg)         2         10         2         60m
```
The value of the REPLICAS parameter is 2. This indicates that the number of pods created for the application is decreased as the QPS value decreases.

Feedback

Previous: Configure certificates for encrypted communication over HTTPSNext: Manage SSL certificates associated with an HTTPS listener

On this page （1, T）

Prerequisites

Procedure

Step 1: Create an application and a Service

Step 2: Create an ALB Ingress

Step 3: Create an HPA

Step 4: Verify that the application can be automatically scaled based on the QPS value

Chat now with Alibaba Cloud Customer Service to assist you in finding the right products and services to meet your needs.

Prerequisites

Procedure

Step 1: Create an application and a Service

Step 2: Create an ALB Ingress

Step 3: Create an HPA

Step 4: Verify that the application can be automatically scaled based on the QPS value

Sales Support

Technical Support

Connect & Report Abuse

About Alibaba Cloud

Our Global Network

Quick Start

Global Offices

Olympic Games Paris 2024 New

Stade Roland Garros – Glitz from the Past New

Place de la Concorde – “Breaking” the Barriers New

Vaires-sur-Marne Nautical Stadium – Sports with Sustainability New

International Broadcast Center – Images, Sounds, and Data that Captivate Billions New

Customer Success Stories New

Trust Center

Security & Compliance Center

Cloud Compliance Resources

Security Compliance FAQs

Product & Feature Update New

Cloud Forward

Press Room

Alibaba Cloud e-Magazine New

Alibaba Cloud in Analyst Research

Notice

Go Global Service New

Go Global Alliance with Alibaba Cloud

Asia Accelerator Hot

Information Compliance

China Gateway - MLPS 2.0 Compliance New

China Gateway - Networking

China Gateway - Global Application Acceleration New

China Gateway - Security

China Gateway - Data Security New

ICP Support Hot

China Gateway - Omnichannel Data Mid-End New

China Gateway - Organizational Data Mid-End New

China Gateway - Business Mid-End New

China Gateway - AI Service for Conversational Chatbots New

China Gateway - Online Education

China Gateway - Domain Registration

Work at Alibaba Cloud

Experienced Professionals

Students and Graduates

Free Trial

Pricing

Promo Center

Price Reduction

Pay Less and Deploy More

FinOps

Elastic Compute Service (ECS)

Simple Application Server (SAS)

Elastic GPU Service

Elastic Desktop Service (EDS)

Object Storage Service (OSS)

Cloud Enterprise Network (CEN)

Web Application Firewall (WAF)

Domain Names

Container Compute Service (ACS)

Secure Access Service Edge (SASE)

Intelligent Media Services(IMS)

Edge Security Acceleration (ESA)(Original DCDN)

Intelligent Media Management

DingTalk Enterprise

YiDA

Alibaba Cloud Model Studio

Apsara Prime - For Easy Cloud Product Selection

Alibaba Cloud ECS - Cater All Your Cloud Hosting Needs

1TB CDN—Get Free 1 TB Outbound Traffic Plan Now

Security—Under Attack? Get Free Security Support

Short Message Service - Free Testing is Available

Elastic Compute Service (ECS) Hot

CloudBox

Compute Nest

Dedicated Host Hot

ECS Bare Metal Instance

Elastic GPU Service Featured

Simple Application Server (SAS) Hot

Auto Scaling

Cloud Phone Beta

Elastic Desktop Service (EDS) Featured

Batch Compute

Elastic High Performance Computing (E-HPC)

Super Computing Cluster (SCC)

Function Compute (FC)