By Shantanu Kaushik
In Part 1, we discussed different real-world organizations ranging from start-ups to multi-national enterprises and how they effectively and efficiently handle traffic loads using Auto Scaling. Alibaba Cloud introduced Auto Scaling to work through traffic load fluctuations and provide organizations with an efficient way to scale their compute resources. This accommodates the changes in network loads, which saves time, energy, and money.
Alibaba Cloud Auto Scaling is a management service that is suitable for any type of business or application. Auto Scaling automatically adjusts the number of elastic compute resources based on the demands of your business and the changes in network traffic load. When business loads increase, Auto Scaling automatically adds Elastic Compute Service (ECS) instances to ensure sufficient computing capabilities for a smoother and available user experience. As these loads decrease, Alibaba Cloud Auto Scaling removes these ECS instances automatically to save costs.
As the figure above shows, Alibaba Cloud Auto Scaling gets cloud data from the real-time network load using CloudMonitor.
Quick Tip: Alibaba Cloud CloudMonitor collects metrics related to Alibaba Cloud resources. The service can detect the availability of your service and allow you to set alarms on specific metrics. CloudMonitor lets you view and comprehend cloud resource usage, along with the status and health of your infrastructure and applications.
Auto Scaling implements a CPU usage scenario to enable scaling in or out. As a default setting, as the CPU usage load passes an 80% threshold, the Auto Scaling service triggers the scale out function to introduce more ECS instances to distribute the load. Similarly, the Auto Scaling service scales in function reduces the number of ECS instances when CPU loads fall below 30%, saving a considerable amount of money.
Auto Scaling helps you write custom policies and profiles to consider and include different metrics and compute load circumstances to engage or disengage Auto Scaling.
Alibaba Cloud Auto Scaling automatically disengages the underlying resources, such as ECS instances, to prevent resource waste and reduce costs. You can configure CloudMonitor to monitor your ECS instance usage in real-time. Alibaba Cloud Auto Scaling automatically scales in ECS resources based on the scaling rules you configure.
During scale in, Auto Scaling automatically releases ECS instances. Then, these ECS instances are removed from the backend server groups of the associated Server Load Balancer (SLB) instances and the whitelists of the associated ApsaraDB RDS instances.
Quick Tip: Server Load Balancer (SLB) is responsible for routing multiple requests and spreading them over server instances in a way that provides high-performance and enables optimized resource usage. The goal here is to ensure that all the server instances can share the load evenly without worrying about any server instance or compute instance (like ECS) taking more of the load than the others.
Alibaba Cloud Auto Scaling automatically increases underlying resources as the business loads increase above the pre-defined threshold. This enables you to maintain performance and resource availability to help users execute their application processes and ensure that resources are not overloaded.
You can configure CloudMonitor to monitor your ECS instance usage in real-time to achieve this. Depending on the collected metrics from CloudMonitor, Auto Scaling scales out ECS resources automatically based on the scaling rules you configure.
During scale out, Alibaba Cloud Auto Scaling creates ECS instances automatically. Then, these ECS instances are added to the backend server groups of the associated Server Load Balancer (SLB) instances and the whitelists of the associated ApsaraDB RDS instances.
While the scale out process is in service, the ECS instances are automatically created based on the pre-defined instance configuration information of the scaling group. The instance configuration information includes the instance type, operating system, and user data to enable quick deployment without complexities.
Alibaba Cloud Auto Scaling provides a host of features and benefits for demanding usage scenarios. A usage overview is listed below:
With Alibaba Cloud Auto Scaling, you can:
Alibaba Cloud Auto Scaling works in three scaling modes:
We will discuss Alibaba Cloud Auto Scaling scaling modes in further detail in Part 3.
Alibaba Cloud Auto Scaling works in various high-demand usage scenarios:
In Part 3 of this 4-part series, we will discuss Elastic Recovery, the Auto Scaling workflow, and scaling modes.
2,599 posts | 762 followers
FollowAlibaba Clouder - April 8, 2021
Alibaba Clouder - April 9, 2021
Alibaba Clouder - April 9, 2021
Alibaba Clouder - April 7, 2021
Alibaba Clouder - April 2, 2021
Alibaba Clouder - April 6, 2021
2,599 posts | 762 followers
FollowAuto Scaling automatically adjusts computing resources based on your business cycle
Learn MoreAPI Gateway provides you with high-performance and high-availability API hosting services to deploy and release your APIs on Alibaba Cloud products.
Learn MoreSet up an all-in-one live shopping platform quickly and simply and bring the in-person shopping experience to online audiences through a fast and reliable global network
Learn MoreGet started on cloud with $1. Start your cloud innovation journey here and now.
Learn MoreMore Posts by Alibaba Clouder