All Products
Search
Document Center

Container Compute Service:Use the cluster inspection feature to identify potential risks

Last Updated:Aug 23, 2024

Alibaba Cloud Container Compute Service (ACS) allows you to use the cluster inspection feature of Container Intelligent Service (CIS) to identify potential risks in ACS clusters. You can use this feature to view the status of clusters and identify potential risks in the clusters, such as insufficient quotas of cloud resources or high usage of key Kubernetes resources. You can troubleshoot issues based on the suggested solutions. This topic describes how to use the cluster inspection feature to identify potential risks in your clusters.

Prerequisites

  • An ACS cluster is created. For more information, see Create an ACS cluster.

  • The cluster is in the Running state.

    To view the status of a cluster, log on to the ACS console and go to the Clusters page. If the Cluster Status column displays Running, the cluster is running as normal.

Configure the cluster inspection feature

Important

When you use the cluster inspection feature, the system runs a data collection program in the cluster to collect inspection results. The program collects the following information, the system version, loads, status of container components, and key error messages in system logs. The program does not collect business information or sensitive data.

  1. Log on to the ACS console. In the left-side navigation pane, click Clusters.

  2. On the Clusters page, find the cluster that you want to manage and click its ID. In the left-side navigation pane, choose Inspections and Diagnostics > Cluster Inspections.

  3. In the Schedule Rule section of the Cluster Inspection page, click Add.

  4. In the Set Inspection Rule pane, specify TimeZone and Regular Task Rule (Daily), read the warning and select I know and agree, and then click Save.

    After you configure the cluster inspection feature, the system inspects the cluster at the scheduled time.

    In the Reports section of the Cluster Inspection page, you can click Start to manually trigger a cluster inspection. After the inspection is complete, the results are displayed in the Reports section.

View inspection results

  1. Log on to the ACS console. In the left-side navigation pane, click Clusters.

  2. On the Clusters page, find the cluster that you want to manage and click its ID. In the left-side navigation pane, choose Inspections and Diagnostics > Cluster Inspections.

  3. In the Reports section of the Cluster Inspection page, click Details in the Operation column of the report that you want to view.

  4. On the Report page, view the risks, impacts, and suggested solutions.

    • Risks are displayed in ascending order of severity levels, including Warning, Error, and Critical. Critical-level risks are highlighted.

    • A cluster inspection report displays the identified risks and their severity levels, impacts, and solutions. For more information about common risks and the suggestions on how to handle the risks, see Check items and solutions.

What to do next

Manage the inspection rule

After you configure cluster inspection, you can perform the following operations in the Schedule Rule section:

  • Click Edit to modify the inspection rule.

  • Click Delete to delete inspection rules that you no longer use. After an inspection rule is deleted, the cluster is no longer inspected based on the rule.

References

We recommend that you use the cluster inspection feature to periodically inspect your clusters to identify potential risks. Then, you can use the cluster diagnostics feature to troubleshoot the identified issues. For more information, see Work with cluster diagnostics.