All Products
Search
Document Center

:Work with cluster diagnostics

Last Updated:Aug 23, 2024

Container Intelligent Service provides the cluster diagnostics feature to allow you to diagnose pods, Services, and Ingresses with one click and help you locate the cause. This topic describes how to use the cluster diagnostics feature in an Alibaba Cloud Container Compute Service (ACS) cluster.

Prerequisites

  • An ACS cluster is created. For more information, see Create an ACS cluster.

  • The ACS cluster is in the Running state.

    Note

    You can log on to the ACS console, go to the Clusters page, and verify that the Cluster Status column displays Running.

Introduction to cluster diagnostics

Container Intelligent Service provides the following diagnostic items.

Diagnostic item

Description

Pod diagnostics

Diagnose pod status issues, such as pod startup failures or frequent pod restarts.

Service diagnostics

Diagnose Service issues, such as Service configurations, resource quotas, and abnormal events.

Ingress diagnosis

Diagnose Ingress-related issues in traffic routing configurations.

Configure diagnostics

Important

When you use the cluster diagnostics feature, the system runs a data collection program on each node in the cluster and collects diagnostic results. The program collects the following information, the system version, loads, status of container components, and key error messages in system logs. The program does not collect business information or sensitive data.

The procedures for configuring pod, Service, and Ingress diagnostics are similar. The following section uses pod diagnostics as an example to demonstrate how to configure the diagnostics feature.

  1. Log on to the ACS console. In the left-side navigation pane, click Clusters.

  2. On the Clusters page, click the name of the cluster that you want to manage. In the left-side navigation pane, choose Inspections and Diagnostics > Diagnostics and follow the on-screen instructions to complete authorization.

  3. On the Diagnosis page, click Pod diagnosis.

  4. On the Pod diagnosis page, click Diagnosis.

  5. In the Select Pod panel, specify Namespace and Pod, read the warning and select I know and agree, and then click Create diagnosis.

    You can view the diagnostic progress on the page. After the diagnostic is complete, the page displays the diagnostic results and diagnostic items. You can check the cause and fix the issues.

View diagnostic results

On the Pod diagnosis page, click Diagnosis details in the Operation column of the diagnostic report in the list to view the diagnostic results on the details page.

Note

The diagnostic items may vary based on the cluster configuration. The actual diagnostic items on the diagnostic page shall prevail.

Diagnostic item

Flag

Description

Pod diagnostics

  • Normal 正常: No operations are required.

  • Warning 警告: Confirm the severity of the issue. Troubleshoot the issues that may cause cluster anomalies.

  • Abnormal 异常: Troubleshoot the issues at the earliest opportunity to avoid cluster errors.

  • Unknown 未知: The diagnostic is not completed or the diagnostic result is unknown.

Pod diagnostics consist of the Pod and ClusterComponent diagnostic items. The system checks for pod anomalies based on pod status and cluster component status. On the diagnostic details page, you can view the pod diagnostic results, repair suggestions, and diagnostic items.

Move the pointer over the 详情图标.png icon to the right of a diagnostic item to view information about the diagnostic item.

Diagnostic items with the Abnormal or Warning flag are displayed on the Troubleshoot tab.

When a diagnostic item displays the Abnormal flag, you can move the pointer over Details in the Status column to view details about the issue.

Service diagnostics

Service diagnostics consist of the Service and ResourceQuotas diagnostic items. These diagnostic items help you identify Service anomalies based on the billing method of Classic Load Balancer (CLB) instances, certificates, quotas, and abnormal events.

Move the pointer over the 详情图标.png icon to the right of a diagnostic item to view information about the diagnostic item.

Diagnostic items with the Abnormal or Warning flag are displayed on the Troubleshoot tab.

When a diagnostic item displays the Abnormal flag, you can move the pointer over Details in the Status column to view details about the issue.

Ingress diagnostics

Ingress diagnostics consist of the Ingress, Addon, and SLB diagnostic items. These diagnostic items help you identify Ingress anomalies based on the status of Ingresses, Ingress plug-ins, and Server Load Balancer (SLB) instances.

Move the pointer over the 详情图标.png icon to the right of a diagnostic item to view information about the diagnostic item.

Diagnostic items with the Abnormal or Warning flag are displayed on the Troubleshoot tab.

When a diagnostic item displays the Abnormal flag, you can move the pointer over Details in the Status column to view details about the issue.