Metric | Definition | Description | Unit | Supported connector |
Num of Restarts | The number of times that a deployment is restarted when a deployment failover occurs. | This metric indicates the number of times that a deployment is restarted when a deployment failover occurs. The number of times that the deployment is restarted when the JobManager failover occurs is excluded. This metric is used to check the availability and status of the deployment. | Count | N/A |
current Emit Event Time Lag | The processing latency. | If the value of this metric is large, a data latency may occur in the deployment when the system pulls or processes data. | Milliseconds | |
current Fetch Event Time Lag | The transmission latency. | If the value of this metric is large, a data latency may occur in the deployment when the system pulls data. In this case, you must check the information about the network I/O or the source. You can analyze the data processing capabilities of a source based on the difference between the values of this metric and the currentEmitEventTimeLag metric. The difference between the values of the two metrics indicates the duration for which the data is retained in the source. The processing mechanism varies based on whether miniBatch is enabled: If the difference between the values of the two metrics is small, the source does not efficiently pull data from the external system due to issues related to network I/O or parallelism. If the difference between the values of the two metrics is large, the processing capability of the deployment is insufficient. This leads to data retention in the source. To resolve this issue, perform the following steps: On the Deployments page, find the deployment that you want to manage and click its name. In the deployment details panel, click the Status tab. On the Status tab, click the value in the Name column. On the page that appears, click the BackPressure tab to locate the Vertex topology that causes the issue. Then, on the BackPressure tab, click Dump in the Thread Dump column to go to the Thread Dump tab to analyze the stack that has a performance bottleneck.
| Milliseconds | |
numRecordsIn | The total number of input data records of all operators. | If the value of this metric does not increase for a long period of time for an operator, data may be missing from the source. In this case, you must check the data of the source. | Count | All built-in connectors |
numRecordsOut | The total number of output data records. | If the value of this metric does not increase for a long period of time for an operator, an error may occur in the code logic of the deployment and data is missing. In this case, you must check the code logic of the deployment. | Count | All built-in connectors |
numRecordsInofSource | The total number of data records that flow into the source operator in each operator. | This metric is used to check the number of data records that flow into the source. | Count | Kafka MaxCompute Incremental MaxCompute ApsaraMQ for RocketMQ Simple Log Service DataHub ElasticSearch Hologres
|
numRecordsOutOfSink | The total number of output data records in the sink. | This metric is used to check the number of data records that are exported by the source. | Count | Kafka Simple Log Service DataHub Hologres ApsaraDB for HBase Tablestore ApsaraDB for Redis
|
numRecordsInPerSecond | The number of input data records per second for all data streams. | This metric is used to monitor the overall data stream processing speed. For example, you can use the value of the numRecordsInPerSecond metric to check whether the overall data stream processing speed meets the expected requirements and how the deployment performance changes under different input data loads. | Data records/s | All built-in connectors |
numRecordsOutPerSecond | The number of output data records per second for all data streams. | This metric is used to monitor the number of output data records per second for all data streams. This metric is also used to monitor the overall data stream output speed. For example, you can use the value of the numRecordsOutPerSecond metric to check whether the overall data stream output speed meets the expected requirements and how the deployment performance changes under different output data loads. | Data records/s | All connectors |
numRecordsInOfSourcePerSecond (IN RPS) | The number of input data records per second in a source. | This metric is used to monitor the number of input data records per second in a source and monitor the speed at which data records are generated in the source. For example, the number of data records that can be generated varies based on the type of the data source. You can use the value of the numRecordsInOfSourcePerSecond metric to check the speed at which data records are generated in a source and adjust data streams to improve performance. This metric is also used for monitoring and alerting. If the value of this metric is 0, data may be missing from the source. In this case, you must check whether data output is blocked because the data of the source is not consumed. | Data records/s | Kafka MaxCompute Incremental MaxCompute ApsaraMQ for RocketMQ Simple Log Service DataHub ElasticSearch Hologres
|
numRecordsOutOfSinkPerSecond (OUT RPS) | The number of output data records per second in a sink. | This metric is used to monitor the number of output data records per second in a sink and monitor the speed at which data records are exported from the sink. For example, the number of data records that can be exported varies based on the type of the sink. You can use the value of the numRecordsOutOfSinkPerSecond metric to check the speed at which data records are exported from a sink and adjust data streams to improve performance. This metric is also used for monitoring and alerting. If the value of this metric is 0, all data is filtered due to a defect in the code logic of the deployment. In this case, you must check the code logic of the deployment. | Data records/s | Kafka MaxCompute Incremental MaxCompute Simple Log Service DataHub Hologres ApsaraDB for HBase Tablestore ApsaraDB for Redis
|
pendingRecords | The number of data records that are not read by the source. | This metric is used to check the number of data records that are not pulled by the source from the external system. | Count | |
sourceIdleTime | The duration for which data is not processed in the source. | This metric specifies whether the source is idle. If the value of this metric is large, your data is generated at a low rate in the external system. | Milliseconds | |