DataWorks: Manage real-time compute tasks

Last Updated: Oct 28, 2024

The Real-time Computing Nodes page of the DataWorks console displays all real-time compute tasks. On this page, you can view the basic information and running information about the tasks. You can also configure alert rules for the tasks that you want to monitor so that you can identify and handle exceptions promptly when an error occurs on a task.

Limits

DataWorks supports O&M only for EMR Spark Streaming and EMR Streaming SQL tasks.

Go to the Real-time Computing Nodes page

  1. Log on to the DataWorks console. In the top navigation bar, select the desired region. In the left-side navigation pane, choose Data Development and Governance > Operation Center. On the page that appears, select the desired workspace from the drop-down list and click Go to Operation Center.

  2. In the left-side navigation pane of the Operation Center page, choose Real-time Node O&M > Real-time Computing Nodes.

View the real-time compute tasks in the task list

The Stream Task page displays all real-time compute tasks in the production environment. You can view basic information about a real-time compute task, and you can start, stop, or undeploy the task. You can also configure alert rules for a real-time compute task.

Figure: Real-time compute tasks

Section

Description

1

In this section, you can search for a real-time compute task by task ID or task name. You can also filter tasks by the following conditions: Status, Cluster Type, Cluster Name, Task Type, and Responsible Person.

Note
  • If you search for tasks by task name or task ID, the search is combined with the other filter conditions that you specified. Only tasks that meet all the specified conditions are displayed.

  • If you search for a real-time compute task by task name, fuzzy match is supported. After you enter a keyword, all real-time compute tasks whose names contain the keyword are displayed.

2

In this section, you can view the basic information and details of a real-time compute task.

  • Node Name: the name of the real-time compute task. You can click the task name to go to the details page of the task.

  • Node ID: the ID of the real-time compute task.

  • Status: the status of the real-time compute task. The task can be in one of the following states: Not Running, Generating, Pending (Resources), Starting, Running, Stopped, Failed, Restarting, and Undeploying.

  • Start running: the time when the real-time compute task started to run.

  • End Running: the time when the real-time compute task finished running.

  • Runtime: the running duration of the real-time compute task. Unit: seconds.

  • Task type: the type of the real-time compute task.

    Note

    DataWorks supports O&M only for EMR Spark Streaming and EMR Streaming SQL tasks.

  • Cluster Type: the type of the cluster in which the real-time compute task runs. DataWorks supports O&M only for real-time compute tasks that run in E-MapReduce (EMR) clusters.

  • Cluster Name: the name of the EMR cluster in which the real-time compute task runs.

  • Deployed At: the time when the real-time compute task was deployed. The time is in the yyyy-MM-dd HH:mm:ss format.

  • Responsible Person: the owner of the workspace to which the real-time compute task belongs.

Note

You can sort all real-time compute tasks in ascending or descending order by Start running, End Running, or Runtime.

3

In this section, you can perform operations on a single real-time compute task, such as starting, stopping, or undeploying the task, or configuring alert rules for it.

4

In this section, you can perform an operation on multiple real-time compute tasks at a time. You can select multiple tasks and click Start, Terminate, Undeploy, or Monitoring Setting to perform the related operation on these tasks.
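
The search, filter, and sort behavior described for sections 1 and 2 can be sketched in a few lines of Python. This is only an illustration of the semantics, not DataWorks code: the Task record, its field names, the matches helper, and the sample data are all hypothetical, and the sketch assumes that the keyword match is a case-sensitive substring test, which the page does not state.

```python
from dataclasses import dataclass
from datetime import datetime

# Python equivalent of the yyyy-MM-dd HH:mm:ss format used by Deployed At.
TIME_FORMAT = "%Y-%m-%d %H:%M:%S"


@dataclass
class Task:
    # Hypothetical record that mirrors a few columns of the task list.
    node_id: int
    node_name: str
    status: str
    task_type: str
    deployed_at: str
    runtime_seconds: int


def matches(task, keyword=None, node_id=None, **filters):
    """Return True only if the task meets every condition that is given."""
    # Fuzzy match: the task name must contain the keyword (assumed case-sensitive).
    if keyword is not None and keyword not in task.node_name:
        return False
    if node_id is not None and task.node_id != node_id:
        return False
    # All remaining filter conditions (for example status) must also hold.
    return all(getattr(task, name) == value for name, value in filters.items())


# Made-up sample data for illustration only.
tasks = [
    Task(101, "orders_stream", "Running", "EMR Spark Streaming", "2024-10-01 08:00:00", 7200),
    Task(102, "orders_sql", "Stopped", "EMR Streaming SQL", "2024-10-03 12:30:00", 360),
    Task(103, "clicks_stream", "Running", "EMR Streaming SQL", "2024-10-02 23:15:00", 5400),
]

# Keyword search combined with a Status filter: both conditions must be met.
running_orders = [t for t in tasks if matches(t, keyword="orders", status="Running")]

# Sort by Runtime in ascending order; pass reverse=True for descending order.
by_runtime_asc = sorted(tasks, key=lambda t: t.runtime_seconds)

# Parse a Deployed At value into a datetime object.
deployed = datetime.strptime(tasks[0].deployed_at, TIME_FORMAT)

print([t.node_name for t in running_orders])
print([t.node_id for t in by_runtime_asc])
print(deployed.isoformat())
```

In this sketch, orders_sql is excluded from the result even though its name contains the keyword, because it does not meet the Status condition. This mirrors the note that only tasks that meet all specified conditions are displayed.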

View the details of a real-time compute task

On the Stream Task page, find the real-time compute task whose details you want to view and click the task name to open the details panel of the task. In the panel, you can view the details, run logs, operation logs, and code of the task on the Details, Runtime Log, Operation Log, and Code tabs. You can use this information to trace the operations that were performed on the task, obtain details about the errors reported for the task, and fix the errors.

Figure: Real-time compute task details page