Area | Description |
1 | Solution: You can create a solution to manage multiple workflows. A workflow can be added to one or more solutions. Solutions can be displayed as a list or as cards.
Business Flow: A workflow is an abstract business entity that you can use to organize code development operations based on your business requirements.
Click the icon to show all solutions or workflows in a workspace. |
2 | Refresh: After you modify a workflow or solution, you can click this icon to refresh the Scheduled Workflow pane.
Locate: You can click this icon to find the node whose configuration tab is displayed on the right side of the current page.
Search Code: You can click this icon to search for a code snippet by using keywords. The search returns all nodes that contain the code snippet in the Scheduled Workflow, Manually Triggered Workflows, Ad Hoc Query, and Recycle Bin panes, and you can view the details of the code snippet in a centralized manner. You can also use this feature to identify the node that causes changes to a table.
Batch Operation: You can click this icon to modify the configurations of multiple tables, resources, or functions at a time. The configurations include the owner, compute engine instance, resource group for scheduling, rerun properties, scheduling type, scheduling cycle, and scheduling timeout period.
Import Data: You can click this icon to upload data from an on-premises file to a table in DataWorks. Only MaxCompute tables are supported as the destination.
Create: You can click Create to quickly create a workflow, node, table, resource, or function.
Solution and workflow directory trees:
  All: This directory tree displays all created objects, including nodes, resources, and functions, in the current workspace by solution and workflow.
  Owned by Me: This directory tree displays the objects, including nodes, resources, and functions, that are owned by the current account by solution and workflow.
  My Favorites: This directory tree displays the objects, including nodes, resources, and functions, that are added to favorites by the current account by solution and workflow.
Node search:
  Exact search: You can enter the name of a node or the identifier of a node creator in the search box and click the icon to search for the node.
  Search by node type: You can click the icon to specify the types of nodes that you want to search for. After you specify a node type, the directory tree displays only nodes of the specified type in the current workspace.
Note: You can determine whether to hide compute engine instances or node folders based on your business requirements. After you select Hide Engine Instances or Hide Node Folders, compute engine instances or node folders are not displayed in the directory tree. Hide Engine Instances and Hide Node Folders are applicable only to workflows of the latest version. In most cases, if a compute engine contains only one compute engine instance, we recommend that you hide the compute engine instance. If you do not need to use node folders, such as Data Analytics, Table, Resource, and Function, you can hide them.
Note: Before you perform data development operations in a new workspace, you must create a workflow and a node in the workflow. For more information about how to create a workflow, see Create a workflow. |
3 | In this area, you can use a directory tree to manage nodes, tables, resources, and functions in each workflow.
Workflow: the unit for business development.
Node: the smallest unit for code development. You can develop code by node type, such as engine nodes, algorithm nodes, Data Integration nodes, database nodes, or general nodes.
Table: You can manage tables in DataStudio in a visualized manner.
Resource: You can upload resources in DataStudio in a visualized manner.
Note: You can upload resources of only the MaxCompute, E-MapReduce (EMR), and Cloudera's Distribution including Apache Hadoop (CDH) compute engines in a visualized manner.
Function: You can register functions in a visualized manner.
Note: You can register functions of only the MaxCompute, EMR, and CDH compute engines in a visualized manner.
The icon before the name of a node indicates the status of the node:
  icon: indicates that the current version of the node is not committed. You can click this icon to commit the node.
  icon: indicates that the node is not deployed. You can click this icon to deploy the node.
The time when the node was last edited is displayed after the node name.
You can double-click the name of a workflow to go to the configuration tab of the workflow, as shown in Area 5 to Area 8. On this tab, you can perform data development operations. |
4 | Resource Group Orchestration: You can click this icon to change the resource groups for scheduling that are used by multiple nodes in a workflow during data development. If your workspace uses multiple resource groups for scheduling, you can use this feature to reassign the nodes in the workspace to different resource groups based on your business requirements. This helps you improve resource utilization. After you change the resource groups for scheduling used by multiple nodes, you must deploy the nodes to the production environment for the change to take effect there. |
5 | Common Nodes: This section displays the common types of nodes in the current workspace. This helps you quickly select a node type and create a node.
Node Group: You can use this feature to reference a set of nodes across workflows. You can add nodes that are frequently used in a workflow to a node group and reuse the node group in other workflows.
Quick node creation: You can drag nodes in sections, such as Data Integration, MaxCompute, and EMR, to the right-side canvas of a workflow to create the nodes in the workflow.
|
6 | Tools on the canvas:
Switch Layout: You can click this icon to switch the layout of the canvas to Vertical, Horizontal, or Grid.
Box: You can click this icon to select nodes to form a node group and perform operations on the node group to manage selected nodes.
Refresh: After you modify a workflow, you can click this icon to refresh the workflow.
Format: You can click this icon to horizontally align the nodes on the canvas.
Adapt: You can click this icon to adapt the current workflow layout to the size of the canvas.
Center: You can click this icon to center nodes on the canvas.
1:1: You can click this icon to change the scale of the directed acyclic graph (DAG) of nodes to 100%.
Zoom In: You can click this icon to zoom in on the nodes in the current workflow.
Zoom Out: You can click this icon to zoom out on the nodes in the current workflow.
Search: You can click this icon and enter a keyword in the search box to search for a node whose name contains the keyword.
Note: Fuzzy match is supported. After you enter a keyword, DataWorks displays all nodes whose names contain the keyword in the current workflow.
Toggle Full Screen View: You can click this icon to view the current workflow in full screen.
Hide Engine Information: You can click this icon to show or hide the engine information of each node.
|
7 | Tabs in the right-side navigation pane:
Workflow Parameters: You can click this tab and assign a value to a variable that is used in the code of all ODPS SQL nodes in the current workflow at a time.
Change History: You can click this tab and view the operation records of nodes in the current workflow.
Versions: Each time nodes in the workflow are committed, a new version is generated for the workflow. You can click this tab and view all versions and the details of each version.
|
8 | Tools in the toolbar and tools above the configuration tab:
Submit: You can click this icon to commit one or more updated nodes in the current workflow to the Deploy page.
Run: You can click this icon to run all nodes in the current workflow.
Stop: If the current workflow is running, you can click this icon to stop the nodes from running in the workflow.
Deploy: You can click this icon to go to the Deploy page and view the nodes to be deployed in the current workflow. Then, you can deploy nodes based on your business requirements.
Operation Center: You can click this icon to go to Operation Center in the production environment to view the O&M details of nodes in the current workflow.
View opened configuration tabs: If you have opened multiple configuration tabs on the DataStudio page, you can click the icon to view all open configuration tabs in the drop-down list.
Close opened configuration tabs: You can click the icon to close one or more configuration tabs.
|
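
The directory tree views in Area 2 (All, Owned by Me, My Favorites), the search by node type, and the fuzzy name search on the canvas in Area 6 can be pictured as simple filters over a list of nodes. The following Python sketch is only a conceptual model of that behavior and is not a DataWorks API. The Node class, the filter_tree and search_canvas functions, the sample node names, and the case-sensitive matching are all assumptions made for illustration.

```python
from dataclasses import dataclass
from typing import Iterable, List, Optional, Set


@dataclass(frozen=True)
class Node:
    """Hypothetical stand-in for a node shown in the directory tree."""
    name: str
    node_type: str      # illustrative label such as "ODPS SQL"
    owner: str
    favorite: bool = False


def filter_tree(nodes: Iterable[Node], view: str = "All",
                current_user: Optional[str] = None,
                node_types: Optional[Set[str]] = None) -> List[Node]:
    """Apply a directory-tree view, then an optional node-type filter."""
    if view == "All":
        selected = list(nodes)
    elif view == "Owned by Me":
        selected = [n for n in nodes if n.owner == current_user]
    elif view == "My Favorites":
        selected = [n for n in nodes if n.favorite]
    else:
        raise ValueError(f"unknown view: {view!r}")
    if node_types:
        # Keep only nodes of the selected types, as in "Search by node type".
        selected = [n for n in selected if n.node_type in node_types]
    return selected


def search_canvas(nodes: Iterable[Node], keyword: str) -> List[Node]:
    """Fuzzy match: keep nodes whose names contain the keyword.

    Case-sensitive matching is an assumption of this sketch.
    """
    return [n for n in nodes if keyword in n.name]


# Made-up sample data for illustration only.
SAMPLE = [
    Node("ods_orders_sync", "Data Integration", "alice"),
    Node("dwd_orders_clean", "ODPS SQL", "alice", favorite=True),
    Node("dws_orders_daily", "ODPS SQL", "bob"),
]

mine = filter_tree(SAMPLE, view="Owned by Me", current_user="alice")
print([n.name for n in mine])
print([n.name for n in search_canvas(SAMPLE, "orders_d")])
```

In this model, a view and a node-type filter combine by intersection, which matches the description that the directory tree displays only nodes of the specified type after a type is selected.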