The first time you activate DataWorks in a region, you must purchase a specific DataWorks edition and a pay-as-you-go serverless resource group. After you activate DataWorks, the other pay-as-you-go resources and features provided by DataWorks, such as intelligent monitoring, data quality monitoring, and APIs, are enabled by default. You are charged for the resources and features based on your actual usage. This topic describes how to activate DataWorks.
Precautions
Region: You can purchase an edition and a resource group of DataWorks in a region. If you want to use the service capabilities that are provided by the edition and the resource group in multiple regions, you must purchase the edition and the resource group in the regions.
Related engines: The first time you activate DataWorks in a region, the system automatically activates MaxCompute (pay-as-you-go) and creates the AliyunServiceRoleForDataWorksEngine and AliyunServiceRoleForDataWorksOnEmr service-linked roles in the region. This helps you quickly experience the core scenarios of the big data platform. If you do not use MaxCompute, no related fees are generated.
Prerequisites
Related accounts are prepared based on your business requirements before you activate DataWorks. The following accounts are required:
Alibaba Cloud account (recommended): If you use an Alibaba Cloud account to activate DataWorks in a region, you do not need to activate DataWorks in the region again. For more information about how to prepare an Alibaba Cloud account, see Prepare an Alibaba Cloud account.
RAM user: If you use a RAM user to activate DataWorks, you must attach the
AliyunBSSOrderAccess
andAliyunDataWorksFullAccess
policies to the RAM user. After you attach the policies to a RAM user, the RAM user has higher permissions. Exercise caution when you attach the policies to a RAM user. For more information about how to prepare a RAM user and grant permissions to a RAM user, see Prepare a RAM user.
Activate DataWorks
When you activate DataWorks, you must select the region where you want to deploy the required services and resources, a DataWorks edition, a subscription duration, and the virtual private cloud (VPC) with which you want to associate the resources. This section describes the common procedure for activating DataWorks.
Go to the Workspaces page in the DataWorks console. In the top navigation bar, switch to the region in which you want to activate DataWorks to check whether DataWorks is activated in the region.
If DataWorks is activated in the current region, you can create a workspace. For more information, see Create a workspace.
If DataWorks is not activated in the current region, you can click Purchase Product Portfolio for Free to enable the system to activate DataWorks.
Configure the parameters for the services and resources that you want to purchase.
Configure the parameters as prompted. The following table describes the parameters.
Parameter
Description
Region
Select the region in which you want to activate DataWorks.
DataWorks Edition
Select the DataWorks edition that you want to purchase.
NoteYou can select an appropriate DataWorks edition based on your business requirements. For more information, see the Feature comparison section in the "Differences among DataWorks editions" topic.
Subscription Duration
Select a DataWorks edition and configure the Subscription Duration parameter.
After you confirm that the configurations are correct, read the terms of service and select the check box for Terms of Service.
Confirm the order.
Click Confirm Order and Pay. In the Verify Resources dialog box, view the details of the order.
After the resources are verified, click Next: View Price List to confirm the price of the order.
NoteThe price list includes the fees for the DataWorks edition, the serverless resource group, MaxCompute, and other DataWorks pay-as-you-go resources and features.
In the Price List dialog box, confirm the price and click Next Step: Create Order.
On the Purchase page, click Purchase.
What to do next
Experience the use cases
The first time you activate DataWorks in a region, the system automatically generates a default DataWorks workspace. You can experience use cases in the default workspace. For more information, see Built-in logic of a default workspace.
Develop tasks
Before you develop tasks in DataWorks, we recommend that you create a custom workspace, select a compute engine type based on your business requirements, and add a data source or register a cluster of the compute engine type to DataWorks.
A workspace is the basic unit for task development and member permission management in DataWorks. All data development operations must be performed in a specific workspace. For more information about how to create a workspace, see Create a workspace.
You can develop tasks in DataWorks based only on compute engines. You can add a data source of a specific compute engine type or register a cluster of a specific compute engine type to your workspace. For more information, see Add and manage data sources.
References
A serverless resource group is a general-purpose resource group that can be used to run data synchronization, data computing, data scheduling, and DataService Studio tasks. For more information, see Create and use a serverless resource group.
DataWorks also provides more features and resources, such as intelligent data modeling and advanced analysis. You can enable the features based on your business requirements. For more information, see Billing overview.
For more information about DataWorks, see What is DataWorks?