Services that work with DataWorks

Updated at: 2024-07-17 09:47

DataWorks can work with compute engines to support end-to-end big data development and governance. DataWorks allows you to add data sources to Data Integration and then use Data Integration to transmit data between the data sources. This topic provides the services that can work with DataWorks in typical scenarios.

Supported compute engines

DataWorks allows you to add data sources of different compute engine types to a DataWorks workspace or register open source clusters to DataWorks as data sources. After you associate a data source with DataStudio, you can create tasks of the same type as the compute engine type in the DataWorks console and then enable the system to periodically schedule the tasks. DataWorks supports the following data source types:

  • MaxCompute

  • E-MapReduce (EMR)

  • Hologres

  • AnalyticDB for PostgreSQL

  • AnalyticDB for MySQL

  • CDH

  • ClickHouse

For more information about how to add a data source and associate the data source with DataStudio, see Add and manage data sources and Preparations before data development: Associate a data source or a cluster with DataStudio.

Supported data sources

DataWorks can synchronize batch data or real-time data between different data sources. You can configure clusters or instances in the following services as the data sources of DataWorks: Alibaba Cloud services and self-managed services that are related to databases, unstructured storage, big data, and message queues. You can use DataWorks to integrate data only after you configure data sources.

  • On this page (1, T)
  • Supported compute engines
  • Supported data sources
Feedback