Data Lake Analytics (DLA) allows you to configure a data source, a destination data warehouse, and Object Storage Service (OSS). It provides an automatic and seamless method to synchronizes full data from the data sources to OSS at a specified time. The data source can be ApsaraDB RDS or a self-managed database hosted on an ECS instance. In addition, a schema that is the same as that in the data source is created in OSS and DLA. This schema can be used for the analysis of data in OSS. This process does not affect the business of data sources.

Prerequisites

The following operations are performed:

Note DLA, ApsaraDB RDS, and OSS must be deployed in the same region. If they are deployed in different regions, you cannot use this feature to create a data warehouse.

Procedure

  • Create a data warehouse with one click.
  • After you create a data warehouse, you can manually trigger data synchronization at any time based on your business requirements. During data synchronization, a table schema that is the same as that in the data source such as ApsaraDB RDS or a self-managed database hosted on an ECS instance is created in OSS and DLA.