Use DataWorks to perform an offline sync of data to Hologres - Hologres

This topic describes how to use DataWorks to synchronize data in a batch from various databases to Hologres. This lets you perform high-concurrency, low-latency query and analysis in Hologres.

Prerequisites

Activate DataWorks. For more information, see Get started.
Activate a Hologres instance and attach it to a DataWorks workspace. For more information, see Attach a Hologres instance.
Activate the required database.

Note

For information about cross-region data synchronization, see Network connectivity solutions.

Background information

Hologres is a real-time interactive analysis product that is deeply integrated with DataWorks and the big data ecosystem. You can use the data integration feature of DataWorks to synchronize data offline from a database to Hologres for high-concurrency, low-latency query and analysis.

Common databases that support offline data synchronization include RDS for MySQL, Oracle, PolarDB, and SQL Server.

Note

To view more supported databases, see Supported data sources and read/write plug-ins.

Related documentation: MySQL Reader, Oracle Reader, PolarDB Reader, SQL Server Reader, and Hologres Writer.

Procedure

Configure data sources.
Before you perform an offline sync, you must configure the source database and the Hologres data source.
- Configure the source database:
  - Configure a MySQL data source
  - Configure an Oracle data source
  - Configure a PolarDB data source
  - Configure a SQL Server data source
  - For information about how to configure other data sources, see Configure a data source.
- Configure the Hologres data source: Configure a Hologres data source
Configure an offline sync task.
After you configure the data sources, you must configure a sync task to synchronize data offline from the database to Hologres. You can configure the task using the Codeless UI or the Code Editor.
- Configure a task in the codeless UI
- Configure a task in the code editor
Query data.
After the sync task is complete, you can query the synchronized data in Hologres.
```
SELECT * FROM rds_test;
```