This topic describes how to use DataWorks to run batch (offline) synchronization of data from various databases to Hologres, so that you can run high-concurrency, low-latency queries and analysis on that data in Hologres.
Prerequisites
Activate DataWorks. For more information, see Get started.
Activate a Hologres instance and attach it to a DataWorks workspace. For more information, see Attach a Hologres instance.
Activate the source database that you want to synchronize data from.
For information about cross-region data synchronization, see Network connectivity solutions.
Background information
Hologres is a real-time interactive analysis product that is deeply integrated with DataWorks and the big data ecosystem. You can use the Data Integration feature of DataWorks to synchronize data offline from a database to Hologres, and then run high-concurrency, low-latency queries and analysis on it.
Common databases that support offline data synchronization include RDS for MySQL, Oracle, PolarDB, and SQL Server.
To view more supported databases, see Supported data sources and read/write plug-ins.
Related documentation: MySQL Reader, Oracle Reader, PolarDB Reader, SQL Server Reader, and Hologres Writer.
Procedure
Configure data sources.
Before you perform an offline sync, you must configure the source database and the Hologres data source.
Configure the source database:
For information about how to configure other data sources, see Configure a data source.
Configure the Hologres data source. For more information, see Configure a Hologres data source.
Configure an offline sync task.
After you configure the data sources, you must configure a sync task to synchronize data offline from the database to Hologres. You can configure the task using the Codeless UI or the Code Editor.
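If the destination table does not yet exist in Hologres, it usually needs to be created before the sync task can write to it. The following is a minimal sketch, not a definitive schema: the table name rds_test is taken from the query example in this topic, while the columns id, name, and gmt_create are hypothetical placeholders that should be replaced with the columns of your own source table.

```sql
-- Hypothetical destination table for the sync task.
-- Column names and types are assumptions; mirror your source table instead.
CREATE TABLE IF NOT EXISTS rds_test (
    id         BIGINT NOT NULL,   -- assumed to match the source table's primary key
    name       TEXT,              -- placeholder column
    gmt_create TIMESTAMPTZ,       -- placeholder column
    PRIMARY KEY (id)
);
```

When you map fields in the sync task, the source columns would then be mapped to these destination columns by name or by position.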
Query data.
After the sync task is complete, you can query the synchronized data in Hologres.
SELECT * FROM rds_test;
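Beyond selecting all rows, a couple of quick checks can help confirm that the sync produced what you expect. This is a sketch that assumes only the rds_test table from the statement above; the row count is meant to be compared against the same count taken on the source database.

```sql
-- Count the rows that arrived in Hologres; compare with the source table.
SELECT COUNT(*) AS row_count FROM rds_test;

-- Preview a small sample instead of returning the full table.
SELECT * FROM rds_test LIMIT 10;
```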