After you prepare data sources, network environments, and resources, you can create a real-time synchronization node to synchronize data to DataHub. This topic describes how to create a real-time synchronization node and view the status of the node.
Prerequisites
- The data sources that you want to use are prepared. Before you configure a data synchronization node, you must prepare the data sources from which you want to read data and to which you want to write data. This way, when you configure a data synchronization node, you can select the data sources. For information about the data source types, readers, and writers that are supported by real-time synchronization, see Data source types that support real-time synchronization. Note For information about the items that you need to understand before you prepare a data source, see Overview.
- An exclusive resource group for Data Integration that meets your business requirements is purchased. For more information, see Create and use an exclusive resource group for Data Integration.
- Network connections are established between the exclusive resource group for Data Integration and the data sources. For more information, see Establish a network connection between a resource group and a data source.
- The data source environments are prepared. You must create an account that can be used to access a database in the source and an account that can be used to access a database in the destination. You must also grant the accounts the permissions required to perform specific operations on the databases based on your configurations for data synchronization. For more information, see Overview.
Precautions
You can use only exclusive resource groups for Data Integration to run real-time synchronization nodes.
- You can use a real-time synchronization node to synchronize data to a DataHub data source only from a PolarDB, OceanBase, MySQL, or Oracle data source.
Usage notes
For information about support of different topic types for synchronization of data changes generated by operations on a source table, sharding strategies for different topic types, data formats, and sample messages, see Appendix: DataHub message formats.
Create a real-time synchronization node
- Create a real-time synchronization node to synchronize all data in a database.
- Configure an exclusive resource group for Data Integration.
- Configure the source and mapping rules.
- Configure the destination topics.
- Select a data source as the destination and configure formats for the destination topics.
- Configure the resources required to run the data synchronization node.
Commit and deploy the real-time synchronization node
- Click the icon in the top toolbar to save the node.
- Click the icon in the top toolbar to commit the node.
- In the Commit Node dialog box, configure the Change description parameter.
- Click Confirm. If you use a workspace in standard mode, you must deploy the node in the production environment after you commit the node. On the left side of the top navigation bar, click Deploy. For more information, see Deploy nodes.