All Products
Search
Document Center

DataWorks:Configure Hologres Writer

Last Updated:Feb 17, 2025

Utilize Hologres's real-time write capability to build an interactive data warehouse for real-time analysis.

Prerequisites

Before configuring the Hologres Writer node, ensure the corresponding input or transform data source is set up. For more information, see the referenced document.

Background information

  • Supported Hologres data source versions include the following: V0.7, V0.8, V0.9, V0.10, and V1.1.

  • Fields of the UUID data type are not supported for synchronization.

Procedure

  1. Go to the DataStudio page.

    Log on to the DataWorks console. In the top navigation bar, select the desired region. In the left-side navigation pane, choose Data Development and O&M > Data Development. On the page that appears, select the desired workspace from the drop-down list and click Go to Data Development.

  2. Hover your mouse over the 新建 icon and click Create Node > Data Integration > Real-time Synchronization.

    Alternatively, expand the business flow, right-click the target business flow, and select Create Node > Data Integration > Real-time Synchronization.

  3. In the Create Node dialog box, select Synchronization Method as Single Table (topic) to Single Table (topic) ETL, enter the Name, and select the Path.

  4. Click Confirm.

  5. On the real-time synchronization node's edit page, click Output > Hologres and drag it to the editing panel. Connect it to the previously configured input or transform node.

  6. Click the Hologres node. In the Node Configuration dialog box, set the parameters.

    Parameter

    Description

    Data Source

    The name of the Hologres data source that you configured. You can select only a Hologres data source.

    If you have not configured a data source, click Create Data Source on the right to go to the Workspace Configuration > Data Source page to create one. For more information, see the referenced document.

    Table

    Select the name of the table to which you want to write data from the current data source.

    You can click Create Table on the right to create a new table, or click Data Preview to confirm.

    Partition Information

    Partition Method

    The default is Dynamic Partitioning Based On Field Content.

    If the Hologres table is a partitioned table, you need to set these partition parameters.

    Partition Field Value Source

    You can select the table field information from the input data source configured in the ancestor node.

    Partition Field Name

    The default is the partition field name set for the table.

    Partition Field Value

    The partition field value can be an Enumeration Value or a Time Value.

    • If the partition field value is an enumeration value, each value in the partition field will create a partition. Therefore, the number of different values must not exceed 1000 per day. If this value is exceeded, partition creation will fail and the real-time task will stop running.

    • If the partition field value is a time value, you need to configure the corresponding time Source Format and the saved Target Format.

    Partitioned Cache Queue Size

    The larger the partitioned cache queue size, the greater the memory consumption. If the data from the source is severely out of order based on the partition field, it is recommended to increase this value and accordingly increase the memory.

    Job Type

    Includes two types: Replay and Insert:

    • Replay indicates the mirror feature. That is, when the source end INSERT a record, Hologres also INSERT a piece of data. When the source end performs UPDATE or DELETE operations, Hologres will execute the corresponding UPDATE or DELETE operations.

    • Insert indicates using Hologres as stream storage to save the data synchronized from the source end through INSERT.

    Write Conflict Policy

    Includes two types: Overwrite and Ignore:

    • Overwrite: Use the new data synchronized from the source end to overwrite the existing data.

    • Ignore: Ignore the new data synchronized from the source end and retain the existing data.

    Field Mapping

    Click Field Mapping to set the mapping of fields between the source and destination ends. The synchronization node synchronizes data based on the field mappings.

  7. Click the 保存 icon in the toolbar.