All Products
Search
Document Center

DataWorks:Data source configuration and management

Last Updated:Dec 04, 2024

Before you configure a data synchronization node, you must configure a source and a destination that are required for the node. This way, you can select the source and destination when you configure the node. This topic describes the items that you need to understand before you configure a data source.

Items

  1. Before you configure a data source, you need to understand the items that are described in the following table.

    Item

    Description

    Types of data sources supported by Data Integration

    Before you configure a data synchronization node, you must configure the data sources that you want to use on the Data Source page of the DataWorks console. Data Integration supports dozens of data source types. For more information, see Supported data source types, Reader plug-ins, and Writer plug-ins.

    Note

    The database from which data is read and the database to which data is written are determined based on data sources. When you configure a source and a destination, you must specify the database or data warehouse from which you want to read data and the database or data warehouse to which you want to write data. This way, when you configure a data synchronization node, you can select the source and destination, and data will be read from and written to the specified databases or data warehouses during data synchronization.

    Special authentication for data sources

    Data Integration allows you to use third-party identity authentication mechanisms to control access to specific data sources. Before you use an authentication mechanism to perform identity authentication, you must upload the required authentication files on the Authentication File Management page of the DataWorks console. Then, you must enable third-party authentication when you add a data source. This way, only trusted applications and services can access the data source. For more information about third-party identity authentication mechanisms, see Manage third-party authentication files.

    Note
    • Only Kerberos authentication is supported. For more information, see Configure Kerberos authentication.

    • Different types of data sources support different authentication mechanisms. For more information, see the topics for configuring different types of data sources.

    Support for accessing specific data sources in RAM authorization mode

    Data Integration allows you to use the RAM authorization mode to access specific data sources. This mode ensures higher security. You can select the RAM authorization mode and a custom RAM role when you configure a data source. This way, you can use the RAM role to access the data source. For more information, see Use the RAM role-based authorization mode to add a data source.

    Isolation of the same data source in the development environment and production environment

    You can add the same data source separately for the development environment and production environment. This way, the data source in the development environment and that in the production environment are isolated from each other. For more information, see Isolate a data source in the development and production environments.

    Note

    Only workspaces in standard mode support isolation for data sources.

    Access to data sources by adding host or DNS configurations

    If the IP address of your data source is frequently changed or you cannot access your data source by using IP addresses, you can add host configurations for the exclusive resource group for Data Integration. For more information about how to add host configurations for an exclusive resource group for Data Integration, see Associate the exclusive resource group for Data Integration with a VPC.

  2. After the required data sources are configured, you can share the data sources by managing permissions on the data sources.

    • DataWorks allows you to share a data source in a workspace with another workspace or a member in another workspace by managing permissions on the data source, and go to the workspace to view the shared data source. You can configure permission settings, such as Editable and Not editable, on a data source for a workspace or member. For more information, see Manage permissions on data sources.

    • If you want to control data read and write operations on a table in a data source, you can create a request processing policy for the data synchronization node that uses the data source in Approval Center. Approval Center allows the user to whom the Workspace Manager role is assigned to use the combination of a source and a destination to specify the data synchronization node on which the operation requests must be processed. For more information, see Request processing policies for Data Integration tasks.