This topic describes how to use the data transmission service to migrate data from a MySQL database to a MySQL tenant of OceanBase Database.
A data migration task remaining in an inactive state for a long time may fail to be restored depending on the retention period of incremental logs. Inactive states are Failed, Stopped, and Completed. The data transmission service releases data migration tasks remaining in an inactive state for more than 3 days to reclaim related resources. We recommend that you configure alerts for data migration tasks and handle task exceptions in a timely manner.
Prerequisites
The data transmission service has the privilege to access cloud resources. For more information, see Grant privileges to roles for data transmission.
You have created dedicated database users for data migration in the source MySQL database and the target MySQL tenant of OceanBase Database and granted required privileges to the users. For more information, see Configure user privileges for a data source.
Binlogs have been enabled. For more information about binlogs in an ApsaraDB RDS for MySQL instance, see Manage binary log files. For more information about binlogs in a PolarDB for MySQL instance, see Enable binary logging.
Limitations
Limitations on the source database
Do not perform DDL operations that modify database or table schemas during schema migration or full migration. Otherwise, the data migration task may be interrupted.
The data transmission service supports MySQL 5.5, 5.6, 5.7, and 8.0.
The data transmission service supports only the MySQL InnoDB storage engine and is unavailable for other engines.
NoteIt supports the InnoDB and X-Engine storage engines for RDS for MySQL instances.
The data transmission service does not support an index field greater than 767 bytes (or 191 characters) in length in MySQL databases.
The data transmission service does not support CASCADE foreign key migration for the source MySQL database.
The data transmission service supports the migration of an object only when the following conditions are met: the database name, table name, and column name of the object are ASCII-encoded without special characters. The special characters are line breaks, spaces, and the following characters: . | " ' ` ( ) = ; / & \.
The precheck fails if the primary key is of the FLOAT or DOUBLE data type. We recommend that you do not specify a column of such a data type as the primary key.
The data transmission service does not support triggers in the target database. If triggers exist in the target database, the data migration may fail.
The clock of the source database must be synchronized with that of the target database.
The RDS for MySQL database and the VPC in which the self-managed database is located must be in the same region as the OceanBase database.
Considerations
The host of the MySQL database must have sufficient outbound bandwidth. Insufficient outbound bandwidth on the host will slow down log parsing and data migration, which may increase the latency of data synchronization.
Take note of the following considerations if the source and target databases use different collations:
The data in the target database may be inconsistent with that in the source database after data synchronization. For example, if the collation for the source database is utf8mb4_0900_ai_ci and that for the target database is utf8mb4_general_ci, two data records
A
andA
cannot be both written to the target database, causing data loss.Data inconsistency will be detected during verification for a table that uses a column of the VARCHAR type as the primary key.
If the clocks between nodes or between the client and the server are out of synchronization, the latency may be inaccurate during incremental synchronization or reverse incremental synchronization.
For example, if the clock is earlier than the standard time, the latency can be negative. If the clock is later than the standard time, the latency can be positive.
If incremental parsing is required for the MySQL database, you must specify the ID of the MySQL server (
server_id
).If you modify a unique index at the target when DDL synchronization is disabled, you must restart the data migration task to avoid data inconsistency.
Check whether the migration precision of the data transmission service for columns of data types such as DECIMAL, FLOAT, and DOUBLE is as expected. If the precision of the target field type is lower than that of the source field type, the value with a higher precision may be truncated. This may result in data inconsistency between the source and target fields.
If the source instance is a read-only RDS for MySQL database, the precheck displays a prompt, which indicates the risk of incremental log consumption.
Take note of the following considerations if you want to aggregate multiple tables:
We recommend that you configure the mappings between the source and target databases by specifying matching rules.
We recommend that you manually create schemas at the target. If you create a schema by using the data transmission service, skip the failed objects in the schema migration step.
A difference between the source and target table schemas may result in data consistency. Some known scenarios are described as follows:
When you manually create a table schema in the target, if the data type of any column is not supported by the data transmission service, implicit data type conversion may occur in the target, which causes inconsistent column types between the source and target databases.
If the length of a column at the target is shorter than that in the source database, the data of this column may be automatically truncated, which causes data inconsistency between the source and target databases.
Take note of the following considerations related to character sets:
The utf8mb3 character set used in the source database will be changed to utf8mb4 after data is migrated to the target database.
If the target is a MySQL tenant of OceanBase Database V4.1.0 or later, the Latin1 character set supports migration.
If you selected only Incremental Synchronization when you created the data migration task, the data transmission service requires that the local incremental logs of the source database be retained for at least 48 hours.
If you selected Full Migration and Incremental Synchronization when you created the data migration task, the data transmission service requires that the local incremental logs of the source database be retained for at least 7 days. If the data transmission service cannot obtain incremental logs, the data migration task may fail or even the data between the source and target databases may be inconsistent after migration.
If the MySQL database contains two or more primary clusters, set the
log_slave_updates
parameter to ON. Otherwise, the data source cannot receive all binlogs, which may lead to data loss.If the source or target database contains table objects that differ only in letter cases, the data migration results may not be as expected due to case insensitivity in the source or target database.
Supported source and target instance types
In the following table, OB_MySQL stands for a MySQL tenant of OceanBase Database, which can be an OceanBase cluster instance, an OceanBase tenant instance, or a serverless instance.
Source | Target |
MySQL (ApsaraDB RDS instance) | OB_MySQL |
MySQL (PolarDB instance) | OB_MySQL |
MySQL (self-managed database in a VPC) | OB_MySQL |
MySQL (database gateway) | OB_MySQL |
MySQL (self-managed database with a public IP address) | OB_MySQL |
Data type mappings
MySQL database | MySQL tenant of OceanBase Database |
INTEGER | INTEGER |
TINYINT | TINYINT |
MEDIUMINT | MEDIUMINT |
BIGINT | BIGINT |
SMALLINT | SMALLINT |
DECIMAL | DECIMAL |
NUMERIC | NUMERIC |
FLOAT | FLOAT |
REAL | REAL |
DOUBLE PRECISION | DOUBLE PRECISION |
BIT | BIT |
CHAR | CHAR |
VARCHAR | VARCHAR |
BINARY | BINARY |
VARBINARY | VARBINARY |
BLOB | BLOB |
TEXT | TEXT |
ENUM | ENUM |
SET | SET |
JSON | JSON |
DATE | DATE |
DATETIME | DATETIME |
TIMESTAMP | TIMESTAMP |
TIME | TIME |
YEAR | YEAR |
GEOMETRY | GEOMETRY |
POINT | POINT |
LINESTRING | LINESTRING |
POLYGON | POLYGON |
MULTIPOINT | MULTIPOINT |
MULTILINESTRING | MULTILINESTRING |
MULTIPOLYGON | MULTIPOLYGON |
GEOMETRYCOLLECTION | GEOMETRYCOLLECTION |
Procedure
Log on to the ApsaraDB for OceanBase console and purchase a data migration task.
For more information, see Purchase a data migration task.
Choose Data Transmission > Data Migration. On the page that appears, click Configuration for the data migration task.
If you want to reference the configurations of an existing task, click Reference Configuration. For more information, see Reference the configuration of a data migration task.
On the Select Source and Target page, configure the parameters.
Parameter
Description
Data Migration Task Name
We recommend that you set it to a combination of digits and letters. It must not contain any spaces and cannot exceed 64 characters in length.
Source
If you have created a MySQL data source, select it from the drop-down list. Otherwise, click New Data Source in the drop-down list and create one in the dialog box that appears on the right. For more information about the parameters, see Create a MySQL data source.
Target
If you have created a data source for the MySQL tenant of OceanBase Database, select it from the drop-down list. Otherwise, click New Data Source in the drop-down list and create one in the dialog box that appears on the right. For more information about the parameters, see Create an OceanBase data source.
Tag (Optional)
Click the field and select a target tag from the drop-down list. You can also click Manage Tags to create, modify, and delete tags. For more information, see Use tags to manage data migration tasks.
Click Next. On the Select Migration Type page, select One-way Synchronization for Synchronous Topology.
The data transmission service supports both one-way and two-way synchronization. This topic describes how to configure one-way synchronization task. For information about two-way synchronization, see Configure a two-way synchronization task.
Select a type for the current data migration task.
If you select One-way Synchronization for Synchronous Topology, the supported migration types are Schema Migration, Full Migration, Incremental Synchronization, Full Verification, and Reverse Increment.
Migration type
Description
Schema migration
After a schema migration task is started, the data transmission service migrates the definitions of database objects (such as tables, indexes, constraints, comments, and views) from the source database to the target database and automatically filters out temporary tables.
In a task that migrates schemas from a MySQL database to a MySQL tenant of OceanBase Database, the database that does not exist in the target can be automatically created.
Full migration
After a full migration task is started, the data transmission service migrates existing data from tables in the source database to corresponding tables in the target database. If you select Full Migration, we recommend that you use the
ANALYZE
statement to collect the statistics of the MySQL database before data migration.Incremental synchronization
After an incremental synchronization task is started, the data transmission service synchronizes changed data (data that is added, modified, or removed) from the source database to corresponding tables in the target database.
Options for Incremental synchronization are DML synchronization and DDL synchronization. You can select operations as needed. For more information, see Configure DDL/DML synchronization. Incremental synchronization has the following limitations:
If you select DDL Synchronization, when you perform a DDL operation that is not supported by the data transmission service in the source database, the data migration may be interrupted.
If the DDL operation is ADD COLUMN, we recommend that you set the column to a NULL column. Otherwise, data migration may be interrupted.
Full verification
After the full migration and incremental synchronization tasks are completed, the data transmission service automatically initiates a full verification task to verify the tables in the source and target databases.
If you select Full Verification, we recommend that you collect the statistics of the MySQL database and the MySQL tenant of OceanBase Database before full verification.
If you selected Incremental Synchronization but did not select all DML operations in the DML Synchronization section, you cannot select Full Verification.
The data transmission service supports full verification only for tables with a primary key or non-null unique key.
Reverse incremental synchronization
Data changes made in the target database after the business database switchover are synchronized to the source database in real time through reverse incremental synchronization.
Generally, incremental synchronization configurations are reused for reverse incremental synchronization. You can also customize the configurations for reverse incremental synchronization as needed.
Click Next. On the Select Migration Objects page, specify migration objects for the data migration task.
You can select Specify Objects or Match Rules to specify the migration objects. This topic describes how to use Specify Objects to specify the migration objects. For information about matching rules, see the "Wildcard patterns for data migration/synchronization between databases" section in the Configure and modify matching rules topic.
ImportantThe names of tables to be migrated, as well as the names of columns in the tables, must not contain Chinese characters.
If a database or table name contains double dollar signs ($$), you cannot create the migration task.
If you have selected DDL synchronization on the Select Migration Type page, we recommend that you select migration objects by using the Match Rules option. This way, all new objects that meet the migration rules will be synchronized. If you select migration objects by using the Specify Objects option, new or renamed objects will not be synchronized.
In the Select Migration Objects section, select Specify Objects.
In the Source Object(s) list of the Specify Migration Scope section, select the objects to migrate. You can select tables and views of one or more databases.
Click > to add them to the Target Object(s) list.
The data transmission service allows you to import objects from text files, rename target objects, set row filters, view column information, and remove a single or all migration objects.
NoteWhen you select Match Rules to specify migration objects, object renaming is implemented based on the syntax of the specified matching rules. In the operation area, you can only set filter conditions. For more information, see Configure and modify matching rules.
Operation
Description
Import objects
In the list on the right, click Import Objects in the upper-right corner.
In the dialog box that appears, click OK.
ImportantThis operation will overwrite previous selections. Proceed with caution.
In the Import Objects dialog box, import the objects to be migrated.
You can import CSV files to rename databases or tables and set row filtering conditions. For more information, see Download and import the settings of migration objects.
Click Validate.
After you import the migration objects, check their validity. Column field mapping is not supported at present.
After the validation succeeds, click OK.
Rename objects
The data transmission service allows you to rename migration objects. For more information, see Rename a database table.
Configure settings
The data transmission service allows you to filter rows by using
WHERE
conditions. For more information, see Use SQL conditions to filter data.You can also view column information of the migration objects in the View Columns section.
Remove one or all objects
The data transmission service allows you to remove a single object or all migration objects that are added to the right-side list during data mapping.
Remove a single migration object
In the list on the right, move the pointer over the object that you want to remove, and click Remove to remove the migration object.
Remove all migration objects
In the list on the right, click Remove All in the upper-right corner. In the dialog box that appears, click OK to remove all migration objects.
Click Next. On the Migration Options page, configure the parameters.
Full migration
The following table describes the parameters for full migration, which are displayed only if you have selected One-way Synchronization and Full Migration in sequence on the Select Migration Type page.
Parameter
Description
Read Concurrency
The concurrency for reading data from the source during full migration. The maximum value is 512. A high read concurrency may incur excessive stress on the source, affecting the business.
Write Concurrency
The concurrency for writing data to the target during full migration. The maximum value is 512. A high write concurrency may incur excessive stress on the target, affecting the business.
Full Migration Rate Limit
You can choose whether to limit the full migration rate as needed. If you choose to limit the full migration rate, you must specify the records per second (RPS) and bytes per second (BPS). The RPS specifies the maximum number of data rows migrated to the target per second during full migration, and the BPS specifies the maximum amount of data in bytes migrated to the target per second during full migration.
NoteThe RPS and BPS values specified here are only for throttling. The actual full migration performance is subject to factors such as the settings of the source and target and the instance specifications.
Handle Non-empty Tables in Target Database
Valid values are Ignore and Stop Migration.
If you select Ignore, when the data to be inserted conflicts with existing data of a target table, the data transmission service logs the conflicting data while retaining the existing data.
ImportantIf you select Ignore, data is pulled in IN mode during full verification. In this case, verification is inapplicable if the target contains data that does not exist in the source, and the verification performance is downgraded.
If you select Stop Migration and a target table contains records, an error prompting migration unsupported is reported during full migration. In this case, you must process the data in the target table before continuing with the migration.
ImportantIf you click Restore in the dialog box prompting the error, the data transmission service ignores this error and continues to migrate data. Proceed with caution.
Post-Indexing
Specifies whether to create indexes after the full migration is completed. Post-indexing can shorten the time required for full migration. For more information about the considerations on post-indexing, see the description below.
ImportantThis parameter is displayed only if you have selected both Schema Migration and Full Migration on the Select Migration Type page.
Only non-unique key indexes can be created after the migration is completed.
If the target OceanBase database returns the following error during index creation, the data transmission service ignores the error and determines that the index is successfully created, without creating it again.
Error message in a MySQL tenant of OceanBase Database:
Duplicate key name
.Error message in an Oracle tenant of OceanBase Database:
name is already used by an existing object
.
If the target is an OceanBase database and you select Allow for this parameter, the following parameters need to be set:
Single Index DDL Concurrency Configuration: the maximum number of concurrent DDL operations allowed for a single index. A larger value indicates higher resource consumption and faster data migration.
Maximum concurrent index DDL quantity configuration: the maximum number of post-indexing DDL operations that the system can call at a time.
If post-indexing is allowed, we recommend that you use a CLI client to modify the following parameters for business tenants based on the hardware conditions of OceanBase Database and your current business traffic:
// Specify the limit on the file memory buffer size. alter system set _temporary_file_io_area_size = '10' tenant = 'xxx'; // Disable throttling in OceanBase Database V4.x. alter system set sys_bkgd_net_percentage = 100;
Incremental synchronization
The following parameters are displayed only if you have selected One-way Synchronization and Incremental Synchronization in sequence on the Select Migration Type page.
Parameter
Description
Write Concurrency
The concurrency for writing data to the target during incremental synchronization. The maximum value is 512. A high write concurrency may incur excessive stress on the target, affecting the business.
Incremental Synchronization Rate Limit
You can choose whether to limit the incremental synchronization rate as needed. If you choose to limit the incremental synchronization rate, you must specify the RPS and BPS. The RPS specifies the maximum number of data rows synchronized to the target per second during incremental synchronization, and the BPS specifies the maximum amount of data in bytes synchronized to the target per second during incremental synchronization.
NoteThe RPS and BPS values specified here are only for throttling. The actual incremental synchronization performance is subject to factors such as the settings of the source and target and the instance specifications.
Incremental Synchronization Start Timestamp
This parameter is not displayed if you have selected Full Migration on the Select Migration Type page.
If you have selected Incremental Synchronization but not Full Migration, specify a point in time after which the data is to be synchronized. The default value is the current system time. For more information, see Set an incremental synchronization timestamp.
Reverse incremental synchronization
The following parameters are displayed only if you have selected One-way Synchronization and Reverse Increment in sequence on the Select Migration Type page. By default, incremental synchronization configurations are reused for reverse incremental synchronization.
You can choose not to reuse the incremental synchronization configurations and configure reverse incremental migration as needed.
Parameter
Description
Write Concurrency
The concurrency for writing data to the source during reverse incremental synchronization. The maximum value is 512. A high concurrency may incur excessive stress on the source, affecting the business.
Reverse Increment Rate Limit
You can choose whether to limit the reverse incremental synchronization rate as needed. If you choose to limit the reverse incremental synchronization, you must specify the RPS and BPS. The RPS specifies the maximum number of data rows synchronized to the source per second during reverse incremental synchronization, and the BPS specifies the maximum amount of data in bytes synchronized to the source per second during reverse incremental synchronization.
NoteThe RPS and BPS values specified here are only for throttling. The actual reverse incremental synchronization performance is subject to factors such as the settings of the source and target and the instance specifications.
Incremental Synchronization Start Timestamp
This parameter is not displayed if you have selected Full Migration on the Select Migration Type page.
If you have selected Incremental synchronization but not Full Migration, the forward switchover start timestamp (if any) is used by default. This parameter cannot be modified.
Advanced parameters
This section is displayed only if the target is a MySQL tenant of OceanBase Database V4.3.0 or later and you have selected Schema Migration or selected DDL Synchronization for Incremental Synchronization on the Select Migration Type page.
The storage types supported for target table objects are Default, Row storage, Column Storage, and Hybrid columnar storage. For more information, see default_table_store_format.
NoteThe value Default means that other parameters are automatically set based on the parameter configurations of the target. Table objects in schema migration and new table objects in incremental DDL synchronization are written to the corresponding schemas based on the specified storage type.
Click Precheck to start a precheck on the data migration task.
During the precheck, the data transmission service checks the read and write privileges of the database users and the network connections of the databases. A data migration task can be started only after it passes all check items. If an error is returned during the precheck, you can perform the following operations:
Identify and troubleshoot the problem and then perform the precheck again.
Click Skip in the Actions column of the failed precheck item. In the dialog box that prompts the consequences of the operation, click OK.
After the precheck succeeds, click Start Task.
If you do not need to start the task now, click Save. You can start the task later on the Migration Tasks page or by performing batch operations. For more information about batch operations, see Perform batch operations on data migration tasks.
The data transmission service allows you to modify the migration objects and their row filtering conditions when a migration task is running. For more information, see View and modify migration objects and their filter conditions. After the data migration task is started, it is executed based on the selected migration types. For more information, see View migration details.