Parameter | Description | Required | Default value |
datasource | The name of the data source. It must be the same as the name of the added data source. You can add data sources by using the code editor. | Yes | No default value |
dataTable | The name of the table from which you want to read incremental data. The Stream feature must be enabled for the table. You can enable the Stream feature for a table when you create the table. You can also call the UpdateTable operation to enable this feature after the table is created. | Yes | No default value |
statusTable | The name of the table that Tablestore Stream Reader uses to store status records. These records help find the data that is not required and improve read efficiency. If the specified table does not exist, Tablestore Stream Reader automatically creates such a table. After an offline read task is completed, you do not need to delete the table. The status records in the table can be used for the next read task. You do not need to create a status table. You need to only provide a table name. Tablestore Stream Reader attempts to create a status table on your instance. If no such table exists, Tablestore Stream Reader automatically creates one. If the table already exists, Tablestore Stream Reader determines whether the metadata of the table meets the expectation. If the metadata does not meet the expectation, Tablestore Stream Reader reports an error. After a read task is completed, you do not need to delete the table. The status records in the table can be used for the next read task. Time-to-live (TTL) is enabled for the table, and data automatically expires after the TTL elapses. This way, the table stores a small volume of data. You can use the same status table to store the status records of multiple tables that are specified by the dataTable parameter and managed by the same instance. The status records are independent of each other.
You can configure a name similar to TableStoreStreamReaderStatusTable. Make sure that the name is different from the name of a business-related table. | Yes | No default value |
startTimestampMillis | The start time of the incremental data, in milliseconds. The start time is the left boundary of the left-closed, right-open time range of the incremental data. Tablestore Stream Reader searches for the status records in the table that is specified by the statusTable parameter based on the time that is specified by the startTimestampMillis parameter and reads data from the time. If Tablestore Stream Reader cannot find the status records, Tablestore Stream Reader reads the incremental data that is retained by the system from the first entry, and skips the data that is written later than the time that is specified by the startTimestampMillis parameter.
| No | No default value |
endTimestampMillis | The end time of the incremental data, in milliseconds. The end time is the right boundary of the left-closed, right-open time range of the incremental data. Tablestore Stream Reader exports data from the time specified by the startTimestampMillis parameter and stops exporting data when the timestamp of a data record is later than or equal to the time specified by the endTimestampMillis parameter. After all the incremental data is read, Tablestore Stream Reader stops reading data even before the time specified by the endTimestampMillis parameter.
| No | No default value |
date | The date on which the data that you want to read is generated. Configure this parameter in the yyyyMMdd format, such as 20151111. You must configure the date parameter, the startTimestampMillis and endTimestampMillis parameters, or the startTimeString and endTimeString parameters. For example, Alibaba Cloud Data Process Center schedules tasks by day. Therefore, you need to configure the date parameter and do not need to configure the startTimestampMillis and endTimestampMillis parameters or the startTimeString and endTimeString parameters. | No | No default value |
isExportSequenceInfo | Specifies whether to read time series information. The time series information includes the time when data is written. The default value is false, which indicates that time series information is not read. | No | false |
maxRetries | The maximum number of retries for each request to read incremental data from Tablestore. Default value: 30. Retries are performed at a specific interval. The total duration of 30 retries is approximately 5 minutes. You can keep the default settings. | No | 30 |
startTimeString | The start time of the incremental data, in seconds. The start time is the left boundary of the left-closed, right-open time range of the incremental data. Configure this parameter in the yyyymmddhh24miss format. | No | No default value |
endTimeString | The end time of the incremental data, in seconds. The end time is the right boundary of the left-closed, right-open time range of the incremental data. Configure this parameter in the yyyymmddhh24miss format. | No | No default value |
enableSeekIterator | Specifies whether to determine the offset from which Tablestore Stream Reader starts to read incremental data. If incremental data is frequently read, Tablestore Stream Reader automatically determines the offset based on the offset from which data is read last time. If Tablestore Stream Reader is not run before, data is read from the start time of incremental data. By default, incremental data is stored for seven days. Before the start time arrives, no data is exported. You can add "enableSeekIterator": true to the configurations of Tablestore Stream Reader to help you find the start time from which Tablestore Stream Reader starts to read incremental data. | No | false |
mode | The read mode. If this parameter is set to single_version_and_update_only, data is read in row mode. | No | No default value |
isTimeseriesTable | Specifies whether the table is a time series table. This parameter takes effect only when the mode parameter is set to single_version_and_update_only. | No | false |
column | The names of columns from which you want to read data when you set the mode parameter to single_version_and_update_only . Sample code:
"column":[
{"name":"pk1"},
{"name":"col1"},
{"name":"col2","dataType":"new"},
{"name":"col2","dataType":"old"},
{"name":"col2","dataType":"latest"}
],
The name field specifies the name of the column from which you want to read data. This field is required. The dataType field specifies the type of data that you want to read. The default type is new . This field is optional. The dataType field supports the following enumeration types: new : indicates data in the current column after an update.
old : indicates data in the current column before an update.
latest : indicates the latest data in the current column.
Note If you want to read data in row mode, you must configure this parameter. Otherwise, data cannot be read. | Yes in row mode No in column mode
| No default value |