Parameter | Type | Required | Example | Description |
Action | String | Yes | UpdateFile | The operation that you want to perform. |
FileFolderPath | String | No | Business Flow/1/Data Integration/Folder 1/Folder 2 | The path of the file. |
ProjectId | Long | No | 10000 | The ID of the DataWorks workspace. You can log on to the DataWorks console and go to the Workspace page to obtain the workspace ID. |
FileName | String | No | ods_user_info_d | The name of the file. You can set the FileName parameter to another value to change the file name. You can call the ListFiles operation to query the ID of the file whose name you want to change. Then, you can set the FileId parameter to the ID and set the FileName parameter to a new value when you call the UpdateFile operation. |
FileDescription | String | No | File description | The description of the file. |
Content | String | No | SELECT "1"; | The code of the file. The code format varies based on the file type. To view the code format for a specific file type, go to Operation Center, right-click a task of the file type, and then select View Code. |
AutoRerunTimes | Integer | Yes | 3 | The number of automatic reruns that are allowed after an error occurs. |
AutoRerunIntervalMillis | Integer | No | 120000 | The interval between two consecutive automatic reruns after an error occurs. Unit: milliseconds. Maximum value: 1800000 (30 minutes). This parameter corresponds to the Rerun interval parameter that is displayed after the Auto Rerun upon Failure check box is selected in the Schedule section of the Properties tab in the DataWorks console. The interval that you specify in the DataWorks console is measured in minutes. Pay attention to the conversion between the units of time when you call the operation. |
RerunMode | String | No | ALL_ALLOWED | Specifies whether the task that corresponds to the file can be rerun. Valid values: ALL_ALLOWED (the task can be rerun regardless of whether it succeeds or fails); FAILURE_ALLOWED (the task can be rerun only after it fails); ALL_DENIED (the task cannot be rerun regardless of whether it succeeds or fails). This parameter is equivalent to the Rerun parameter in the Schedule section of the Properties panel in the DataWorks console. |
Stop | Boolean | No | false | Specifies whether to suspend the scheduling of the task. Valid values: true and false. Setting this parameter to true is equivalent to setting the Recurrence parameter to Skip Execution in the Schedule section of the Properties tab in the DataWorks console. |
ParaValue | String | No | x=a y=b z=c | The scheduling parameters of the task. Configuring this parameter is equivalent to configuring scheduling parameters in the Scheduling Parameter section of the Properties tab in the DataWorks console. For more information, see Configure scheduling parameters. |
StartEffectDate | Long | No | 936923400000 | The start time of automatic scheduling. Set this parameter to a UNIX timestamp representing the number of milliseconds that have elapsed since January 1, 1970, 00:00:00 UTC. Configuring this parameter is equivalent to specifying a start time after you select Specified Time for the Validity Period parameter in the Schedule section of the Properties tab in the DataWorks console. |
EndEffectDate | Long | No | 4155787800000 | The end time of automatic scheduling. Set this parameter to a UNIX timestamp representing the number of milliseconds that have elapsed since January 1, 1970, 00:00:00 UTC. Configuring this parameter is equivalent to specifying an end time after you select Specified Time for the Validity Period parameter in the Schedule section of the Properties tab in the DataWorks console. |
CronExpress | String | No | 00 00-59/5 1-23 * * ? | The cron expression that represents the periodic scheduling policy of the task. This parameter corresponds to the Cron Expression parameter in the Schedule section of the Properties tab in the DataWorks console. After you configure the Scheduling Cycle and Scheduled time parameters in the DataWorks console, DataWorks automatically generates a value for the Cron Expression parameter. Examples: 00 30 05 * * ? (runs at 05:30 every day); 00 15 * * * ? (runs at the fifteenth minute of each hour); 00 00/10 * * * ? (runs every 10 minutes); 00 00-59/10 8-17 * * ? (runs every 10 minutes from 08:00 to 17:00 every day); 00 20 00 1 * ? (runs at 00:20 on the first day of each month); 00 10 00 1 1-12/3 ? (runs every three months starting from 00:10 on January 1); 00 05 00 * * 2,5 (runs at 00:05 every Tuesday and Friday). The scheduling system of DataWorks imposes limits on cron expressions. |
CycleType | String | No | NOT_DAY | The type of the scheduling cycle of the task. Valid values: NOT_DAY and DAY. The value NOT_DAY indicates that the task is scheduled to run by minute or hour. The value DAY indicates that the task is scheduled to run by day, week, or month. This parameter corresponds to the Scheduling Cycle parameter in the Schedule section of the Properties panel in the DataWorks console. |
DependentType | String | No | USER_DEFINE | The type of the cross-cycle scheduling dependency of the task. Valid values: SELF (the instance generated for the task in the current cycle depends on the instance generated for the task in the previous cycle); CHILD (the instance generated for the task in the current cycle depends on the instances generated for the descendant tasks at the nearest level of the task in the previous cycle); USER_DEFINE (the instance generated for the task in the current cycle depends on the instances generated for one or more specified tasks in the previous cycle); NONE (no cross-cycle scheduling dependency type is specified for the task). |
DependentNodeIdList | String | No | 5,10,15,20 | The ID of the task on which the task that corresponds to the file depends when the DependentType parameter is set to USER_DEFINE. If you specify multiple IDs, separate them with commas (,). The value of this parameter corresponds to the ID of the task that you specified after you select Other Nodes for the Cross-Cycle Dependency (Original Previous-Cycle Dependency) parameter in the Dependencies section of the Properties tab in the DataWorks console. |
InputList | String | No | project_root,project.file1,project.001_out | The output names of the parent files on which the current file depends. If you specify multiple output names, separate them with commas (,). This parameter corresponds to the Output Name of Ancestor Node parameter in the Parent Nodes subsection of the Dependencies section of the Properties tab in the DataWorks console. Note: You must configure this parameter when you call the CreateDISyncTask or UpdateFile operation to create a batch synchronization task. |
ProjectIdentifier | String | No | dw_project | The name of the DataWorks workspace. You can log on to the DataWorks console and go to the Workspace page to obtain the workspace name. You must configure either this parameter or the ProjectId parameter to determine the DataWorks workspace to which the operation is applied. |
FileId | Long | Yes | 100000001 | The ID of the file. You can call the ListFiles operation to obtain the ID. |
OutputList | String | No | dw_project.ods_user_info_d | The output name of the file. This parameter corresponds to the Output Name parameter in the Output Name of Current Node subsection of the Dependencies section of the Properties tab in the DataWorks console. |
ResourceGroupIdentifier | String | No | default_group | The identifier of the resource group that is used to run the task that corresponds to the file. You can call the ListResourceGroups operation to query the available resource groups in the workspace. |
ConnectionName | String | No | odps_source | The name of the data source that is used to run the task. You can call the ListDataSources operation to query the available data sources in the workspace. |
Owner | String | No | 18023848927592 | The ID of the file owner. |
AutoParsing | Boolean | No | true | Specifies whether to enable the automatic parsing feature for the file. Valid values: true and false. This parameter corresponds to the Parse Input and Output from Code button in the Dependencies section of the Properties tab in the DataWorks console. |
SchedulerType | String | No | NORMAL | The scheduling type of the task. Valid values: NORMAL (the task is an auto triggered task); MANUAL (the task is a manually triggered task; manually triggered tasks cannot be automatically scheduled, and you can go to the Manually Triggered Workflows pane to view them); PAUSE (the task is a paused task); SKIP (the task is a dry-run task; dry-run tasks are started as scheduled, but the scheduling system sets their status to successful when it starts to run them). |
AdvancedSettings | String | No | {"queue":"default","SPARK_CONF":"--conf spark.driver.memory=2g"} | The advanced configurations of the task. This parameter is valid only for an EMR Spark Streaming task or an EMR Streaming SQL task. This parameter corresponds to the Advanced Settings tab of the task in the DataWorks console. The value of this parameter must be in the JSON format. |
StartImmediately | Boolean | No | true | Specifies whether to run a task immediately after the task is deployed. Valid values: true and false. This parameter is valid only for an EMR Spark Streaming task or an EMR Streaming SQL task. This parameter corresponds to the Start Method parameter in the Schedule section of the Configure tab in the DataWorks console. |
InputParameters | String | No | [{"ValueSource": "project_001.first_node:bizdate_param","ParameterName": "bizdate_input"}] | The input parameters of the task. The value of this parameter must be in the JSON format. For more information about the input parameters, see the InputContextParameterList parameter in the Response parameters section of the GetFile operation. This parameter corresponds to the Input Parameters subsection of the Input and Output Parameters section of the Properties tab in the DataWorks console. |
OutputParameters | String | No | [{"Type": 1,"Value": "${bizdate}","ParameterName": "bizdate_param"}] | The output parameters of the task. The value of this parameter must be in the JSON format. For more information about the output parameters, see the OutputContextParameterList parameter in the Response parameters section of the GetFile operation. This parameter corresponds to the Output Parameters subsection of the Input and Output Parameters section of the Properties tab in the DataWorks console. |
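
Several parameters in the table need a value conversion before the call: AutoRerunIntervalMillis is in milliseconds while the console uses minutes, StartEffectDate and EndEffectDate are UNIX timestamps in milliseconds, DependentNodeIdList is a comma-separated string, and InputParameters is a JSON string. The following Python sketch shows one way to assemble such a parameter set. The helper names (minutes_to_millis, to_epoch_millis, build_update_file_params) are hypothetical and not part of any DataWorks SDK, and the sketch only builds a dictionary; how the request is signed and sent is out of scope. Rejecting negative intervals is an assumption, because the table documents only the maximum.

```python
import json
from datetime import datetime, timedelta, timezone

MAX_RERUN_INTERVAL_MS = 1_800_000  # documented maximum: 30 minutes
EPOCH = datetime(1970, 1, 1, tzinfo=timezone.utc)


def minutes_to_millis(minutes):
    """Convert the console's minute-based rerun interval to milliseconds."""
    millis = minutes * 60 * 1000
    # Lower bound of 0 is an assumption; only the maximum is documented.
    if not 0 <= millis <= MAX_RERUN_INTERVAL_MS:
        raise ValueError(f"rerun interval out of range: {millis} ms")
    return millis


def to_epoch_millis(moment):
    """Return milliseconds since 1970-01-01 00:00:00 UTC for an aware datetime."""
    if moment.tzinfo is None:
        raise ValueError("pass a timezone-aware datetime")
    return (moment - EPOCH) // timedelta(milliseconds=1)


def build_update_file_params(file_id, project_id, *, rerun_interval_minutes=None,
                             start=None, end=None, dependent_node_ids=None,
                             input_parameters=None):
    """Hypothetical helper: build a dict keyed by the parameter names in the table."""
    params = {"Action": "UpdateFile", "FileId": file_id, "ProjectId": project_id}
    if rerun_interval_minutes is not None:
        params["AutoRerunIntervalMillis"] = minutes_to_millis(rerun_interval_minutes)
    if start is not None:
        params["StartEffectDate"] = to_epoch_millis(start)
    if end is not None:
        params["EndEffectDate"] = to_epoch_millis(end)
    if dependent_node_ids:
        # DependentNodeIdList applies when DependentType is USER_DEFINE.
        params["DependentType"] = "USER_DEFINE"
        params["DependentNodeIdList"] = ",".join(str(i) for i in dependent_node_ids)
    if input_parameters is not None:
        # The table requires InputParameters to be a JSON-formatted string.
        params["InputParameters"] = json.dumps(input_parameters)
    return params


if __name__ == "__main__":
    example = build_update_file_params(
        100000001,
        10000,
        rerun_interval_minutes=2,
        start=datetime(1999, 9, 10, 0, 30, tzinfo=timezone.utc),
        dependent_node_ids=[5, 10, 15, 20],
    )
    print(example)
```

Keeping the conversions in small functions makes the unit mismatch between the console (minutes) and the API (milliseconds) explicit and easy to check before a request is sent.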