This topic describes how to configure the Timeline parameters in the SubmitMediaProducingJob operation to produce videos that use intelligent processing features, such as speech-to-text conversion, text-to-speech conversion, and matting.
Usage notes
Intelligent production supports editing and compositing, effect rendering, and templates for live streams, VOD files, and material files from Object Storage Service (OSS). For more information, see Intelligent production overview.
You can produce a video from one or more videos, audio files, images, and subtitle materials by configuring Timeline parameters and calling the SubmitMediaProducingJob operation.
A timeline is created when you add materials and configure effects to create a video. A timeline consists of tracks, materials, and effects. For more information, see Timeline configurations.
For more information about how to use the IMS SDK to edit audio and video files, see Preparations.
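The track, material, and effect structure described above can be sketched as a nested object. This is a minimal sketch rather than the definitive schema: the key names VideoTracks, VideoTrackClips, MediaId, and Effects are assumptions that this topic does not confirm, the media IDs are placeholders, and serializing the timeline to a JSON string for the Timeline parameter is also an assumption. See Timeline configurations for the authoritative field list.

```python
import json


def build_basic_timeline(media_ids):
    """Return a timeline with one video track holding one clip per media ID.

    The key names are assumptions for illustration; check them against
    the Timeline configurations reference before use.
    """
    clips = [{"MediaId": media_id, "Effects": []} for media_id in media_ids]
    return {"VideoTracks": [{"VideoTrackClips": clips}]}


# Placeholder IDs; replace them with the IDs of your own materials.
timeline = build_basic_timeline(["your-media-id-1", "your-media-id-2"])

# Serialize the timeline for the Timeline parameter (assumed to be a JSON string).
timeline_json = json.dumps(timeline, ensure_ascii=False)
print(timeline_json)
```

Keeping the timeline as a plain dictionary until the last step makes it easy to append effects to a clip before serializing.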
Use AI_ASR to convert speech to text and add the text to a video
You can add the "Type": "AI_ASR" setting to convert speech to text. You can also set the subtitle style, such as the font and color.
Speech-to-text conversion is available in the China (Shanghai), China (Beijing), China (Hangzhou), and China (Shenzhen) regions.
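As an illustration of the setting described above, the following sketch attaches an AI_ASR effect to a single video clip. Only "Type": "AI_ASR" comes from this topic. The surrounding layout (VideoTracks, VideoTrackClips, MediaId, Effects) and the style keys Font and FontColor are assumptions, and the media ID is a placeholder.

```python
import json


def build_asr_timeline(media_id, font=None, font_color=None):
    """Return a timeline whose single video clip carries an AI_ASR effect."""
    effect = {"Type": "AI_ASR"}  # setting named in this topic
    # Optional subtitle style; "Font" and "FontColor" are assumed key names.
    if font is not None:
        effect["Font"] = font
    if font_color is not None:
        effect["FontColor"] = font_color
    clip = {"MediaId": media_id, "Effects": [effect]}  # assumed clip layout
    return {"VideoTracks": [{"VideoTrackClips": [clip]}]}


# Placeholder media ID; the color value is only an example.
timeline = build_asr_timeline("your-media-id", font_color="#ffffff")
print(json.dumps(timeline, indent=2))
```

Style keys are added only when a value is supplied, so an unstyled call produces the smallest possible effect object.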
Use AI_TTS to convert text to speech and add the speech to a video
You can add the "Type": "AI_TTS" setting to convert text to speech. This setting can be used together with "Type": "AI_ASR". The Content field specifies the text to convert. You can also set speech properties, such as Voice, SpeechRate, PitchRate, and Format.
Text-to-speech conversion is available in the China (Shanghai), China (Beijing), and China (Hangzhou) regions.
By default, AI_TTS splits sentences based on punctuation marks such as commas (,) and periods (.). You can define the subtitle style and line break mode for each split sentence.
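The following sketch shows one way the AI_TTS properties named above might be assembled into an audio clip. The field names Type, Content, Voice, SpeechRate, PitchRate, and Format come from this topic. Everything else is an assumption: the AudioTracks and AudioTrackClips layout, nesting AI_ASR under an Effects list on the speech clip, the default values, and the placeholder voice name and media ID.

```python
import json


def build_tts_clip(content, voice, speech_rate=0, pitch_rate=0,
                   audio_format="mp3", with_subtitles=True):
    """Return an audio clip that asks AI_TTS to synthesize `content`."""
    clip = {
        "Type": "AI_TTS",
        "Content": content,         # text to convert to speech
        "Voice": voice,
        "SpeechRate": speech_rate,  # default of 0 is an assumption
        "PitchRate": pitch_rate,    # default of 0 is an assumption
        "Format": audio_format,     # default of "mp3" is an assumption
    }
    if with_subtitles:
        # Assumed placement: AI_ASR as an effect on the synthesized clip.
        clip["Effects"] = [{"Type": "AI_ASR"}]
    return clip


def build_tts_timeline(video_media_id, tts_clip):
    """Combine one video clip with one synthesized audio clip."""
    return {
        "VideoTracks": [{"VideoTrackClips": [{"MediaId": video_media_id}]}],
        "AudioTracks": [{"AudioTrackClips": [tts_clip]}],
    }


clip = build_tts_clip("Hello, and welcome.", voice="your-voice-name")
timeline = build_tts_timeline("your-media-id", clip)
print(json.dumps(timeline, indent=2))
```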
Use AI_TTS to convert text to speech and use SSML to control the rhythm or pronunciation
The Content field in AI_TTS supports Speech Synthesis Markup Language (SSML). You can use SSML to correct the pronunciation of specialized words, add pauses, and add different emotions and styles to the speech.
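To illustrate adding a pause, the sketch below builds an SSML string and places it in the Content field of an AI_TTS clip. The speak and break elements come from the W3C SSML specification; which elements and attributes the service accepts is an assumption that this topic does not confirm. The helper name, voice name, and sample text are hypothetical.

```python
import json
import xml.etree.ElementTree as ET
from xml.sax.saxutils import escape


def ssml_with_pause(before, after, pause_ms=500):
    """Wrap two text fragments in <speak> with a <break> pause between them."""
    return (
        f"<speak>{escape(before)}"
        f'<break time="{int(pause_ms)}ms"/>'
        f"{escape(after)}</speak>"
    )


content = ssml_with_pause("Welcome to the demo.", "Let's get started.", 800)

# Parse the result locally; this raises ParseError if the markup is malformed.
ET.fromstring(content)

# Placeholder voice name; replace it with a voice that the service supports.
tts_clip = {"Type": "AI_TTS", "Content": content, "Voice": "your-voice-name"}
print(json.dumps(tts_clip, indent=2))
```

Escaping the text fragments keeps characters such as & and < from breaking the markup, and the local parse catches malformed SSML before a job is submitted.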
Use AI_Matting to configure green screen matting
You can configure the "Type": "AI_Matting" setting to extract an object from a green screen and seamlessly superimpose it onto a specified background video or image.
Green screen matting is available in the China (Hangzhou), China (Shanghai), and China (Beijing) regions.
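A minimal sketch of the green screen setup described above: one track holds the background and a second track holds the green screen footage with the AI_Matting effect. Only "Type": "AI_Matting" comes from this topic. The track and clip key names, the idea that later tracks are rendered above earlier ones, and the placeholder media IDs are assumptions.

```python
import json


def build_green_screen_timeline(background_media_id, foreground_media_id):
    """Return a two-track timeline: background below, matted footage above."""
    background_track = {
        "VideoTrackClips": [{"MediaId": background_media_id}]
    }
    foreground_track = {
        "VideoTrackClips": [
            {
                "MediaId": foreground_media_id,
                "Effects": [{"Type": "AI_Matting"}],  # setting named in this topic
            }
        ]
    }
    # Assumption: tracks later in the list are rendered above earlier ones.
    return {"VideoTracks": [background_track, foreground_track]}


timeline = build_green_screen_timeline("your-background-id", "your-green-screen-id")
print(json.dumps(timeline, indent=2))
```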
Use AI_RealMatting to configure background replacement
You can configure the "Type": "AI_RealMatting" setting to extract an object from any background and seamlessly superimpose it onto a specified background video or image.
Background replacement is available in the China (Hangzhou), China (Shanghai), and China (Beijing) regions.
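The sketch below builds a background replacement timeline and rejects regions outside the three listed above before any request is made. "Type": "AI_RealMatting" and the region list come from this topic, written here as the region IDs cn-hangzhou, cn-shanghai, and cn-beijing. The track and clip key names, the track ordering, the helper name, and the media IDs are assumptions.

```python
import json

# Regions listed in this topic for background replacement, as region IDs.
REAL_MATTING_REGIONS = {"cn-hangzhou", "cn-shanghai", "cn-beijing"}


def build_background_replacement_timeline(region_id, background_media_id,
                                          subject_media_id):
    """Return a two-track timeline that applies AI_RealMatting to the subject."""
    if region_id not in REAL_MATTING_REGIONS:
        raise ValueError(
            f"AI_RealMatting is not listed as available in {region_id!r}"
        )
    subject_clip = {
        "MediaId": subject_media_id,
        "Effects": [{"Type": "AI_RealMatting"}],  # setting named in this topic
    }
    # Assumption: the background track comes first so the subject renders on top.
    return {
        "VideoTracks": [
            {"VideoTrackClips": [{"MediaId": background_media_id}]},
            {"VideoTrackClips": [subject_clip]},
        ]
    }


timeline = build_background_replacement_timeline(
    "cn-shanghai", "your-background-id", "your-subject-id"
)
print(json.dumps(timeline, indent=2))
```

Checking the region locally is a convenience only; the service remains the authority on where the feature is available.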