Intelligent Media Management (IMM) is integrated with Object Storage Service (OSS), so you can use the data processing and analysis features of IMM directly from OSS. This topic describes how to access IMM features through the x-oss-process parameter of OSS.
Prerequisites
IMM-related features are available only in regions where IMM is available. For more information, see Endpoints.
Note: The APIs and SDKs for the new data processing feature are available in all regions that support IMM. Access to features of the new version from the console is in phased testing and is available only to select users in the Germany (Frankfurt) and China (Qingdao) regions.
IMM is activated.
Billing
Using IMM features incurs costs. For more information, see Billing overview.
Procedure
Step 1: Create an IMM project
Log on to the IMM Console.
In the lower part of the left-side navigation pane, click Try New Version. If Switch to Old Version is displayed in the lower part of the navigation pane, skip this step.
In the left-side navigation pane, click Project List.
Click Create Project.
Step 2: Bind an OSS bucket
Call the AttachOSSBucket operation to bind the project to an OSS bucket. This example calls the operation in OpenAPI Explorer.
Log on to OpenAPI Explorer. In the upper-left corner of the page, select the IMM API version and region.

To reduce cross-network latency and costs, bind the project only to an OSS bucket in the same region.
On the Parameters tab, enter the project name in the ProjectName field and the name of the bucket to bind in the OSSBucket field, then click Initiate Call.

View the response on the right.
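If you later script this step instead of using OpenAPI Explorer, the two request fields named above can be assembled as in the following minimal sketch. The helper name build_attach_request and the project and bucket names are hypothetical. The sketch only builds and prints the parameters; it does not sign or send a request.

```python
import json


def build_attach_request(project_name: str, oss_bucket: str) -> dict:
    """Assemble the two AttachOSSBucket fields named in this step."""
    if not project_name or not oss_bucket:
        raise ValueError("ProjectName and OSSBucket are both required")
    return {"ProjectName": project_name, "OSSBucket": oss_bucket}


# Hypothetical names, for illustration only.
params = build_attach_request("example-project", "example-bucket")
print(json.dumps(params, indent=2))
```

To actually send the call, pass these values to OpenAPI Explorer or to an IMM SDK client.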

Step 3: Grant permissions
For more information about permissions, see Permissions.
Permission configurations are required only for a RAM user or RAM role. If you are using an Alibaba Cloud account, skip this step.
Supported features
Anonymous access is not supported. For more information about how to generate signed URLs for access, see sign.
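The processing instruction travels as the x-oss-process query parameter of the object URL. The following sketch shows only how that unsigned part of the URL is composed. The helper name build_process_url and the bucket, endpoint, and object key are hypothetical examples. Because anonymous access is not supported, a URL built this way still has to be signed, for example by an OSS SDK, before it can be used.

```python
from urllib.parse import quote


def build_process_url(bucket: str, endpoint: str, key: str, process: str) -> str:
    """Compose an unsigned object URL that carries an x-oss-process instruction."""
    # Percent-encode the object key but keep "/" so the path stays readable.
    path = quote(key, safe="/")
    # Keep "/" and "," literal in the instruction; "_" is never escaped by quote.
    query = "x-oss-process=" + quote(process, safe="/,")
    return f"https://{bucket}.{endpoint}/{path}?{query}"


# Hypothetical bucket, endpoint, and object key.
url = build_process_url(
    "example-bucket", "oss-cn-hangzhou.aliyuncs.com", "videos/demo.mp4", "video/info"
)
print(url)
```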
Media processing
For more information, see Audio and video processing.
Parameter | Description
video/convert | Converts video files in OSS to the required format.
video/animation | Converts video files in OSS to an animated image format, such as GIF or WebP.
video/sprite | Captures frames from a video file in OSS, stitches them into a sprite sheet, and converts it to the required image format.
video/snapshots | Captures frames from a video file in OSS and converts them to the required image format.
video/concat | Concatenates video files in OSS into a single video and converts it to the required format.
audio/convert | Converts audio files in OSS to the required format.
audio/concat | Concatenates audio files in OSS into a single audio file and converts it to the required format.
audio/info | Extracts media format and stream information from an audio file in OSS.
video/info | Extracts media format and stream information from a video file in OSS.
Document processing
For more information about the parameters, see Document processing and Intelligent document processing.
Parameter | Description
doc/preview | Previews documents in OSS.
doc/edit | Collaboratively edits documents in OSS.
doc/convert | Creates a snapshot of a document in OSS.
doc/convert | Converts the format of documents in OSS.
| Translates text into multiple languages, such as Chinese and English.
| Polishes the content of a document.
| Automatically generates a brief summary of a document.
| Automatically generates coherent and logical follow-up content based on a given starting text, topic, and style.
| Optimizes the language and style of an existing document.
| Adjusts the tone and optimizes the expression of a document.
Image intelligence
For more information about the parameters, see IMG parameters.
Parameter | Description
image/faces | Detects the locations of faces in an image and analyzes facial attributes.
image/bodies | Detects the locations of human bodies in an image.
image/cars | Detects and analyzes vehicles and license plates in an image.
image/codes | Recognizes QR codes in an image.
image/labels | Recognizes labels for scenes, objects, and events in an image.
image/score | Provides a comprehensive score for the aesthetic quality (such as color and saturation) of an image.
image/blindwatermark | Adds a text-based blind watermark to an image.
image/deblindwatermark | Extracts a text-based blind watermark from an image.
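A client can check a processing instruction against the parameters listed in this topic before it sends a request. The following is a minimal sketch of such a check. The names SUPPORTED_ACTIONS and category_of are hypothetical, the parameter names are copied from the tables above (rows whose parameter is not shown are left out), and the sketch assumes that any options follow the action after a comma.

```python
# Parameter names copied from the tables in this topic. Rows whose
# parameter is not shown in the tables are intentionally left out.
SUPPORTED_ACTIONS = {
    "media": {
        "video/convert", "video/animation", "video/sprite", "video/snapshots",
        "video/concat", "audio/convert", "audio/concat", "audio/info", "video/info",
    },
    "document": {"doc/preview", "doc/edit", "doc/convert"},
    "image": {
        "image/faces", "image/bodies", "image/cars", "image/codes",
        "image/labels", "image/score", "image/blindwatermark",
        "image/deblindwatermark",
    },
}


def category_of(process: str) -> str:
    """Return the feature group of a processing instruction, or raise ValueError."""
    # Assumption: options, if any, follow the action after a comma.
    action = process.split(",", 1)[0].strip()
    for category, actions in SUPPORTED_ACTIONS.items():
        if action in actions:
            return category
    raise ValueError(f"{action!r} is not listed in this topic")


print(category_of("video/info"))
print(category_of("image/labels"))
```

Failing early on an unlisted action avoids a round trip to OSS for a request that cannot succeed.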