ApsaraVideo MediaBox SDK (Beta)

An integrated SDK for mobile devices, PCs, and web browsers.

Quick and Efficient Development of Audio and Video Applications

ApsaraVideo MediaBox SDK integrates core capabilities such as live stream ingest, video playback, short video production, retouching, and audio effects. It uses open source user interface (UI) components and provides product-level demos. You can integrate the corresponding SDKs or use the low-code tool AUI Kits to quickly build applications based on your business requirements.

Small SDK Package

ApsaraVideo MediaBox SDK integrates the core capabilities of ApsaraVideo Live and ApsaraVideo VOD and reuses core components to reduce the SDK package size by at least 50%.

Efficient Integration

You can integrate multiple SDKs at a time to improve integration efficiency and reduce R&D costs.

Flexible Access

You can combine SDKs based on your business requirements and package the SDKs to make them suitable for your business scenarios.

Ease of Use

ApsaraVideo MediaBox SDK provides product-level demos that support the full features of ApsaraVideo Live and ApsaraVideo VOD to meet your business requirements and facilitate integration.

Diverse Services

A comprehensive metric system ensures service quality. Audio and video AI technologies improve the utilization and distribution efficiency of audio and video content.

Stable Features

ApsaraVideo MediaBox SDK has been proven across hundreds of millions of requests and provides stable and reliable audio and video services. You can use ApsaraVideo MediaBox SDK to deliver smooth and clear audio and video.

Features

ApsaraVideo Player SDK

ApsaraVideo Player SDK is an adaptive audio and video playback tool that is developed by Alibaba Cloud. ApsaraVideo Player SDK provides convenient integration, coverage of multiple platforms, and excellent playback performance. It works with other cloud services to provide abundant playback features, diverse video security solutions, and comprehensive playback data services for a premium playback experience.

One-off Integration Covering Multiple Platforms

Supports multiple platforms such as web, Android, iOS, Flutter, Windows, macOS, and Linux. ApsaraVideo Player SDK provides multi-screen playback to meet your business requirements and exposes unified operations across platforms, which simplifies cross-platform integration.

Stable and Smooth Playback

Delivers stable and reliable performance. ApsaraVideo Player SDK supports more than 100 million video views per day, with an instant loading rate greater than 85% and less than 100 ms of stuttering per 100 seconds of playback. Comprehensive playback solutions and device-adaptive software and hardware decoding ensure smooth playback.
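
The two playback figures quoted above can be read as simple ratios. The following is a minimal sketch of that arithmetic, not part of ApsaraVideo Player SDK: the function names are hypothetical, and the 1000 ms "instant" threshold is an assumption, because this page does not state the threshold it uses.

```python
from typing import Sequence


def stutter_ratio(stutter_ms: float, playback_s: float) -> float:
    """Return the fraction of playback time spent stalled."""
    if playback_s <= 0:
        raise ValueError("playback_s must be positive")
    return (stutter_ms / 1000.0) / playback_s


def instant_load_rate(startup_ms: Sequence[float], threshold_ms: float = 1000.0) -> float:
    """Return the share of sessions whose startup time is within threshold_ms.

    threshold_ms is an assumed value for illustration only.
    """
    if not startup_ms:
        raise ValueError("startup_ms must not be empty")
    fast = sum(1 for t in startup_ms if t <= threshold_ms)
    return fast / len(startup_ms)


if __name__ == "__main__":
    # 100 ms of stuttering per 100 s of playback is a 0.1% stall ratio.
    print(f"stall ratio bound: {stutter_ratio(100, 100):.3%}")
    # Made-up sample startup times, only to show the calculation.
    print(f"instant load rate: {instant_load_rate([300, 800, 1500, 900]):.0%}")
```

Read this way, the stuttering bound means that at most about one thousandth of playback time is spent stalled.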

Composite Video Security Solution

Supports HTTP Live Streaming (HLS) encryption, Alibaba Cloud proprietary cryptography, and digital rights management (DRM) encryption, all of which are provided by ApsaraVideo VOD. ApsaraVideo Player SDK can decrypt and play files that are protected by these methods, which helps prevent infringements such as unauthorized download and distribution.

Push SDK

Push SDK is a development tool for ingesting streams from clients. It is built on the content delivery network (CDN) and real-time audio and video communication technologies of Alibaba Cloud, and supports platforms such as Android, iOS, and Flutter. Push SDK provides easy-to-use API operations, a smooth and network-adaptive playback experience, multi-node latency optimization, and an effective real-time retouching service.

Live Stream Ingest

Supports stream ingest over Real-Time Messaging Protocol (RTMP) and stream pulling over RTMP, Flash Video (FLV), HTTP Live Streaming (HLS), and Alibaba Real-Time Communication (ARTC), at resolutions from 180p to 720p. Stream ingest is available in landscape, mirroring, audio-only, mute, and background modes.

Real-time Streaming (RTS)

Supports WebRTC stream ingest over UDP. You can add domain names for WebRTC stream ingest in the console. Strong resistance to network instability keeps latency low, at the millisecond level.

Co-streaming

Provides co-streaming solutions based on Real-Time Communication (RTC). This helps implement interactive streaming that engages more users at an ultra-low latency.

Short Video SDK

The short video SDK is a developer tool that allows you to record, crop, edit, and upload videos. It provides stable and easy-to-use API operations in a unified format and an open source UI at the product level. You can use the features that are provided by the short video SDK based on your business requirements and modify the UI to implement custom development.

Video Recording

Allows you to configure recording settings such as multi-source recording, speed ramping, definition, and aspect ratio, and provides recording features such as real-time watermarks, real-time filters, real-time video effects, real-time retouching, and duet recording. The audio stream mixing, echo removal, and noise reduction features are also supported.

Video Template

Allows you to add music videos (MVs) and switch between different MVs on the editing UI, apply an Adobe After Effects template to the specified videos or images to create a personalized video, and crop multiple videos that use the same effects at a time.

Animated Stickers and Subtitles

Supports animated stickers, regular subtitles, bubble subtitles, word art, and subtitle flipping.

Queen SDK

Queen SDK provides retouching effects based on the proprietary IntelligentVision algorithm and various technologies for detecting and identifying faces and human bodies. This SDK supports a wide range of features including image retouching, face retouching, chroma key, stickers, makeup, gesture recognition, body shaping, hairdressing, and augmented reality (AR) writing.

Movement Detection

Accurately detects 18 key points on the human body in real time and identifies 13 static postures and 9 movements based on proprietary algorithms. The static postures include standing upright, raising hands, hand heart, arms akimbo, and superman pose. The movements include rope jumping, jumping jacks, squats, push-ups, and sit-ups. Counts of these postures and movements are reported in real time.

Real-time Filters

Provides more than 10 filters, rendered in real time to achieve natural effects. Real-time rendering lets you style your videos and meets video processing requirements in different scenarios. The filter library is continuously expanded and filter effects are continuously improved.

Animated Stickers

Uses the facial landmark detection technology and intelligent algorithms to attach stickers that seamlessly match up with faces and adapt to facial movements and expressions in real time. The sticker feature supports static and animated stickers and continuously expands the material library. This improves the diversity and interactive experience of your videos.

Blue-screen and Green-screen Image Matting

Supports blue-screen and green-screen image matting. Human bodies can be precisely recognized and extracted from the background from all angles, even if the background is busy and the human bodies are moving.
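
As a rough illustration of the idea behind green-screen matting, the sketch below replaces pixels in which green clearly dominates with the corresponding background pixel. It is a generic color-threshold chroma key written for this page, not the algorithm that Queen SDK uses. The function names and the `dominance` and `min_green` thresholds are assumptions chosen for the example.

```python
from typing import List, Tuple

Pixel = Tuple[int, int, int]  # (red, green, blue), each 0-255
Image = List[List[Pixel]]


def is_green_screen(pixel: Pixel, dominance: float = 1.3, min_green: int = 90) -> bool:
    """Return True if the pixel looks like green-screen backdrop.

    Both thresholds are illustrative assumptions, not tuned values.
    """
    r, g, b = pixel
    return g >= min_green and g > dominance * max(r, b)


def composite(foreground: Image, background: Image) -> Image:
    """Replace backdrop pixels in foreground with pixels from background."""
    if len(foreground) != len(background) or any(
        len(f_row) != len(b_row) for f_row, b_row in zip(foreground, background)
    ):
        raise ValueError("foreground and background must have the same size")
    return [
        [bg if is_green_screen(fg) else fg for fg, bg in zip(f_row, b_row)]
        for f_row, b_row in zip(foreground, background)
    ]


if __name__ == "__main__":
    fg = [[(10, 200, 20), (200, 180, 170)]]  # backdrop pixel, skin-tone pixel
    bg = [[(0, 0, 255), (0, 0, 255)]]        # solid blue replacement
    print(composite(fg, bg))
```

A per-pixel threshold like this fails on busy backgrounds, green spill, and motion blur, which is why production matting depends on recognizing the human body itself, as described above.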

AR Writing

Recognizes the trajectory of finger key points by using gesture recognition algorithms. The content that you write is rendered in the video based on whether your hand starts or stops writing. This way, you can add the AR writing effect to videos. AR writing can be used in various scenarios such as live streaming, teaching, and online interaction.

Face Retouching

Provides five levels of smart face retouching and seven features for manual face retouching. Manual face retouching allows you to whiten and smooth the skin, add rosy cheeks, sharpen images, whiten teeth, and remove blemishes, eye bags, and nasolabial folds. You can set the intensity level for each effect.

Audio Effect SDK

The audio effect SDK provides audio effects for scenarios such as films and television shows, real-time communication, and music understanding and processing. This allows you to enjoy high-quality audio and video interactions.

Dolby Atmos

Provides the Dolby Atmos technology throughout the production and playback of audio and video files. The audio effect SDK uses its binaural rendering technology to convert Dolby Atmos audio files that are encoded in the Ambisonics format into binaural audio tracks that contain complete spatial information. When you use a virtual reality (VR) headset or a mobile device with gyroscope sensors in free view mode, the audio effect SDK keeps the locations of sound sources fixed. This SDK is suitable for VR-based live streaming and VR-based video-on-demand (VOD) playback.

Surround Sound

Renders surround sound for the audio of films and television shows. The audio effect SDK uses unique acoustic algorithms to implement surround sound rendering on films and television shows that use stereo audio or the surround sound format such as 5.1 or 7.1. When you use earphones or speakers to play audio or videos on a device that supports binaural stereo audio, you can hear the sound from all directions. This brings a vivid, layered, and directional auditory experience at the cinema level.

Voice Enhancement

Provides high-quality voice enhancement. This feature highlights the dialogue in noisy scenes of films and television shows without changing the tone, so you do not miss any dialogue.

Scenarios

ApsaraVideo MediaBox SDK is suitable for scenarios such as audio and video playback, live stream ingest, co-streaming, short video production, and retouching.

Media Playback

Live Stream Ingest and Co-streaming

Short Video Production

Retouching


Documentation

Billing

Billable items and billing methods of ApsaraVideo MediaBox SDK

Before you use a module of ApsaraVideo MediaBox SDK, you must obtain a license for the module. The price of the license varies with the edition of the module and the features supported by the module. Some modules require you to obtain a license specific to the platform on which the modules run. You can create an application and select modules and their editions based on your business requirements.

Demo

Overview of ApsaraVideo MediaBox SDK

ApsaraVideo MediaBox SDK is an integrated audio and video service that is provided by Alibaba Cloud. It covers the core steps of live streaming and VOD services, such as video recording, video editing, video uploading, stream ingest, and playback. ApsaraVideo MediaBox SDK also integrates AI capabilities for audio and video, such as client-side audio and video enhancement and retouching.

Getting Started

Get started with ApsaraVideo MediaBox SDK

ApsaraVideo MediaBox SDK provides the following services: live stream ingest, co-streaming, short video, player, and retouching.
