Alibaba Cloud this week unveiled its latest open-source initiative to spur the development of video generation AI models.
It open-sourced a set of toolkits on its AI model community, ModelScope, that support the development of text-to-video models, including data processing tools, multimodal datasets, foundation models, and training and inference tools.
Video generation models require massive amounts of high-quality training data and advanced processing tools for multimodal datasets.
To tackle the data processing challenge, Alibaba Cloud open-sourced Data-Juicer, a one-stop data processing system that contains hundreds of dedicated operators and tools for video, image, audio, text, and other multimodal data.
It also open-sourced a denoising foundation model built on a small dataset. Developers can use this foundation model as a starting point for further training when developing their own video generation models.
Since ModelScope's launch, more than 4 million developers have used the platform to access more than 3,000 models and thousands of datasets.