Alibaba Cloud Model Studio integrates the complete Qwen model family and leading third-party models, covering multiple modalities and scenarios. Call models on demand without managing the underlying infrastructure, reducing your operational overhead.
Chat with an LLM to generate content, summaries, and more using just a few lines of code. Model Studio is compatible with OpenAI API specifications. Simply update the API key, base URL, and model name to migrate your existing OpenAI code to Model Studio. |
Model service
Model Studio provides out-of-the-box model services. Directly call Qwen models and third-party LLMs, such as DeepSeek and Kimi, with no deployment or maintenance required. Complete model list
Qwen flagship models:
Qwen-Max: The best-performing model in the Qwen3 series, suitable for complex, multi-step tasks.
Qwen-Plus: Offers a balance of performance, speed, and cost, making it the recommended choice for most scenarios.
The latest Qwen3.5-Plus series excels at various tasks, including language understanding, logical reasoning, code generation, agent tasks, and image and video understanding. We highly recommend this series for its versatility.
Qwen-Flash: Cost-effective and low-latency, suitable for simple tasks requiring fast responses.
Qwen-Coder: Excels at tool calling and environment interaction, and is specialized for code generation and understanding.
Multimodal coverage: Includes capabilities such as text generation, visual understanding, image generation, video generation, speech recognition and synthesis, and embedding.
Domain-specific models: For specific industries and tasks, provides domain models for long-text processing, translation, data mining, intent recognition, role-playing, and deep research.
Billing
Activating Model Studio does not incur any fees. You incur costs only when you call models. See Billable items and the Model pricing.
Free quota for new users
Model Studio provides new users with an exclusive free quota in the Singapore region. After the quota is exhausted, billing switches to pay-as-you-go. To avoid unexpected charges, turn on the Free quota only feature. The service will automatically stop when the quota runs out.
Payment methods
Model calls are automatically charged on an hourly basis. For supported payment methods, see Introduction to payment methods.
View bills and usage
Billing details: Go to the Billing Details and Cost Analysis pages.
Call statistics: Approximately one hour after making a model call, go to the Monitoring (Singapore),Monitoring (Virginia), or Monitoring (Beijing) page, set the filter criteria, and click Monitor in the Actions column. Then, view statistics such as call volume, token consumption, and success rate for the model. See Monitoring.
Coding Plan usage: If you have subscribed to a Coding Plan, view the quota consumption on the Coding Plan page. Coding Plan uses a fixed monthly fee and provides a monthly request quota for use in AI coding tools. See Coding Plan overview.
Get started with Model Studio
Try models online: Playground (Singapore), Playground (Virginia), or Playground (Beijing)
Make your first API request: Make the first call to a Qwen API