Models

Updated at: 2025-03-27 09:07

Alibaba Cloud Model Studio offers a wide variety of models. This topic describes all supported models in Model Studio.

Flagship model

Flagship models

通义new Qwen-Max

Best inference performance

通义new Qwen-Plus

Balanced performance, speed and cost

通义new Qwen-Turbo

Fast speed and low cost

Maximum context

(Tokens)

32,768

131,072

1,008,192

Minimum input price

(1,000 tokens)

$0.0016

$0.0004

$0.00005

Minimum output price

(1,000 tokens)

$0.0064

$0.0012

$0.0002

Model overview

Category

Model

Description

Category

Model

Description

Text generation

Qwen

Embedding

Text embedding

Converts text into numerical representations, suitable for search, clustering, recommendation, and classification tasks.

Text generation-Qwen

The commercial models of the Qwen series, boasts the latest capabilities and enhancements over its open source counterpart.

QwQ

QwQ reasoning model, trained based on Qwen2.5, has made significant improvements in reasoning capabilities by reinforcement learning. Its performance against core mathematic and coding metrics (AIME 24/25, LiveCodeBench) and general metrics (IFEval, LiveBench, etc.) have reached the level of DeepSeek-R1. Usage instructions

Name

Version

Context

Maximum input

Maximum CoT

Maximum response

Input price

Output price

Free quota

(Note)

(Tokens)

(1,000 tokens)

Name

Version

Context

Maximum input

Maximum CoT

Maximum response

Input price

Output price

Free quota

(Note)

(Tokens)

(1,000 tokens)

qwq-plus

Stable

131,072

98,304

32,768

8,192

$0.0008

$0.0024

1 million tokens

Valid for 180 days after activation

Qwen-Max

Qwen-Max provides the best inference performance among Qwen models, especially for complex and multi-step tasks. Usage instructions | API reference | Try online

Name

Version

Context

Maximum input

Maximum output

Input price

Output price

Free quota

(Note)

(Tokens)

(1,000 tokens)

Name

Version

Context

Maximum input

Maximum output

Input price

Output price

Free quota

(Note)

(Tokens)

(1,000 tokens)

qwen-max

Stable

32,768

30,720

8,192

$0.0016

Batch: $0.0008

$0.0064

Batch: $0.0032

1 million tokens each

Valid for 180 days after activation

qwen-max-latest

Latest

$0.0016

$0.0064

qwen-max-2025-01-25

Also qwen-max-0125 or Qwen2.5-Max

Snapshot

Qwen-Plus

Qwen-Plus provides a balanced combination of performance, speed, and cost, ideal for moderately complex tasks. Usage instructions | API reference | Try online

Name

Version

Context

Maximum input

Maximum output

Input price

Output price

Free quota

(Note)

(Tokens)

(1,000 tokens)

Name

Version

Context

Maximum input

Maximum output

Input price

Output price

Free quota

(Note)

(Tokens)

(1,000 tokens)

qwen-plus

Stable

131,072

129,024

8,192

$0.0004

Batch: $0.0002

$0.0012

Batch: $0.0006

1 million tokens each

Valid for 180 days after activation

qwen-plus-latest

Latest

$0.0004

$0.0012

qwen-plus-2025-01-25

Also qwen-plus-0125

Snapshot

Qwen-Turbo

Qwen-Turbo provides fast speed and low cost, suitable for simple tasks. Usage instructions | API reference | Try online

Name

Version

Context

Maximum input

Maximum output

Input price

Output price

Free quota

(Note)

(Tokens)

(1,000 tokens)

Name

Version

Context

Maximum input

Maximum output

Input price

Output price

Free quota

(Note)

(Tokens)

(1,000 tokens)

qwen-turbo

Stable

1,008,192

1,000,000

8,192

$0.00005

Batch: $0.000025

$0.0002

Batch: $0.0001

1 million tokens each

Valid for 180 days after activation

qwen-turbo-latest

Latest

$0.00005

$0.0002

qwen-turbo-2024-11-01

Also qwen-turbo-1101

Snapshot

Qwen-VL

Qwen-VL is a text generation model that can understand and process images. The model performs OCR operations and provides further functionalities, such as summarizing and reasoning. For example, it can extract product attributes from photos, and solving problems from images. Usage instructions | API reference | Try online

Qwen-VL is billed based on the total number of input and output tokens.
Image token calculation rule: Every 28 × 28 pixels count as 1 token. Each image converts to at least 4 tokens. For more information, see Calculate image tokens.

Name

Version

Context

Maximum input

Maximum output

Input price

Output price

Free quota

(Note)

(Tokens)

(1,000 tokens)

Name

Version

Context

Maximum input

Maximum output

Input price

Output price

Free quota

(Note)

(Tokens)

(1,000 tokens)

qwen-vl-max

Enhanced capabilities of visual reasoning and instruction following compared with qwen-vl-plus. Best for complex tasks.

Stable

32,768

30,720

Up to 16,384 tokens per image

2,048

$0.0008

$0.0032

1 million tokens each

Valid for 180 days after activation

qwen-vl-plus

Enhanced detail and text recognition capabilities, supporting images with over one million pixel resolution and any aspect ratio. Exceptional performance for various visual tasks.

Stable

$0.00021

$0.00063

Qwen-MT

Qwen-MT is a large language model for machine translation built based on Qwen. It specializes in Chinese-English translation and multilingual translation between Chinese/English and 24 other languages, including Japanese, Korean, French, Spanish, German, Portuguese (Brazilian), Thai, Indonesian, Vietnamese, and Arabic. Qwen-MT also provides capabilities such as terminology intervention, domain prompting, and translation memory to enhance translation quality in complex scenarios. Usage instructions

Name

Context

Maximum input

Maximum output

Input price

Output price

Free quota

(Note)

(Tokens)

(1,000 tokens)

Name

Context

Maximum input

Maximum output

Input price

Output price

Free quota

(Note)

(Tokens)

(1,000 tokens)

qwen-mt-plus

2,048

1,024

1,024

$0.00246

$0.00737

500,000 tokens each

Valid for 180 days after activation

qwen-mt-turbo

$0.00016

$0.00049

Text generation - Qwen - open source

  • In the model name, 'xxb' indicates the parameter scale. For example, 'qwen2-72b-instruct' has 72 billion parameters.

  • Model Studio facilitates the use of open source Qwen models without the need for local deployment. Qwen2 is most recommended among the open source models.

Qwen-Omni

Qwen-Omni is a omni-modal understanding and generation model trained on Qwen2.5. It can understand text, image, audio, and video swiftly. It can also generate text and voice simultaneously in stream. Usage instructionsAPI reference

Name

Context

Maximum input

Maximum output

Free quota

(Note)

(Tokens)

Name

Context

Maximum input

Maximum output

Free quota

(Note)

(Tokens)

qwen2.5-omni-7b

32,768

30,720

2,048

1 million tokens (regardless of modality)

Valid for 180 days after activation

After the free quota runs out, you cannot access qwen2.5-omni-7b. Please stay tuned for updates.

Qwen2.5

Qwen2.5 is the latest series of the Qwen LLM. For Qwen2.5, we have launched a series of base and instruct models with parameter sizes ranging from 7 billion to 72 billion. Qwen2.5 has made the following improvements over Qwen2:

  • Qwen2.5 is pre-trained on our latest large-scale dataset containing 18 trillion tokens.

  • Thanks to our expert models in specific fields, Qwen2.5 has significantly increased knowledge and greatly improved coding and maths capabilities.

  • Qwen2.5 has shown significant improvements in following instructions, generating long texts (over 8K tokens), understanding structured data (such as tables), and generating structured outputs (especially JSON). It supports more diversified system prompts, enhancing its role-playing and conditional setting as a chatbot.

  • Qwen2.5 supports over 29 languages, including Chinese, English, French, Spanish, Portuguese, German, Italian, Russian, Japanese, Korean, Vietnamese, Thai, Arabic, and more.

Usage instructions | API reference | Try online

Name

Context

Maximum input

Maximum output

Input price

Output price

(Tokens)

(1,000 tokens)

Name

Context

Maximum input

Maximum output

Input price

Output price

(Tokens)

(1,000 tokens)

qwen2.5-14b-instruct-1m

1,008,192

1,000,000

8,192

Time-limited free trial

qwen2.5-7b-instruct-1m

qwen2.5-72b-instruct

131,072

129,024

qwen2.5-32b-instruct

qwen2.5-14b-instruct

qwen2.5-7b-instruct

Qwen2

The open-source Qwen2 models. Usage instructions | API reference | Try online

Name

Context

Maximum input

Maximum output

Input price

Output price

(Tokens)

(1,000 tokens)

Name

Context

Maximum input

Maximum output

Input price

Output price

(Tokens)

(1,000 tokens)

qwen2-72b-instruct

131,072

128,000

6,144

Time-limited free trial

qwen2-57b-a14b-instruct

65,536

63,488

qwen2-7b-instruct

131,072

128,000

Qwen1.5

The open-source Qwen1.5 models. Usage instructions | API reference | Try online

Name

Context

Maximum input

Maximum output

Input price

Output price

(Tokens)

(1,000 tokens)

Name

Context

Maximum input

Maximum output

Input price

Output price

(Tokens)

(1,000 tokens)

qwen1.5-110b-chat

8,000

6,000

2,000

Time-limited free trial

qwen1.5-72b-chat

qwen1.5-32b-chat

qwen1.5-14b-chat

qwen1.5-7b-chat

Qwen-VL - open source

The open-source version of Qwen-VL. Usage instructions | API reference

Qwen2.5-VL has made the following improvements over Qwen2-VL:

  • Richer perception of the world: Qwen2.5-VL is good at recognizing common objects such as flowers, birds, fish, and insects, as well as analyzing text, charts, icons, graphics, and layouts within images.

  • Long video understanding: Qwen2.5-VL can understand videos of up to 10 minutes. It can also pinpoint video segments to capture events.

  • Visual locating: Qwen2.5-VL can accurately locate objects in images by generating bounding boxes (coordinates for the top-left and bottom-right corners) or points (coordinates for the center of the bounding box). It can provide stable JSON outputs for these coordinates.

  • Structured output: Qwen2.5-VL supports structured output for data such as invoices, forms, and tables, suitable in finance, business, among other scenarios.

Name

Context

Maximum input

Maximum output

Input price

Output price

(Tokens)

(1,000 tokens)

Name

Context

Maximum input

Maximum output

Input price

Output price

(Tokens)

(1,000 tokens)

qwen2.5-vl-72b-instruct

131,072

129,024

Up to 16,384 per image

8,192

Time-limited free trial

qwen2.5-vl-32b-instruct

Time-limited free trial

After the free quota runs out, you cannot access the model. Stay tuned for future updates.

qwen2.5-vl-7b-instruct

Time-limited free trial

qwen2.5-vl-3b-instruct

Text embedding

Converts text into numerical representations, suitable for search, clustering, recommendation, and classification tasks. Billed based on the number of input tokens. API reference

Name

Vector dimensions

Maximum rows

Maximum tokens per row

Supported languages

Price

(1,000 input tokens)

Free quota

(Note)

Name

Vector dimensions

Maximum rows

Maximum tokens per row

Supported languages

Price

(1,000 input tokens)

Free quota

(Note)

text-embedding-v3

1,024 (default), 768 or 512

10

8,192

Chinese, English, Spanish, French, Portuguese, Indonesian, Japanese, Korean, German, Russian, and more than 50 other languages

Time-limited free trial

500,000 tokens

Valid for 180 days after activation

  • On this page (1)
  • Flagship model
  • Model overview
  • Text generation-Qwen
  • QwQ
  • Qwen-Max
  • Qwen-Plus
  • Qwen-Turbo
  • Qwen-VL
  • Qwen-MT
  • Text generation - Qwen - open source
  • Qwen-Omni
  • Qwen2.5
  • Qwen2
  • Qwen1.5
  • Qwen-VL - open source
  • Text embedding
Feedback
phone Contact Us

Chat now with Alibaba Cloud Customer Service to assist you in finding the right products and services to meet your needs.

alicare alicarealicarealicare