Text generation - Qwen
The commercial models of the Qwen series offer the latest capabilities and enhancements over their open-source counterparts.
QwQ
The QwQ reasoning model, trained on Qwen2.5, has significantly improved its reasoning capabilities through reinforcement learning. Its performance on core mathematics and coding benchmarks (AIME 24/25, LiveCodeBench) and on general benchmarks (IFEval, LiveBench, etc.) has reached the level of DeepSeek-R1. Usage instructions
Name | Version | Context (tokens) | Maximum input (tokens) | Maximum CoT (tokens) | Maximum response (tokens) | Input price (per 1,000 tokens) | Output price (per 1,000 tokens) | Free quota (Note) |
qwq-plus | Stable | 131,072 | 98,304 | 32,768 | 8,192 | $0.0008 | $0.0024 | 1 million tokens; valid for 180 days after activation |
Qwen-Max
Qwen-Max provides the best inference performance among Qwen models, especially for complex and multi-step tasks. Usage instructions | API reference | Try online
Name | Version | Context (tokens) | Maximum input (tokens) | Maximum output (tokens) | Input price (per 1,000 tokens) | Output price (per 1,000 tokens) | Free quota (Note) |
qwen-max | Stable | 32,768 | 30,720 | 8,192 | $0.0016 (Batch: $0.0008) | $0.0064 (Batch: $0.0032) | 1 million tokens each; valid for 180 days after activation |
qwen-max-latest | Latest | $0.0016 | $0.0064 |
qwen-max-2025-01-25 (also qwen-max-0125 or Qwen2.5-Max) | Snapshot |
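Because prices are quoted per 1,000 tokens, a request's cost can be estimated from its token counts. The following is a minimal sketch using the qwen-max prices from the table above; the function name `estimate_cost` and the `mode` parameter are illustrative assumptions, and the sketch ignores the free quota.

```python
# Prices in USD per 1,000 tokens, copied from the qwen-max row above.
QWEN_MAX_PRICES = {
    "standard": {"input": 0.0016, "output": 0.0064},
    "batch": {"input": 0.0008, "output": 0.0032},
}


def estimate_cost(input_tokens: int, output_tokens: int, mode: str = "standard") -> float:
    """Estimate the USD cost of one qwen-max request (hypothetical helper).

    Assumes billing is linear in token count and that no free quota applies.
    """
    prices = QWEN_MAX_PRICES[mode]
    input_cost = (input_tokens / 1000) * prices["input"]
    output_cost = (output_tokens / 1000) * prices["output"]
    return input_cost + output_cost


if __name__ == "__main__":
    # 20,000 input tokens and 2,000 output tokens.
    print(f"standard: ${estimate_cost(20_000, 2_000):.4f}")
    print(f"batch:    ${estimate_cost(20_000, 2_000, mode='batch'):.4f}")
```

With these inputs the standard estimate is 20 × $0.0016 + 2 × $0.0064 = $0.0448, and the batch estimate is half of that.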
Qwen-Plus
Qwen-Plus provides a balanced combination of performance, speed, and cost, ideal for moderately complex tasks. Usage instructions | API reference | Try online
Name | Version | Context (tokens) | Maximum input (tokens) | Maximum output (tokens) | Input price (per 1,000 tokens) | Output price (per 1,000 tokens) | Free quota (Note) |
qwen-plus | Stable | 131,072 | 129,024 | 8,192 | $0.0004 (Batch: $0.0002) | $0.0012 (Batch: $0.0006) | 1 million tokens each; valid for 180 days after activation |
qwen-plus-latest | Latest | $0.0004 | $0.0012 |
qwen-plus-2025-01-25 (also qwen-plus-0125) | Snapshot |
Qwen-Turbo
Qwen-Turbo is fast and inexpensive, making it suitable for simple tasks. Usage instructions | API reference | Try online
Name | Version | Context (tokens) | Maximum input (tokens) | Maximum output (tokens) | Input price (per 1,000 tokens) | Output price (per 1,000 tokens) | Free quota (Note) |
qwen-turbo | Stable | 1,008,192 | 1,000,000 | 8,192 | $0.00005 (Batch: $0.000025) | $0.0002 (Batch: $0.0001) | 1 million tokens each; valid for 180 days after activation |
qwen-turbo-latest | Latest | $0.00005 | $0.0002 |
qwen-turbo-2024-11-01 (also qwen-turbo-1101) | Snapshot |
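The context, maximum input, and maximum output columns can be used to check a request before sending it. The sketch below copies the limits of the three stable commercial text models from the tables above. It assumes that input and requested output together must also fit within the context window; the tables do not state this rule explicitly, and the names `Limits` and `fits` are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Limits:
    context: int     # total context window, in tokens
    max_input: int   # maximum input tokens
    max_output: int  # maximum output tokens


# Values copied from the Qwen-Max, Qwen-Plus, and Qwen-Turbo tables above.
LIMITS = {
    "qwen-max": Limits(32_768, 30_720, 8_192),
    "qwen-plus": Limits(131_072, 129_024, 8_192),
    "qwen-turbo": Limits(1_008_192, 1_000_000, 8_192),
}


def fits(model: str, input_tokens: int, requested_output: int) -> bool:
    """Return True if the request respects all three limits.

    Assumption: input + output must not exceed the context window.
    """
    lim = LIMITS[model]
    return (
        input_tokens <= lim.max_input
        and requested_output <= lim.max_output
        and input_tokens + requested_output <= lim.context
    )


if __name__ == "__main__":
    print(fits("qwen-turbo", 1_000_000, 8_192))  # input and output both at their limits
    print(fits("qwen-max", 30_720, 8_192))       # sum exceeds the 32,768-token context
```

Under this assumption, a full-length qwen-max input leaves room for only 2,048 output tokens, while qwen-turbo can accept the maximum input and the maximum output at the same time.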
Qwen-VL
Qwen-VL is a text generation model that can understand and process images. The model performs OCR and provides further functionality, such as summarizing and reasoning. For example, it can extract product attributes from photos and solve problems shown in images. Usage instructions | API reference | Try online
Qwen-VL is billed based on the total number of input and output tokens.
Image token calculation rule: Every 28 × 28 pixels count as 1 token. Each image converts to at least 4 tokens. For more information, see Calculate image tokens.
Name | Version | Context (tokens) | Maximum input (tokens) | Maximum output (tokens) | Input price (per 1,000 tokens) | Output price (per 1,000 tokens) | Free quota (Note) |
qwen-vl-max (Enhanced visual reasoning and instruction following compared with qwen-vl-plus. Best for complex tasks.) | Stable | 32,768 | 30,720 (up to 16,384 tokens per image) | 2,048 | $0.0008 | $0.0032 | 1 million tokens each; valid for 180 days after activation |
qwen-vl-plus (Enhanced detail and text recognition capabilities, supporting images with over one million pixels and any aspect ratio. Exceptional performance for various visual tasks.) | Stable | $0.00021 | $0.00063 |
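The image token rule above (one token per 28 × 28 pixels, at least 4 tokens per image) and the 16,384-token per-image limit listed for qwen-vl-max can be approximated as follows. This is a rough sketch, not the service's actual algorithm: rounding partial 28-pixel tiles up and clamping at the upper limit are assumptions, and the function name `estimate_image_tokens` is hypothetical. See Calculate image tokens for the authoritative rule.

```python
import math

TILE_SIDE = 28        # every 28 x 28 pixels count as 1 token
MIN_TOKENS = 4        # each image converts to at least 4 tokens
MAX_TOKENS = 16_384   # per-image limit listed for qwen-vl-max


def estimate_image_tokens(width: int, height: int) -> int:
    """Estimate the token count of one image (hypothetical helper).

    Assumptions: partial tiles are rounded up, and oversized images are
    clamped to MAX_TOKENS rather than rejected.
    """
    if width <= 0 or height <= 0:
        raise ValueError("width and height must be positive")
    cols = math.ceil(width / TILE_SIDE)
    rows = math.ceil(height / TILE_SIDE)
    return max(MIN_TOKENS, min(cols * rows, MAX_TOKENS))


if __name__ == "__main__":
    # 1120 / 28 = 40 columns and 840 / 28 = 30 rows of tiles.
    print(estimate_image_tokens(1120, 840))
    # A single 28 x 28 tile is raised to the 4-token minimum.
    print(estimate_image_tokens(28, 28))
```

For a 1120 × 840 image the estimate is 40 × 30 = 1,200 tokens, which can then be multiplied by the input price per 1,000 tokens.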
Qwen-MT
Qwen-MT is a large language model for machine translation built on Qwen. It specializes in Chinese-English translation and in multilingual translation between Chinese or English and 24 other languages, including Japanese, Korean, French, Spanish, German, Portuguese (Brazilian), Thai, Indonesian, Vietnamese, and Arabic. Qwen-MT also provides terminology intervention, domain prompting, and translation memory to improve translation quality in complex scenarios. Usage instructions
Name | Context (tokens) | Maximum input (tokens) | Maximum output (tokens) | Input price (per 1,000 tokens) | Output price (per 1,000 tokens) | Free quota (Note) |
qwen-mt-plus | 2,048 | 1,024 | 1,024 | $0.00246 | $0.00737 | 500,000 tokens each; valid for 180 days after activation |
qwen-mt-turbo | $0.00016 | $0.00049 |
Text generation - Qwen - open source
In the model name, 'xxb' indicates the parameter scale. For example, 'qwen2-72b-instruct' has 72 billion parameters.
Model Studio facilitates the use of open source Qwen models without the need for local deployment. Qwen2 is most recommended among the open source models.
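The 'xxb' naming convention described above can be read programmatically. The sketch below extracts the parameter scale, in billions, from a model name; the helper name `parameter_billions` is hypothetical, and the pattern deliberately skips suffixes such as 'a14b' in 'qwen2-57b-a14b-instruct' so that only the first standalone 'xxb' segment is reported.

```python
import re
from typing import Optional

# A number followed by "b" that is not glued to a preceding letter, digit,
# or dot, so "72b" matches but the "5" in "qwen1.5" and "a14b" do not.
_SIZE_PATTERN = re.compile(r"(?<![a-z0-9.])(\d+(?:\.\d+)?)b(?![a-z0-9])")


def parameter_billions(model_name: str) -> Optional[float]:
    """Return the parameter scale in billions, or None if the name has none."""
    match = _SIZE_PATTERN.search(model_name.lower())
    return float(match.group(1)) if match else None


if __name__ == "__main__":
    for name in ("qwen2-72b-instruct", "qwen1.5-110b-chat",
                 "qwen2-57b-a14b-instruct", "qwen-plus"):
        print(name, parameter_billions(name))
```

Commercial names such as qwen-plus carry no size segment, so the helper returns None for them.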
Qwen-Omni
Qwen-Omni is an omni-modal understanding and generation model trained on Qwen2.5. It can quickly understand text, image, audio, and video input, and it can generate text and speech simultaneously as a stream. Usage instructions | API reference
Name | Context (tokens) | Maximum input (tokens) | Maximum output (tokens) | Free quota (Note) |
qwen2.5-omni-7b | 32,768 | 30,720 | 2,048 | 1 million tokens (regardless of modality); valid for 180 days after activation |
After the free quota runs out, you cannot access qwen2.5-omni-7b. Please stay tuned for updates.
Qwen2.5
Qwen2.5 is the latest series of the Qwen LLM. For Qwen2.5, we have launched a series of base and instruct models with parameter sizes ranging from 7 billion to 72 billion. Qwen2.5 has made the following improvements over Qwen2:
Qwen2.5 is pre-trained on our latest large-scale dataset containing 18 trillion tokens.
Thanks to our expert models in specific fields, Qwen2.5 has significantly more knowledge and greatly improved coding and math capabilities.
Qwen2.5 has shown significant improvements in following instructions, generating long texts (over 8K tokens), understanding structured data (such as tables), and generating structured outputs (especially JSON). It handles a wider variety of system prompts, which improves role-play and condition-setting for chatbots.
Qwen2.5 supports over 29 languages, including Chinese, English, French, Spanish, Portuguese, German, Italian, Russian, Japanese, Korean, Vietnamese, Thai, Arabic, and more.
Usage instructions | API reference | Try online
Name | Context (tokens) | Maximum input (tokens) | Maximum output (tokens) | Input price (per 1,000 tokens) | Output price (per 1,000 tokens) |
qwen2.5-14b-instruct-1m | 1,008,192 | 1,000,000 | 8,192 | Time-limited free trial |
qwen2.5-7b-instruct-1m |
qwen2.5-72b-instruct | 131,072 | 129,024 |
qwen2.5-32b-instruct |
qwen2.5-14b-instruct |
qwen2.5-7b-instruct |
Qwen2
The open-source Qwen2 models. Usage instructions | API reference | Try online
Name | Context (tokens) | Maximum input (tokens) | Maximum output (tokens) | Input price (per 1,000 tokens) | Output price (per 1,000 tokens) |
qwen2-72b-instruct | 131,072 | 128,000 | 6,144 | Time-limited free trial |
qwen2-57b-a14b-instruct | 65,536 | 63,488 |
qwen2-7b-instruct | 131,072 | 128,000 |
Qwen1.5
The open-source Qwen1.5 models. Usage instructions | API reference | Try online
Name | Context (tokens) | Maximum input (tokens) | Maximum output (tokens) | Input price (per 1,000 tokens) | Output price (per 1,000 tokens) |
qwen1.5-110b-chat | 8,000 | 6,000 | 2,000 | Time-limited free trial |
qwen1.5-72b-chat |
qwen1.5-32b-chat |
qwen1.5-14b-chat |
qwen1.5-7b-chat |
Qwen-VL - open source
The open-source version of Qwen-VL. Usage instructions | API reference
Qwen2.5-VL has made the following improvements over Qwen2-VL:
Richer perception of the world: Qwen2.5-VL is good at recognizing common objects such as flowers, birds, fish, and insects, as well as analyzing text, charts, icons, graphics, and layouts within images.
Long video understanding: Qwen2.5-VL can understand videos of up to 10 minutes. It can also pinpoint video segments to capture events.
Visual locating: Qwen2.5-VL can accurately locate objects in images by generating bounding boxes (coordinates for the top-left and bottom-right corners) or points (coordinates for the center of the bounding box). It can provide stable JSON outputs for these coordinates.
Structured output: Qwen2.5-VL supports structured output for data such as invoices, forms, and tables, which suits finance, business, and similar scenarios.
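The relationship between the two visual locating formats described above, bounding boxes given by their top-left and bottom-right corners and points at the box center, can be sketched as follows. The JSON layout and the key names `label` and `bbox_2d` are assumptions made for illustration; check the API reference for the actual output schema.

```python
import json
from typing import List, Sequence, Tuple


def box_center(box: Sequence[float]) -> Tuple[float, float]:
    """Convert [x1, y1, x2, y2] (top-left, bottom-right) to its center point."""
    x1, y1, x2, y2 = box
    return ((x1 + x2) / 2, (y1 + y2) / 2)


def parse_detections(text: str) -> List[Tuple[str, Tuple[float, float]]]:
    """Parse a JSON list of detections into (label, center) pairs.

    The "label" and "bbox_2d" keys are hypothetical field names.
    """
    return [(item["label"], box_center(item["bbox_2d"])) for item in json.loads(text)]


if __name__ == "__main__":
    # Hypothetical model output with one detected object.
    sample = '[{"label": "bird", "bbox_2d": [100, 40, 300, 200]}]'
    print(parse_detections(sample))
```

For the sample box, the center is ((100 + 300) / 2, (40 + 200) / 2) = (200.0, 120.0).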
Name | Context (tokens) | Maximum input (tokens) | Maximum output (tokens) | Input price (per 1,000 tokens) | Output price (per 1,000 tokens) |
qwen2.5-vl-72b-instruct | 131,072 | 129,024 (up to 16,384 per image) | 8,192 | Time-limited free trial |
qwen2.5-vl-32b-instruct | Time-limited free trial. After the free quota runs out, you cannot access the model. Stay tuned for future updates. |
qwen2.5-vl-7b-instruct | Time-limited free trial |
qwen2.5-vl-3b-instruct |