Today, DeepSeek released two models, V4-Pro and V4-Flash. Their architecture and technical advantages can be summarized as follows:
This brings improvements in model performance and cost-effectiveness, including:
DeepSeek-V4 supports the OpenAI ChatCompletions interface and the Anthropic interface. When calling the new model API, the Model parameter needs to be changed to deepseek-v4-pro or deepseek-v4-flash.
Alibaba Cloud AI Gateway provides management capabilities for Model API, Agent API, and MCP Server, and now supports management of the DeepSeek-V4 API first. Through Alibaba Cloud AI Gateway, you can call DeepSeek-V4 API services, including thinking, multi-turn dialogue, Tool Call, Anthropic /v1/messages compatible calls, and more. It also supports integration of DeepSeek-V4 on Claude Code, and additionally implements fallback capabilities between DeepSeek-V4 and other models such as Qwen.
Open the AI Gateway page, click to enter the console, and click the target instance ID. In the left navigation bar, click Model API, then click Create Model API.

After entering the Create Model API form, you can configure it as follows:

BasePath must be unique./. You can choose whether to enable remove when forwarding to backend services.After configuration, run a test case:


Building Cross-Cloud Observability: One Architecture, Unified Analytics
721 posts | 58 followers
FollowFarruh - May 26, 2026
Alibaba Cloud Native Community - February 13, 2026
Alibaba Cloud Native Community - February 13, 2025
Alibaba Cloud Native Community - March 10, 2025
Alibaba Container Service - July 10, 2025
Alibaba Container Service - May 27, 2025
721 posts | 58 followers
Follow
Alibaba Cloud Model Studio
A one-stop generative AI platform to build intelligent applications that understand your business, based on Qwen model series such as Qwen-Max and other popular models
Learn More
Qwen
Full-range, open-source, multimodal, and multi-functional
Learn More
Alibaba Cloud for Generative AI
Accelerate innovation with generative AI to create new business success
Learn More
AI Acceleration Solution
Accelerate AI-driven business and AI model training and inference with Alibaba Cloud GPU technology
Learn MoreMore Posts by Alibaba Cloud Native Community