providers/Alibaba
Alibaba · Hangzhou, China

Alibaba models on AIgateway — pricing, context, capabilities

Alibaba ships 11 models on AIgateway spanning embedding, image, reasoning, text. Call any of them via the OpenAI-compatible endpoint at api.aigateway.sh/v1 with one key. Pass-through inference pricing plus a 5% platform fee at credit top-up. No per-call markups, no seat fees, no minimum.

Get your key →See pricingVisit Alibaba
models · 11modalities · embedding, image, reasoning, textlocation · Hangzhou, China
embedding

Alibaba embedding models

1 embedding model from Alibaba.

Qwen3-Embedding-0.6b
qwen/qwen3-embedding-0.6b
The Qwen3 Embedding model series is the latest proprietary model of the Qwen family, specifically designed for text embedding and ranking tasks.
see pricing8,192 ctx
image

Alibaba image models

1 image model from Alibaba.

Wan 2.6 Image
alibaba/wan-2.6-image
Alibaba's Wan 2.6 text-to-image model generating images from text prompts with optional negative prompts and customizable dimensions.
$0.030 / image
reasoning

Alibaba reasoning models

1 reasoning model from Alibaba.

Qwq-32b
qwen/qwq-32b
QwQ is the reasoning model of the Qwen series. Compared with conventional instruction-tuned models, QwQ, which is capable of thinking and reasoning, can achieve significantly enhanced performance in downstream tasks, especially hard problems. QwQ-32B is the medium-sized reasoning model, which is capable of achieving competitive performance against state-of-the-art reasoning models, e.g., DeepSeek-R1, o1-mini.
$0.200 in · $0.400 out / 1M24,000 ctx
text

Alibaba text models

8 text models from Alibaba.

Qwen2.5-Coder-32b-Instruct
★ featured
qwen/qwen2.5-coder-32b-instruct
Qwen2.5-Coder is the latest series of Code-Specific Qwen large language models (formerly known as CodeQwen). As of now, Qwen2.5-Coder has covered six mainstream model sizes, 0.5, 1.5, 3, 7, 14, 32 billion parameters, to meet the needs of different developers. Qwen2.5-Coder brings the following improvements upon CodeQwen1.5:
$0.660 in · $1.00 out / 1M32,768 ctx
Qwen 3 Max
alibaba/qwen3-max
Alibaba's Qwen 3 Max is a large language model with strong coding, reasoning, and multilingual capabilities, served via DashScope's OpenAI-compatible endpoint.
$1.20 in · $6.00 out / 1M262,144 ctx
Qwen 3.5 397B A17B
alibaba/qwen3.5-397b-a17b
Alibaba's Qwen 3.5 is a 397B-parameter mixture-of-experts model with 17B active parameters, offering strong reasoning capabilities with efficient inference.
$0.600 in · $3.60 out / 1M262,144 ctx
Qwen3-30b-A3b-Fp8
qwen/qwen3-30b-a3b-fp8
Qwen3 is the latest generation of large language models in Qwen series, offering a comprehensive suite of dense and mixture-of-experts (MoE) models. Built upon extensive training, Qwen3 delivers groundbreaking advancements in reasoning, instruction-following, agent capabilities, and multilingual support.
$0.051 in · $0.340 out / 1M32,768 ctx
Qwen1.5-0.5b-Chat
qwen/qwen1.5-0.5b-chat
Qwen1.5 is the improved version of Qwen, the large language model series developed by Alibaba Cloud.
$0.010 in · $0.020 out / 1M4,096 ctx
Qwen1.5-1.8b-Chat
qwen/qwen1.5-1.8b-chat
Qwen1.5 is the improved version of Qwen, the large language model series developed by Alibaba Cloud.
$0.020 in · $0.040 out / 1M4,096 ctx
Qwen1.5-14b-Chat-Awq
qwen/qwen1.5-14b-chat-awq
Qwen1.5 is the improved version of Qwen, the large language model series developed by Alibaba Cloud. AWQ is an efficient, accurate and blazing-fast low-bit weight quantization method, currently supporting 4-bit quantization.
$0.120 in · $0.240 out / 1M4,096 ctx
Qwen1.5-7b-Chat-Awq
qwen/qwen1.5-7b-chat-awq
Qwen1.5 is the improved version of Qwen, the large language model series developed by Alibaba Cloud. AWQ is an efficient, accurate and blazing-fast low-bit weight quantization method, currently supporting 4-bit quantization.
$0.060 in · $0.120 out / 1M4,096 ctx
About Alibaba

Who they are, what they focus on

Alibaba's Qwen team ships open-weight models across every size class. Qwen 3.5 (397B MoE) is the current flagship; Qwen2.5-Coder-32B is a code specialist; QwQ-32B is a reasoning model. Strong on multi-lingual workloads.

Headquartered in Hangzhou, China. Homepage: qwenlm.github.io.

FAQ

Common questions about Alibaba on AIgateway

Which Alibaba models does AIgateway support?
AIgateway routes 11 Alibaba models including Qwen2.5-Coder-32b-Instruct. Full catalog with pricing and context windows is in the sections above.
How do I call a Alibaba model from my code?
Point the OpenAI SDK at https://api.aigateway.sh/v1 with your AIgateway key and set model to the Alibaba slug (e.g. "qwen/qwen2.5-coder-32b-instruct"). Request and response shapes are identical to OpenAI.
How much do Alibaba models cost on AIgateway?
Pass-through Alibaba pricing plus a 5% platform fee applied at credit top-up, not per call. No seat fees, no minimum beyond the $5 top-up floor.
Can I bring my own Alibaba API key (BYOK)?
Yes. Attach your Alibaba key in the AIgateway dashboard. Calls to Alibaba models flip to pass-through and AIgateway waives the 5% platform fee on those calls.
Where is Alibaba based?
Alibaba is headquartered in Hangzhou, China.
Is there a free tier?
AIgateway's free tier is 100 requests/day on Kimi K2.6 — any account can test without a card. Paid Alibaba models require a $5 minimum credit top-up.
Other providers

Browse other labs

AnthropicOpenAIGooglexAIMoonshotDeepSeekMetaMistralDeepgramBlack Forest LabsBAAIAll providers →