compare/Qwen 3 MaxvsQwen3-30b-A3b-Fp8

Qwen 3 Max vs Qwen3-30b-A3b-Fp8

Pricing, context window, capabilities, and release date — pulled from each provider's public docs. Both are available via the same AIgateway OpenAI-compatible endpoint; flip the model string to switch.

RUN BOTH LIVE

Paste a prompt. Watch them race.

Both models stream in parallel through your own AIgateway key. Tokens, latency, and cost update as they arrive.

Sign in to runLive streaming uses your own key. It's free to sign up.
 Qwen 3 Max
alibaba/qwen3-max
Qwen3-30b-A3b-Fp8
qwen/qwen3-30b-a3b-fp8
ProviderAlibabaAlibaba Qwen
FamilyQwenQwen
Modalitytexttext
Context window262,144 tok131,072 tok
Max output4,096 tok4,096 tok
Released2026-05-222025-04-30
Input price$1.20 /1M$0.250 /1M
Output price$6.00 /1M$0.500 /1M
Cache read
Tools
Streamingyesyes
Vision
JSON mode
Reasoning
Prompt caching
Qwen 3 Max
alibaba/qwen3-max
Full spec →

Alibaba's Qwen 3 Max is a large language model with strong coding, reasoning, and multilingual capabilities, served via DashScope's OpenAI-compatible endpoint.

Strengths
  • General-purpose chat
  • Long context
  • Tool use
Qwen3-30b-A3b-Fp8
qwen/qwen3-30b-a3b-fp8
Full spec →

Qwen3 is the latest generation of large language models in Qwen series, offering a comprehensive suite of dense and mixture-of-experts (MoE) models. Built upon extensive training, Qwen3 delivers groundbreaking advancements in reasoning, instruction-following, agent capabilities, and multilingual support.

Strengths
  • General-purpose chat
  • Long context
  • Tool use
SWITCH BETWEEN THEM

One key, both models, one line different.

# pip install aigateway-py openai
# aigateway-py: sub-accounts, evals, replays, jobs, webhook verify.
# openai SDK: chat/embeddings/images/audio — drop-in compat per our SDK's own guidance.
from openai import OpenAI

client = OpenAI(
    base_url="https://api.aigateway.sh/v1",
    api_key="sk-aig-...",
)

# Try Qwen 3 Max
client.chat.completions.create(
    model="alibaba/qwen3-max",
    messages=[{"role":"user","content":"hello"}],
)

# Try Qwen3-30b-A3b-Fp8 — same client, same key
client.chat.completions.create(
    model="qwen/qwen3-30b-a3b-fp8",
    messages=[{"role":"user","content":"hello"}],
)
Get an AIgateway keyAdd a third model

Compare with another

Qwen 3 Max vs GPT-5.4
alibaba/qwen3-max · openai/gpt-5.4
Qwen 3 Max vs Claude Opus 4.7
alibaba/qwen3-max · anthropic/claude-opus-4.7
Qwen3-30b-A3b-Fp8 vs Qwen3-Embedding-0.6b
qwen/qwen3-30b-a3b-fp8 · qwen/qwen3-embedding-0.6b
Qwen 3 Max vs Qwen3-Embedding-0.6b
alibaba/qwen3-max · qwen/qwen3-embedding-0.6b
Deepseek-R1-Distill-Qwen-32b vs Qwen3-30b-A3b-Fp8
deepseek/deepseek-r1-distill-qwen-32b · qwen/qwen3-30b-a3b-fp8
Qwen 3 Max vs Deepseek-R1-Distill-Qwen-32b
alibaba/qwen3-max · deepseek/deepseek-r1-distill-qwen-32b
Qwen2.5-Coder-32b-Instruct vs Qwen3-30b-A3b-Fp8
qwen/qwen2.5-coder-32b-instruct · qwen/qwen3-30b-a3b-fp8
Qwen 3 Max vs Qwen2.5-Coder-32b-Instruct
alibaba/qwen3-max · qwen/qwen2.5-coder-32b-instruct
Qwen3-30b-A3b-Fp8 vs Qwq-32b
qwen/qwen3-30b-a3b-fp8 · qwen/qwq-32b