compare/Qwen 3 MaxvsQwen3-30b-A3b-Fp8

Qwen 3 Max vs Qwen3-30b-A3b-Fp8

Pricing, context window, capabilities, and release date — pulled from each provider's public docs. Both are available via the same AIgateway OpenAI-compatible endpoint; flip the model string to switch.

RUN BOTH LIVE

Paste a prompt. Watch them race.

Both models stream in parallel through your own AIgateway key. Tokens, latency, and cost update as they arrive.

Sign in to runLive streaming uses your own key. It's free to sign up.
 Qwen 3 Max
alibaba/qwen3-max
Qwen3-30b-A3b-Fp8
qwen/qwen3-30b-a3b-fp8
ProviderAlibabaAlibaba Qwen
FamilyQwenQwen
Modalitytexttext
Context window262,144 tok32,768 tok
Max output4,096 tok4,096 tok
Released2026-04-152025-04-30
Input price$1.20 /1M$0.051 /1M
Output price$6.00 /1M$0.340 /1M
Cache read
Toolsyes
Streamingyesyes
Vision
JSON modeyes
Reasoningyesyes
Prompt caching
Qwen 3 Max
alibaba/qwen3-max
Full spec →

Alibaba's Qwen 3 Max is a large language model with strong coding, reasoning, and multilingual capabilities, served via DashScope's OpenAI-compatible endpoint.

Strengths
  • General-purpose chat
  • Streaming
  • Code generation and debugging
  • Step-by-step reasoning
Qwen3-30b-A3b-Fp8
qwen/qwen3-30b-a3b-fp8
Full spec →

Qwen3 is the latest generation of large language models in Qwen series, offering a comprehensive suite of dense and mixture-of-experts (MoE) models. Built upon extensive training, Qwen3 delivers groundbreaking advancements in reasoning, instruction-following, agent capabilities, and multilingual support.

Strengths
  • Step-by-step reasoning
  • Chain-of-thought
SWITCH BETWEEN THEM

One key, both models, one line different.

# pip install aigateway-py openai
# aigateway-py: sub-accounts, evals, replays, jobs, webhook verify.
# openai SDK: chat/embeddings/images/audio — drop-in compat per our SDK's own guidance.
from openai import OpenAI

client = OpenAI(
    base_url="https://api.aigateway.sh/v1",
    api_key="sk-aig-...",
)

# Try Qwen 3 Max
client.chat.completions.create(
    model="alibaba/qwen3-max",
    messages=[{"role":"user","content":"hello"}],
)

# Try Qwen3-30b-A3b-Fp8 — same client, same key
client.chat.completions.create(
    model="qwen/qwen3-30b-a3b-fp8",
    messages=[{"role":"user","content":"hello"}],
)
Get an AIgateway keyAdd a third model

Compare with another

GPT-5.4 vs Qwen 3 Max
openai/gpt-5.4 · alibaba/qwen3-max
Claude Opus 4.7 vs Qwen 3 Max
anthropic/claude-opus-4.7 · alibaba/qwen3-max
Qwen3-30b-A3b-Fp8 vs Uform-Gen2-Qwen-500m
qwen/qwen3-30b-a3b-fp8 · unum/uform-gen2-qwen-500m
Qwen 3 Max vs Uform-Gen2-Qwen-500m
alibaba/qwen3-max · unum/uform-gen2-qwen-500m
Qwen3-30b-A3b-Fp8 vs Qwen3-Embedding-0.6b
qwen/qwen3-30b-a3b-fp8 · qwen/qwen3-embedding-0.6b
Qwen 3 Max vs Qwen3-Embedding-0.6b
alibaba/qwen3-max · qwen/qwen3-embedding-0.6b
Deepseek-R1-Distill-Qwen-32b vs Qwen3-30b-A3b-Fp8
deepseek/deepseek-r1-distill-qwen-32b · qwen/qwen3-30b-a3b-fp8
Qwen 3 Max vs Deepseek-R1-Distill-Qwen-32b
alibaba/qwen3-max · deepseek/deepseek-r1-distill-qwen-32b
Qwen1.5-0.5b-Chat vs Qwen3-30b-A3b-Fp8
qwen/qwen1.5-0.5b-chat · qwen/qwen3-30b-a3b-fp8