compare/Qwen3-30b-A3b-Fp8vsUform-Gen2-Qwen-500m

Qwen3-30b-A3b-Fp8 vs Uform-Gen2-Qwen-500m

Pricing, context window, capabilities, and release date — pulled from each provider's public docs. Both are available via the same AIgateway OpenAI-compatible endpoint; flip the model string to switch.

RUN BOTH LIVE

Paste a prompt. Watch them race.

Both models stream in parallel through your own AIgateway key. Tokens, latency, and cost update as they arrive.

Sign in to runLive streaming uses your own key. It's free to sign up.
 Qwen3-30b-A3b-Fp8
qwen/qwen3-30b-a3b-fp8
Uform-Gen2-Qwen-500m
unum/uform-gen2-qwen-500m
ProviderAlibaba QwenUnum
FamilyQwenQwen
Modalitytextvision
Context window32,768 tok4,096 tok
Max output4,096 tok4,096 tok
Released2025-04-302024-02-27
Input price$0.051 /1M$0.0000 /img
Output price$0.340 /1M
Cache read
Toolsyes
Streamingyesyes
Visionyes
JSON modeyes
Reasoningyes
Prompt caching
Qwen3-30b-A3b-Fp8
qwen/qwen3-30b-a3b-fp8
Full spec →

Qwen3 is the latest generation of large language models in Qwen series, offering a comprehensive suite of dense and mixture-of-experts (MoE) models. Built upon extensive training, Qwen3 delivers groundbreaking advancements in reasoning, instruction-following, agent capabilities, and multilingual support.

Strengths
  • Step-by-step reasoning
  • Chain-of-thought
Uform-Gen2-Qwen-500m
unum/uform-gen2-qwen-500m
Full spec →

UForm-Gen is a small generative vision-language model primarily designed for Image Captioning and Visual Question Answering. The model was pre-trained on the internal image captioning dataset and fine-tuned on public instructions datasets: SVIT, LVIS, VQAs datasets.

Strengths
  • Image understanding
  • Multimodal reasoning
SWITCH BETWEEN THEM

One key, both models, one line different.

# pip install aigateway-py openai
# aigateway-py: sub-accounts, evals, replays, jobs, webhook verify.
# openai SDK: chat/embeddings/images/audio — drop-in compat per our SDK's own guidance.
from openai import OpenAI

client = OpenAI(
    base_url="https://api.aigateway.sh/v1",
    api_key="sk-aig-...",
)

# Try Qwen3-30b-A3b-Fp8
client.chat.completions.create(
    model="qwen/qwen3-30b-a3b-fp8",
    messages=[{"role":"user","content":"hello"}],
)

# Try Uform-Gen2-Qwen-500m — same client, same key
client.chat.completions.create(
    model="unum/uform-gen2-qwen-500m",
    messages=[{"role":"user","content":"hello"}],
)
Get an AIgateway keyAdd a third model

Compare with another

Qwen3-Embedding-0.6b vs Uform-Gen2-Qwen-500m
qwen/qwen3-embedding-0.6b · unum/uform-gen2-qwen-500m
Deepseek-R1-Distill-Qwen-32b vs Uform-Gen2-Qwen-500m
deepseek/deepseek-r1-distill-qwen-32b · unum/uform-gen2-qwen-500m
Qwen1.5-0.5b-Chat vs Uform-Gen2-Qwen-500m
qwen/qwen1.5-0.5b-chat · unum/uform-gen2-qwen-500m
Qwen1.5-1.8b-Chat vs Uform-Gen2-Qwen-500m
qwen/qwen1.5-1.8b-chat · unum/uform-gen2-qwen-500m
Qwen1.5-14b-Chat-Awq vs Uform-Gen2-Qwen-500m
qwen/qwen1.5-14b-chat-awq · unum/uform-gen2-qwen-500m
Qwen1.5-7b-Chat-Awq vs Uform-Gen2-Qwen-500m
qwen/qwen1.5-7b-chat-awq · unum/uform-gen2-qwen-500m
Qwen2.5-Coder-32b-Instruct vs Uform-Gen2-Qwen-500m
qwen/qwen2.5-coder-32b-instruct · unum/uform-gen2-qwen-500m
Qwq-32b vs Uform-Gen2-Qwen-500m
qwen/qwq-32b · unum/uform-gen2-qwen-500m
Qwen 3 Max vs Uform-Gen2-Qwen-500m
alibaba/qwen3-max · unum/uform-gen2-qwen-500m