compare/Nemotron-3-120b-A12bvsClaude Haiku 4.5

Nemotron-3-120b-A12b vs Claude Haiku 4.5

Pricing, context window, capabilities, and release date — pulled from each provider's public docs. Both are available via the same AIgateway OpenAI-compatible endpoint; flip the model string to switch.

RUN BOTH LIVE

Paste a prompt. Watch them race.

Both models stream in parallel through your own AIgateway key. Tokens, latency, and cost update as they arrive.

Sign in to runLive streaming uses your own key. It's free to sign up.
 Nemotron-3-120b-A12b
nvidia/nemotron-3-120b-a12b
Claude Haiku 4.5
anthropic/claude-haiku-4.5
ProviderNvidiaAnthropic
FamilyClaude 4
Modalitytexttext
Context window131,072 tok200,000 tok
Max output4,096 tok8,192 tok
Released2026-02-242026-04-13
Input price$0.500 /1M$1.00 /1M
Output price$1.20 /1M$5.00 /1M
Cache read$0.100 /1M
Toolsyesyes
Streamingyesyes
Visionyes
JSON modeyesyes
Reasoningyes
Prompt cachingyes
Nemotron-3-120b-A12b
nvidia/nemotron-3-120b-a12b
Full spec →

NVIDIA Nemotron 3 Super is a hybrid MoE model with leading accuracy for multi-agent applications and specialized agentic AI systems.

Strengths
  • General-purpose chat
  • Long context
  • Tool use
Claude Haiku 4.5
anthropic/claude-haiku-4.5
Full spec →

Claude Haiku 4.5 delivers similar levels of coding performance at one-third the cost and more than twice the speed of larger models.

Strengths
  • Fastest Claude
  • Cheap enough for per-token workloads
  • Still supports tools + vision
SWITCH BETWEEN THEM

One key, both models, one line different.

# pip install aigateway-py openai
# aigateway-py: sub-accounts, evals, replays, jobs, webhook verify.
# openai SDK: chat/embeddings/images/audio — drop-in compat per our SDK's own guidance.
from openai import OpenAI

client = OpenAI(
    base_url="https://api.aigateway.sh/v1",
    api_key="sk-aig-...",
)

# Try Nemotron-3-120b-A12b
client.chat.completions.create(
    model="nvidia/nemotron-3-120b-a12b",
    messages=[{"role":"user","content":"hello"}],
)

# Try Claude Haiku 4.5 — same client, same key
client.chat.completions.create(
    model="anthropic/claude-haiku-4.5",
    messages=[{"role":"user","content":"hello"}],
)
Get an AIgateway keyAdd a third model

Compare with another

Claude Haiku 4.5 vs GPT-5.4 Mini
anthropic/claude-haiku-4.5 · openai/gpt-5.4-mini
Gemini 3 Flash vs Claude Haiku 4.5
google/gemini-3-flash · anthropic/claude-haiku-4.5
GPT-5.5 Pro vs Nemotron-3-120b-A12b
openai/gpt-5.5-pro · nvidia/nemotron-3-120b-a12b
GPT-5.5 Pro vs Claude Haiku 4.5
openai/gpt-5.5-pro · anthropic/claude-haiku-4.5
GPT-5.5 vs Nemotron-3-120b-A12b
openai/gpt-5.5 · nvidia/nemotron-3-120b-a12b
GPT-5.5 vs Claude Haiku 4.5
openai/gpt-5.5 · anthropic/claude-haiku-4.5
Claude Opus 4.7 vs Nemotron-3-120b-A12b
anthropic/claude-opus-4.7 · nvidia/nemotron-3-120b-a12b
Claude Opus 4.7 vs Claude Haiku 4.5
anthropic/claude-opus-4.7 · anthropic/claude-haiku-4.5
Claude Sonnet 4.6 vs Nemotron-3-120b-A12b
anthropic/claude-sonnet-4.6 · nvidia/nemotron-3-120b-a12b