Compare

Claude Opus 4.8 vs GPT-5.4

Pricing per million tokens, context window, capabilities — pulled from each provider's public docs. All 2 are available via the same AIgateway OpenAI-compatible endpoint; flip the model string to switch.

Search2/4
Claude Opus 4.8
anthropic/claude-opus-4.8
GPT-5.4
openai/gpt-5.4
Provider
Anthropic
OpenAI
Family
Claude 4
GPT-5
Modality
text
text
Context window
1,000,000 tok
1,000,000 tok
Max output
128,000 tok
16,384 tok
Released
2026-05-28
2026-05-22
License
Proprietary
Proprietary
Input price
$5.00 /1M
$2.50 /1M
Output price
$25.00 /1M
$15.00 /1M
Cache read
$0.500 /1M
$0.250 /1M
Cache write
$6.25 /1M
Tools
yes
yes
Streaming
yes
yes
Vision
yes
yes
JSON mode
yes
yes
Reasoning
yes
yes
Prompt caching
yes
yes
Batch API
yes
yes
Try it
Open in playground →
Open in playground →
Claude Opus 4.8
anthropic/claude-opus-4.8
Full spec →

Claude Opus 4.8 is Anthropic's most capable generally available model, with a step-change improvement in agentic coding over Claude Opus 4.7. It uses adaptive thinking to calibrate reasoning per task and supports a one million token context window at standard pricing.

Strengths
  • Anthropic's most capable model — #1 on the Artificial Analysis Intelligence Index
  • Best computer-use / browser agent tested (84% on Online-Mind2Web)
  • Adaptive thinking — calibrates reasoning depth per task
Use cases
Autonomous coding agentsCodebase-scale migrationsComputer use / browser agentsHigh-stakes reasoning + analysisLong-document work (1M context)
GPT-5.4
openai/gpt-5.4
Full spec →

GPT-5.4 is OpenAI's flagship model with strong coding, reasoning, and multimodal capabilities.

Strengths
  • Huge context window
  • Strong on math + code
  • Excellent JSON mode
Use cases
Long-document Q&AStructured extractionCode generationAnalytical work

Benchmarks

Claude Opus 4.8
GPT-5.4
AA Intelligence Index
61.0
GSM8K
98.2
HumanEval
94.0
MMLU
91.8
Online-Mind2Web (computer use)
84.0

Source: each provider's published benchmarks. Higher is better. Run an eval to compare on your own data.

Compare with another

GPT-5.4 vs Claude Opus 4.7
openai/gpt-5.4 · anthropic/claude-opus-4.7
GPT-5.4 vs Gemini 3.1 Pro
openai/gpt-5.4 · google/gemini-3.1-pro
Claude Sonnet 4.6 vs GPT-5.4
anthropic/claude-sonnet-4.6 · openai/gpt-5.4
GPT-5.4 vs Kimi-K2.6
openai/gpt-5.4 · moonshot/kimi-k2.6
GPT-5.4 vs Grok 4
openai/gpt-5.4 · xai/grok-4
GPT-5.4 vs Qwen 3 Max
openai/gpt-5.4 · alibaba/qwen3-max
Claude Opus 4.8 vs Claude Sonnet 4.6
anthropic/claude-opus-4.8 · anthropic/claude-sonnet-4.6
Claude Opus 4.8 vs Grok 4.20 Multi-Agent
anthropic/claude-opus-4.8 · xai/grok-4.20-multi-agent-0309
Claude Opus 4.8 vs Gemini 2.5 Pro
anthropic/claude-opus-4.8 · google/gemini-2.5-pro
SWITCH BETWEEN THEM

One key, all 2, one line different.

from openai import OpenAI

client = OpenAI(
    base_url="https://api.aigateway.sh/v1",
    api_key="sk-aig-...",
)

# Claude Opus 4.8
client.chat.completions.create(
    model="anthropic/claude-opus-4.8",
    messages=[{"role":"user","content":"hello"}],
)

# GPT-5.4
client.chat.completions.create(
    model="openai/gpt-5.4",
    messages=[{"role":"user","content":"hello"}],
)
Get an AIgateway keyRun an eval on these →