Compare

Claude Opus 4.8 vs FLUX.2 [Pro] Preview

Pricing per million tokens, context window, capabilities — pulled from each provider's public docs. All 2 are available via the same AIgateway OpenAI-compatible endpoint; flip the model string to switch.

Search2/4
Claude Opus 4.8
anthropic/claude-opus-4.8
FLUX.2 [Pro] Preview
black-forest-labs/flux-2-pro-preview
Provider
Anthropic
Black Forest Labs
Family
Claude 4
FLUX
Modality
text
image
Context window
1,000,000 tok
Max output
128,000 tok
Released
2026-05-28
2026-05-27
License
Proprietary
Open-weight
Input price
$5.00 /1M
Output price
$25.00 /1M
Cache read
$0.500 /1M
Cache write
$6.25 /1M
Per image
$0.0000 /img
Tools
yes
Streaming
yes
Vision
yes
JSON mode
yes
Reasoning
yes
Prompt caching
yes
Batch API
yes
Try it
Open in playground →
Open in playground →
Claude Opus 4.8
anthropic/claude-opus-4.8
Full spec →

Claude Opus 4.8 is Anthropic's most capable generally available model, with a step-change improvement in agentic coding over Claude Opus 4.7. It uses adaptive thinking to calibrate reasoning per task and supports a one million token context window at standard pricing.

Strengths
  • Anthropic's most capable model — #1 on the Artificial Analysis Intelligence Index
  • Best computer-use / browser agent tested (84% on Online-Mind2Web)
  • Adaptive thinking — calibrates reasoning depth per task
Use cases
Autonomous coding agentsCodebase-scale migrationsComputer use / browser agentsHigh-stakes reasoning + analysisLong-document work (1M context)
FLUX.2 [Pro] Preview
black-forest-labs/flux-2-pro-preview
Full spec →

FLUX.2 [pro] Preview is Black Forest Labs' recommended default for production image generation and editing — tracks the latest [pro] weights with strong multi-reference support.

Strengths
  • Text-to-image generation
  • Creative control
Use cases
Marketing assetsProduct mockupsConcept art

Benchmarks

Claude Opus 4.8
FLUX.2 [Pro] Preview
AA Intelligence Index
61.0
Online-Mind2Web (computer use)
84.0

Source: each provider's published benchmarks. Higher is better. Run an eval to compare on your own data.

Compare with another

Claude Opus 4.8 vs Claude Sonnet 4.6
anthropic/claude-opus-4.8 · anthropic/claude-sonnet-4.6
Claude Opus 4.8 vs Grok 4.20 Multi-Agent
anthropic/claude-opus-4.8 · xai/grok-4.20-multi-agent-0309
Claude Opus 4.8 vs GPT-5.4
anthropic/claude-opus-4.8 · openai/gpt-5.4
Claude Opus 4.8 vs Gemini 2.5 Pro
anthropic/claude-opus-4.8 · google/gemini-2.5-pro
Claude Opus 4.8 vs Gemini 3.1 Pro
anthropic/claude-opus-4.8 · google/gemini-3.1-pro
Claude Opus 4.8 vs GPT-5.5 Pro
anthropic/claude-opus-4.8 · openai/gpt-5.5-pro
Claude Opus 4.8 vs Grok Imagine Image Quality
anthropic/claude-opus-4.8 · xai/grok-imagine-image-quality
Claude Opus 4.8 vs HappyHorse 1.0 T2V
anthropic/claude-opus-4.8 · alibaba/hh1-t2v
Claude Opus 4.8 vs Recraft V3
anthropic/claude-opus-4.8 · recraft/recraftv3
SWITCH BETWEEN THEM

One key, all 2, one line different.

from openai import OpenAI

client = OpenAI(
    base_url="https://api.aigateway.sh/v1",
    api_key="sk-aig-...",
)

# Claude Opus 4.8
client.chat.completions.create(
    model="anthropic/claude-opus-4.8",
    messages=[{"role":"user","content":"hello"}],
)

# FLUX.2 [Pro] Preview
client.chat.completions.create(
    model="black-forest-labs/flux-2-pro-preview",
    messages=[{"role":"user","content":"hello"}],
)
Get an AIgateway keyRun an eval on these →