Compare

Llama-3.1-8b-Instruct-Fp8 — and what?

Pricing per million tokens, context window, capabilities — pulled from each provider's public docs. All 1 are available via the same AIgateway OpenAI-compatible endpoint; flip the model string to switch.

Search1/4
Llama-3.1-8b-Instruct-Fp8
meta/llama-3.1-8b-instruct-fp8
Provider
Meta
Family
Llama 3
Modality
text
Context window
131,072 tok
Max output
4,096 tok
Released
2024-07-25
License
Open-weight
Input price
$0.050 /1M
Output price
$0.100 /1M
Tools
Streaming
yes
Vision
JSON mode
Reasoning
Prompt caching
Batch API
Try it
Open in playground →
Llama-3.1-8b-Instruct-Fp8
meta/llama-3.1-8b-instruct-fp8
Full spec →

Llama 3.1 8B quantized to FP8 precision

Strengths
  • General-purpose chat
  • Long context
  • Tool use
Use cases
ChatbotsContent generationAgentic workflows

Compare with another

Llama-3.1-8b-Instruct-Fp8 vs Llama-3.2-3b-Instruct
meta/llama-3.1-8b-instruct-fp8 · meta/llama-3.2-3b-instruct
Llama-3.1-8b-Instruct-Fp8 vs Llama-3.2-1b-Instruct
meta/llama-3.1-8b-instruct-fp8 · meta/llama-3.2-1b-instruct
Llama-3.1-8b-Instruct-Fp8 vs Llama-3.3-70b-Instruct-Fp8-Fast
meta/llama-3.1-8b-instruct-fp8 · meta/llama-3.3-70b-instruct-fp8-fast