compare/Mistral-7b-Instruct-V0.1-AwqvsMistral Small 4

Mistral-7b-Instruct-V0.1-Awq vs Mistral Small 4

Pricing, context window, capabilities, and release date — pulled from each provider's public docs. Both are available via the same AIgateway OpenAI-compatible endpoint; flip the model string to switch.

RUN BOTH LIVE

Paste a prompt. Watch them race.

Both models stream in parallel through your own AIgateway key. Tokens, latency, and cost update as they arrive.

Sign in to runLive streaming uses your own key. It's free to sign up.
 Mistral-7b-Instruct-V0.1-Awq
hf/thebloke/mistral-7b-instruct-v0.1-awq
Mistral Small 4
mistral/mistral-small-4-0-26-03
ProviderHugging FaceMistral
FamilyMistralMistral
Modalitytexttext
Context window4,096 tok131,072 tok
Max output4,096 tok16,384 tok
Released2023-09-272026-03-01
Input price$0.050 /1M$0.200 /1M
Output price$0.100 /1M$0.600 /1M
Cache read
Toolsyes
Streamingyesyes
Visionyes
JSON modeyes
Reasoning
Prompt caching
Mistral-7b-Instruct-V0.1-Awq
hf/thebloke/mistral-7b-instruct-v0.1-awq
Full spec →

Mistral 7B Instruct v0.1 AWQ is an efficient, accurate and blazing-fast low-bit weight quantized Mistral variant.

Strengths
  • General-purpose chat
  • Long context
  • Tool use
Mistral Small 4
mistral/mistral-small-4-0-26-03
Full spec →

Mistral Small 4 (Mar 2026) — compact frontier-class text model.

Strengths
  • General-purpose chat
  • Long context
  • Tool use
SWITCH BETWEEN THEM

One key, both models, one line different.

# pip install aigateway-py openai
# aigateway-py: sub-accounts, evals, replays, jobs, webhook verify.
# openai SDK: chat/embeddings/images/audio — drop-in compat per our SDK's own guidance.
from openai import OpenAI

client = OpenAI(
    base_url="https://api.aigateway.sh/v1",
    api_key="sk-aig-...",
)

# Try Mistral-7b-Instruct-V0.1-Awq
client.chat.completions.create(
    model="hf/thebloke/mistral-7b-instruct-v0.1-awq",
    messages=[{"role":"user","content":"hello"}],
)

# Try Mistral Small 4 — same client, same key
client.chat.completions.create(
    model="mistral/mistral-small-4-0-26-03",
    messages=[{"role":"user","content":"hello"}],
)
Get an AIgateway keyAdd a third model

Compare with another

Mistral-7b-Instruct-V0.2 vs Mistral-7b-Instruct-V0.1-Awq
hf/mistral/mistral-7b-instruct-v0.2 · hf/thebloke/mistral-7b-instruct-v0.1-awq
Mistral-7b-Instruct-V0.2 vs Mistral Small 4
hf/mistral/mistral-7b-instruct-v0.2 · mistral/mistral-small-4-0-26-03
Hermes-2-Pro-Mistral-7b vs Mistral-7b-Instruct-V0.1-Awq
hf/nousresearch/hermes-2-pro-mistral-7b · hf/thebloke/mistral-7b-instruct-v0.1-awq
Hermes-2-Pro-Mistral-7b vs Mistral Small 4
hf/nousresearch/hermes-2-pro-mistral-7b · mistral/mistral-small-4-0-26-03
Mistral-7b-Instruct-V0.1-Awq vs Openhermes-2.5-Mistral-7b-Awq
hf/thebloke/mistral-7b-instruct-v0.1-awq · hf/thebloke/openhermes-2.5-mistral-7b-awq
Mistral-7b-Instruct-V0.1-Awq vs Voxtral Mini Transcribe Realtime
hf/thebloke/mistral-7b-instruct-v0.1-awq · mistral/voxtral-mini-transcribe-realtime-26-02
Mistral-7b-Instruct-V0.1-Awq vs Voxtral Mini Transcribe
hf/thebloke/mistral-7b-instruct-v0.1-awq · mistral/voxtral-mini-transcribe-26-02
Mistral-7b-Instruct-V0.1-Awq vs Voxtral TTS
hf/thebloke/mistral-7b-instruct-v0.1-awq · mistral/voxtral-tts-26-03
Openhermes-2.5-Mistral-7b-Awq vs Mistral Small 4
hf/thebloke/openhermes-2.5-mistral-7b-awq · mistral/mistral-small-4-0-26-03