Question 1

What is Openhermes-2.5-Mistral-7b-Awq?

Accepted Answer

OpenHermes 2.5 Mistral 7B is a state of the art Mistral Fine-tune, a continuation of OpenHermes 2 model, which trained on additional code datasets. It is a text model from Hugging Face, accessible via AIgateway's OpenAI-compatible API at slug hf/thebloke/openhermes-2.5-mistral-7b-awq.

Question 2

How much does Openhermes-2.5-Mistral-7b-Awq cost via AIgateway?

Accepted Answer

Input costs $0.050 per 1M tokens; output costs $0.100 per 1M tokens. Pass-through plus a 5% platform fee applied at top-up, not per call.

Question 3

What is the context window of Openhermes-2.5-Mistral-7b-Awq?

Accepted Answer

4,096 tokens. Maximum output is 4,096 tokens.

Question 4

How do I call Openhermes-2.5-Mistral-7b-Awq from my code?

Accepted Answer

Point the OpenAI SDK at https://api.aigateway.sh/v1 with your AIgateway key and set model to "hf/thebloke/openhermes-2.5-mistral-7b-awq". The request and response shapes match OpenAI exactly.

Question 5

Does Openhermes-2.5-Mistral-7b-Awq support streaming, tool calling, vision, and JSON mode?

Accepted Answer

Streaming — yes. Tool calling — no. Vision — no. JSON mode — no. Prompt caching — no.

Question 6

What are the best use cases for Openhermes-2.5-Mistral-7b-Awq?

Accepted Answer

Chatbots, Content generation, Agentic workflows. Key strengths: General-purpose chat; Long context; Tool use.

Question 7

Can I bring my own Hugging Face API key (BYOK)?

Accepted Answer

Yes. Attach a Hugging Face key in your AIgateway dashboard and this model flips to pass-through — you pay Hugging Face directly and AIgateway waives the 5% platform fee on those calls.

Openhermes-2.5-Mistral-7b-Awq

Quickstart

Capabilities

Strengths

Use cases

Pricing

Collections

Call Openhermes-2.5-Mistral-7b-Awq from any OpenAI SDK

Request body

Response

Streaming (SSE) — set `"stream": true`

Quickstart

Errors

Openhermes-2.5-Mistral-7b-Awq

Quickstart

Capabilities

Strengths

Use cases

Pricing

Collections

Call Openhermes-2.5-Mistral-7b-Awq from any OpenAI SDK

Request body

Response

Streaming (SSE) — set "stream": true

Quickstart

Errors

Streaming (SSE) — set `"stream": true`