Question 1

What is Qwen1.5-7b-Chat-Awq?

Accepted Answer

Qwen1.5 is the improved version of Qwen, the large language model series developed by Alibaba Cloud. AWQ is an efficient, accurate and blazing-fast low-bit weight quantization method, currently supporting 4-bit quantization. It is a text model from Alibaba Qwen, accessible via AIgateway's OpenAI-compatible API at slug qwen/qwen1.5-7b-chat-awq.

Question 2

How much does Qwen1.5-7b-Chat-Awq cost via AIgateway?

Accepted Answer

Input costs $0.060 per 1M tokens; output costs $0.120 per 1M tokens. Pass-through plus a 5% platform fee applied at top-up, not per call.

Question 3

What is the context window of Qwen1.5-7b-Chat-Awq?

Accepted Answer

4,096 tokens. Maximum output is 4,096 tokens.

Question 4

How do I call Qwen1.5-7b-Chat-Awq from my code?

Accepted Answer

Point the OpenAI SDK at https://api.aigateway.sh/v1 with your AIgateway key and set model to "qwen/qwen1.5-7b-chat-awq". The request and response shapes match OpenAI exactly.

Question 5

Does Qwen1.5-7b-Chat-Awq support streaming, tool calling, vision, and JSON mode?

Accepted Answer

Streaming — yes. Tool calling — no. Vision — no. JSON mode — no. Prompt caching — no.

Question 6

What are the best use cases for Qwen1.5-7b-Chat-Awq?

Accepted Answer

Chatbots, Content generation, Agentic workflows. Key strengths: General-purpose chat; Long context; Tool use.

Question 7

Can I bring my own Alibaba Qwen API key (BYOK)?

Accepted Answer

Yes. Attach a Alibaba Qwen key in your AIgateway dashboard and this model flips to pass-through — you pay Alibaba Qwen directly and AIgateway waives the 5% platform fee on those calls.

Qwen1.5-7b-Chat-Awq

Quickstart

Capabilities

Strengths

Use cases

Pricing

Collections

Call Qwen1.5-7b-Chat-Awq from any OpenAI SDK

Request body

Response

Streaming (SSE) — set `"stream": true`

Quickstart

Errors

Qwen1.5-7b-Chat-Awq

Quickstart

Capabilities

Strengths

Use cases

Pricing

Collections

Call Qwen1.5-7b-Chat-Awq from any OpenAI SDK

Request body

Response

Streaming (SSE) — set "stream": true

Quickstart

Errors

Streaming (SSE) — set `"stream": true`