Qwen: QwQ 32B
Qwen
Input: text
Output: text
Released: Mar 5, 2025 • Updated: May 2, 2025
QwQ is the reasoning model of the Qwen series. Unlike conventional instruction-tuned models, QwQ thinks and reasons before answering, which significantly improves performance on downstream tasks, especially hard problems. QwQ-32B is the medium-sized reasoning model and performs competitively with state-of-the-art reasoning models such as DeepSeek-R1 and o1-mini.
131,072 Token Context
Process and analyze large documents and conversations.
Hybrid Reasoning
Choose between rapid responses and extended, step-by-step processing for complex tasks.
Advanced Coding
Improved capabilities in front-end development and full-stack updates.
Agentic Workflows
Autonomously navigate multi-step processes with improved reliability.
Available On
Provider | Model ID | Context | Max Output | Input Cost | Output Cost | Throughput | Latency |
---|---|---|---|---|---|---|---|
DeepInfra | deepInfra | 131K | - | $0.15/M | $0.20/M | 44.1 t/s | 852 ms |
Nebius AI Studio | nebiusAiStudio | 131K | - | $0.15/M | $0.45/M | 55.7 t/s | 775 ms |
NovitaAI | novitaAi | 33K | 16K | $0.18/M | $0.20/M | 33.9 t/s | 994 ms |
inference.net | inferenceNet | 16K | 16K | $0.20/M | $0.20/M | - | - |
Groq | groq | 131K | 131K | $0.29/M | $0.39/M | 535.3 t/s | 802 ms |
Hyperbolic | hyperbolic | 131K | - | $0.40/M | $0.40/M | 54.6 t/s | 1287 ms |
SambaNova | sambaNova | 16K | 4K | $0.50/M | $1.00/M | 245.4 t/s | 653 ms |
Nebius AI Studio (Fast) | nebiusAiStudio (fast) | 131K | - | $0.50/M | $1.50/M | 87.4 t/s | 575 ms |
CentML | centMl | 41K | 41K | $0.65/M | $0.65/M | 65.9 t/s | 776 ms |
Fireworks | fireworks | 131K | - | $0.90/M | $0.90/M | 481.5 t/s | 7112 ms |
Together | together | 131K | 33K | $1.20/M | $1.20/M | 98.6 t/s | 722 ms |
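Because context limits, output caps, and prices differ by provider, it can help to filter and rank the rows above for a given request size. The following is a minimal sketch, not part of any provider's API: the `Provider` tuple and the `request_cost` and `rank_providers` helpers are hypothetical names, the numbers are copied from the table, "K" is assumed to mean 1,000 tokens, the context window is assumed to cover input plus output, and "-" is assumed to mean no listed output cap.

```python
from typing import List, NamedTuple, Optional


class Provider(NamedTuple):
    name: str
    context_k: int               # context window in thousands of tokens, as listed
    max_output_k: Optional[int]  # None where the table shows "-"
    input_per_m: float           # USD per 1M input tokens
    output_per_m: float          # USD per 1M output tokens


# Rows copied from the "Available On" table.
PROVIDERS = [
    Provider("DeepInfra", 131, None, 0.15, 0.20),
    Provider("Nebius AI Studio", 131, None, 0.15, 0.45),
    Provider("NovitaAI", 33, 16, 0.18, 0.20),
    Provider("inference.net", 16, 16, 0.20, 0.20),
    Provider("Groq", 131, 131, 0.29, 0.39),
    Provider("Hyperbolic", 131, None, 0.40, 0.40),
    Provider("SambaNova", 16, 4, 0.50, 1.00),
    Provider("Nebius AI Studio (Fast)", 131, None, 0.50, 1.50),
    Provider("CentML", 41, 41, 0.65, 0.65),
    Provider("Fireworks", 131, None, 0.90, 0.90),
    Provider("Together", 131, 33, 1.20, 1.20),
]


def request_cost(p: Provider, input_tokens: int, output_tokens: int) -> float:
    """USD cost of one request at the provider's listed per-million rates."""
    return (input_tokens * p.input_per_m + output_tokens * p.output_per_m) / 1_000_000


def rank_providers(input_tokens: int, output_tokens: int) -> List[Provider]:
    """Providers whose listed limits fit the request, cheapest first."""
    needed_k = (input_tokens + output_tokens) / 1000
    eligible = [
        p for p in PROVIDERS
        if p.context_k >= needed_k
        and (p.max_output_k is None or p.max_output_k * 1000 >= output_tokens)
    ]
    return sorted(eligible, key=lambda p: request_cost(p, input_tokens, output_tokens))


if __name__ == "__main__":
    for p in rank_providers(50_000, 8_000):
        print(f"{p.name}: ${request_cost(p, 50_000, 8_000):.4f}")
```

Ranking here is by price only; throughput and latency from the table could be added as a secondary sort key if response time matters more than cost.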
Standard Pricing
Input Tokens
$0.00000015
per token
Output Tokens
$0.0000002
per token
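The amounts above line up with the lowest per-million prices in the provider table ($0.15/M input, $0.20/M output) when read as per-token prices. A small sketch of the unit conversion, using `Decimal` so the tiny values print exactly; the function names are illustrative only:

```python
from decimal import Decimal

TOKENS_PER_MILLION = Decimal(1_000_000)


def per_token(price_per_million: Decimal) -> Decimal:
    """Convert a USD-per-1M-tokens price to USD per single token."""
    return price_per_million / TOKENS_PER_MILLION


def per_thousand(price_per_million: Decimal) -> Decimal:
    """Convert a USD-per-1M-tokens price to USD per 1K tokens."""
    return price_per_million / Decimal(1_000)


if __name__ == "__main__":
    for label, price in (("input", Decimal("0.15")), ("output", Decimal("0.20"))):
        print(
            f"{label}: ${price}/M"
            f" = ${per_token(price):.8f} per token"
            f" = ${per_thousand(price):.5f} per 1K tokens"
        )
```

Under this conversion, $0.15/M corresponds to $0.00015 per 1K tokens, three orders of magnitude above the per-token figure.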