Qwen: QwQ 32B (free)
Qwen
Input: text
Output: text
Released: Mar 5, 2025 • Updated: May 2, 2025
QwQ is the reasoning model of the Qwen series. Unlike conventional instruction-tuned models, QwQ can think and reason, which significantly improves its performance on downstream tasks, especially hard problems. QwQ-32B is the medium-sized reasoning model in the series and is competitive with state-of-the-art reasoning models such as DeepSeek-R1 and o1-mini.
40,000 Token Context
Process and analyze large documents and conversations.
Hybrid Reasoning
Choose between rapid responses and extended, step-by-step processing for complex tasks.
Advanced Coding
Improved capabilities in front-end development and full-stack updates.
Agentic Workflows
Autonomously navigate multi-step processes with improved reliability.
Available On
Provider | Model ID | Context | Max Output | Input Cost | Output Cost | Throughput | Latency |
---|---|---|---|---|---|---|---|
DeepInfra | deepInfra | 131K | - | $0.15/M | $0.20/M | 50.6 t/s | 482 ms |
Nebius | nebiusAiStudio | 131K | - | $0.15/M | $0.45/M | 42.1 t/s | 511 ms |
InferenceNet | inferenceNet | 16K | 16K | $0.20/M | $0.20/M | - | - |
Groq | groq | 131K | 131K | $0.29/M | $0.39/M | 540.8 t/s | 561 ms |
Hyperbolic | hyperbolic | 131K | - | $0.40/M | $0.40/M | 43.4 t/s | 1262 ms |
SambaNova | sambaNova | 16K | 4K | $0.50/M | $1.00/M | 276.2 t/s | 831 ms |
Nebius | nebiusAiStudio (fast) | 131K | - | $0.50/M | $1.50/M | 81.7 t/s | 399 ms |
Cent-ML | centMl | 41K | 41K | $0.65/M | $0.65/M | 105.3 t/s | 622 ms |
Fireworks | fireworks | 131K | - | $0.90/M | $0.90/M | 179.0 t/s | 564 ms |
Together | together | 131K | 33K | $1.20/M | $1.20/M | 89.8 t/s | 586 ms |
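To compare providers for a given workload, the listed prices can be turned into a per-request cost. The sketch below is a minimal illustration, assuming "/M" means US dollars per one million tokens and that input and output tokens are billed separately at the listed rates. The function names and the example token counts are hypothetical, and only three rows of the table are copied in.

```python
# Prices in USD per million tokens, copied from the provider table:
# (input cost, output cost). Assumes "/M" means "per 1,000,000 tokens".
PRICES = {
    "deepInfra": (0.15, 0.20),
    "groq": (0.29, 0.39),
    "together": (1.20, 1.20),
}


def request_cost(provider: str, input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of one request (hypothetical helper)."""
    input_price, output_price = PRICES[provider]
    total = input_tokens * input_price + output_tokens * output_price
    return total / 1_000_000


def cheapest(input_tokens: int, output_tokens: int) -> str:
    """Return the Model ID with the lowest estimated cost for this workload."""
    return min(PRICES, key=lambda p: request_cost(p, input_tokens, output_tokens))


if __name__ == "__main__":
    # Hypothetical workload: 10,000 prompt tokens and 2,000 generated tokens.
    for model_id in PRICES:
        print(model_id, request_cost(model_id, 10_000, 2_000))
    print("cheapest:", cheapest(10_000, 2_000))
```

Cost is only one axis: the table's throughput and latency columns differ widely between providers, so the cheapest row is not necessarily the best fit for latency-sensitive use.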
Standard Pricing