Back

Qwen: QwQ 32B

Qwen
Input: text
Output: text
Released: Mar 5, 2025Updated: May 2, 2025

QwQ is the reasoning model of the Qwen series. Compared with conventional instruction-tuned models, QwQ, which is capable of thinking and reasoning, can achieve significantly enhanced performance in downstream tasks, especially hard problems. QwQ-32B is the medium-sized reasoning model, which is capable of achieving competitive performance against state-of-the-art reasoning models, e.g., DeepSeek-R1, o1-mini.

131,072 Token Context

Process and analyze large documents and conversations.

Hybrid Reasoning

Choose between rapid responses and extended, step-by-step processing for complex tasks.

Advanced Coding

Improved capabilities in front-end development and full-stack updates.

Agentic Workflows

Autonomously navigate multi-step processes with improved reliability.

Available On

ProviderModel IDContextMax OutputInput CostOutput CostThroughputLatency
DeepInfradeepInfra131K-$0.15/M$0.20/M44.1 t/s852 ms
Nebius AI StudionebiusAiStudio131K-$0.15/M$0.45/M55.7 t/s775 ms
NovitaAInovitaAi33K16K$0.18/M$0.20/M33.9 t/s994 ms
inference.netinferenceNet16K16K$0.20/M$0.20/M--
Groqgroq131K131K$0.29/M$0.39/M535.3 t/s802 ms
Hyperbolichyperbolic131K-$0.40/M$0.40/M54.6 t/s1287 ms
SambaNovasambaNova16K4K$0.50/M$1.00/M245.4 t/s653 ms
Nebius AI Studio (Fast)nebiusAiStudio (fast)131K-$0.50/M$1.50/M87.4 t/s575 ms
CentMLcentMl41K41K$0.65/M$0.65/M65.9 t/s776 ms
Fireworksfireworks131K-$0.90/M$0.90/M481.5 t/s7112 ms
Togethertogether131K33K$1.20/M$1.20/M98.6 t/s722 ms
Standard Pricing
Input Tokens
$0.00000015

per 1K tokens

Output Tokens
$0.0000002

per 1K tokens

Do Work. With AI.