Qwen: QwQ 32B (free)

Qwen
Input: text
Output: text
Released: Mar 5, 2025
Updated: May 2, 2025

QwQ is the reasoning model of the Qwen series. Because it is capable of thinking and reasoning, QwQ can achieve significantly better performance than conventional instruction-tuned models on downstream tasks, especially hard problems. QwQ-32B is the medium-sized reasoning model in the series and is competitive with state-of-the-art reasoning models such as DeepSeek-R1 and o1-mini.

40,000 Token Context

Process and analyze large documents and conversations.

Hybrid Reasoning

Choose between rapid responses and extended, step-by-step processing for complex tasks.

Advanced Coding

Improved capabilities in front-end development and full-stack updates.

Agentic Workflows

Autonomously navigate multi-step processes with improved reliability.

Available On

Provider | Model ID | Context | Max Output | Input Cost | Output Cost | Throughput | Latency
DeepInfra | deepInfra | 131K | - | $0.15/M | $0.20/M | 50.6 t/s | 482 ms
Nebius | nebiusAiStudio | 131K | - | $0.15/M | $0.45/M | 42.1 t/s | 511 ms
InferenceNet | inferenceNet | 16K | 16K | $0.20/M | $0.20/M | - | -
Groq | groq | 131K | 131K | $0.29/M | $0.39/M | 540.8 t/s | 561 ms
Hyperbolic | hyperbolic | 131K | - | $0.40/M | $0.40/M | 43.4 t/s | 1262 ms
SambaNova | sambaNova | 16K | 4K | $0.50/M | $1.00/M | 276.2 t/s | 831 ms
Nebius | nebiusAiStudio (fast) | 131K | - | $0.50/M | $1.50/M | 81.7 t/s | 399 ms
Cent-ML | centMl | 41K | 41K | $0.65/M | $0.65/M | 105.3 t/s | 622 ms
Fireworks | fireworks | 131K | - | $0.90/M | $0.90/M | 179.0 t/s | 564 ms
Together | together | 131K | 33K | $1.20/M | $1.20/M | 89.8 t/s | 586 ms
Standard Pricing
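
To compare providers for a given workload, the figures in the table above can be turned into a rough per-request estimate. The sketch below is illustrative only. It assumes "/M" means US dollars per million tokens, that latency is the delay before the first output token, and that throughput applies to output tokens. The function names and the sample token counts are hypothetical, and only three rows of the table are copied in.

```python
# Rough per-request cost and time estimates from the provider table.
# Assumptions (not stated on the page): "/M" = USD per million tokens,
# latency = time to first token, throughput = output tokens per second.

# model_id: (input $/M, output $/M, throughput t/s, latency ms)
PROVIDERS = {
    "deepInfra": (0.15, 0.20, 50.6, 482),
    "groq": (0.29, 0.39, 540.8, 561),
    "together": (1.20, 1.20, 89.8, 586),
}


def estimate_cost(model_id, input_tokens, output_tokens):
    """Return the estimated cost of one request in USD."""
    input_price, output_price, _, _ = PROVIDERS[model_id]
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


def estimate_seconds(model_id, output_tokens):
    """Return the estimated wall-clock time of one response in seconds."""
    _, _, tokens_per_second, latency_ms = PROVIDERS[model_id]
    return latency_ms / 1000 + output_tokens / tokens_per_second


if __name__ == "__main__":
    # Hypothetical workload: 10,000 prompt tokens, 2,000 generated tokens.
    for model_id in PROVIDERS:
        cost = estimate_cost(model_id, 10_000, 2_000)
        seconds = estimate_seconds(model_id, 2_000)
        print(f"{model_id}: ${cost:.5f}, about {seconds:.1f} s")
```

Under these assumptions the cheapest row is not necessarily the quickest, so it can help to compute both numbers before choosing a provider. Reasoning models may also emit a large number of thinking tokens, so a realistic output token count is likely higher than the visible answer length.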
