Qwen: QwQ 32B Preview
Qwen
Input: text
Output: text
Released: Nov 28, 2024 • Updated: May 2, 2025
QwQ-32B-Preview is an experimental research model from the Qwen Team, focused on AI reasoning. As a preview release, it shows promising analytical abilities but has several important limitations:
- Language Mixing and Code-Switching: The model may mix languages or switch between them unexpectedly, affecting response clarity.
- Recursive Reasoning Loops: The model may enter circular reasoning patterns, leading to lengthy responses without a conclusive answer.
- Safety and Ethical Considerations: The model requires enhanced safety measures to ensure reliable and secure performance, and users should exercise caution when deploying it.
- Performance and Benchmark Limitations: The model excels in math and coding but has room for improvement in other areas, such as common sense reasoning and nuanced language understanding.
32,768 Token Context
Process and analyze large documents and conversations.
Advanced Coding
Improved capabilities in front-end development and full-stack updates.
Agentic Workflows
Autonomously navigate multi-step processes with improved reliability.
Available On
Provider | Model ID | Context | Max Output | Input Cost | Output Cost | Throughput | Latency |
---|---|---|---|---|---|---|---|
Nebius AI Studio | nebiusAiStudio | 33K | - | $0.09/M | $0.27/M | 57.4 t/s | 533 ms |
Hyperbolic | hyperbolic | 33K | - | $0.20/M | $0.20/M | 59.7 t/s | 1150 ms |
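
The throughput and latency columns can be combined into a rough estimate of how long a response takes. The sketch below assumes that "Latency" means time to first token, that throughput stays constant for the whole generation, and that the listed figures are point-in-time measurements; the function name is illustrative, not part of any provider API.

```python
def estimate_response_seconds(latency_ms: float, throughput_tps: float, output_tokens: int) -> float:
    """Rough wall-clock estimate: time to first token plus generation time.

    Assumes latency_ms is time to first token and throughput_tps
    (tokens per second) is constant, which real deployments do not guarantee.
    """
    return latency_ms / 1000 + output_tokens / throughput_tps


# Figures copied from the provider table above.
providers = {
    "Nebius AI Studio": {"latency_ms": 533, "throughput_tps": 57.4},
    "Hyperbolic": {"latency_ms": 1150, "throughput_tps": 59.7},
}

for name, stats in providers.items():
    seconds = estimate_response_seconds(stats["latency_ms"], stats["throughput_tps"], 1000)
    print(f"{name}: about {seconds:.1f} s for 1,000 output tokens")
```

For long reasoning outputs the throughput term dominates, so the difference in first-token latency between the two providers matters less than it might first appear.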
Standard Pricing
Input Tokens
$0.00000009
per token
Output Tokens
$0.00000027
per token
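
As a rough illustration of how these rates translate into a per-request cost, the sketch below uses the per-million-token prices from the provider table. The function name and the example token counts are assumptions chosen for illustration, and actual billing may differ (rounding, minimum charges, or price changes).

```python
# Prices in USD per million tokens, copied from the provider table.
PRICES_PER_MILLION = {
    "Nebius AI Studio": {"input": 0.09, "output": 0.27},
    "Hyperbolic": {"input": 0.20, "output": 0.20},
}


def estimate_cost(provider: str, input_tokens: int, output_tokens: int) -> float:
    """Return the estimated cost in USD for one request (hypothetical helper)."""
    price = PRICES_PER_MILLION[provider]
    total = input_tokens * price["input"] + output_tokens * price["output"]
    return total / 1_000_000


# Example: a 20,000-token prompt with a 4,000-token response.
for name in PRICES_PER_MILLION:
    print(f"{name}: ${estimate_cost(name, 20_000, 4_000):.5f}")
# Nebius AI Studio: $0.00288
# Hyperbolic: $0.00480
```

Because this model tends to produce long reasoning traces, the output rate usually matters more than the input rate when comparing providers.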