Back

Qwen: Qwen3 14B

Qwen3
Input: text
Output: text
Released: Apr 28, 2025Updated: May 12, 2025

Qwen3-14B is a dense 14.8B parameter causal language model from the Qwen3 series, designed for both complex reasoning and efficient dialogue. It supports seamless switching between a "thinking" mode for tasks like math, programming, and logical inference, and a "non-thinking" mode for general-purpose conversation. The model is fine-tuned for instruction-following, agent tool use, creative writing, and multilingual tasks across 100+ languages and dialects. It natively handles 32K token contexts and can extend to 131K tokens using YaRN-based scaling.

40,960 Token Context

Process and analyze large documents and conversations.

Hybrid Reasoning

Choose between rapid responses and extended, step-by-step processing for complex tasks.

Advanced Coding

Improved capabilities in front-end development and full-stack updates.

Agentic Workflows

Autonomously navigate multi-step processes with improved reliability.

Available On

ProviderModel IDContextMax OutputInput CostOutput CostThroughputLatency
DeepInfradeepInfra41K41K$0.07/M$0.24/M70.4 t/s610 ms
NovitaAInovitaAi41K41K$0.07/M$0.28/M57.9 t/s889 ms
Nebius AI StudionebiusAiStudio41K-$0.08/M$0.24/M89.8 t/s384 ms
Standard Pricing
Input Tokens
$0.00000007

per 1K tokens

Output Tokens
$0.00000024

per 1K tokens

Do Work. With AI.