Do Services-as-Software

Qwen2.5-Coder-7B-Instruct is a 7B parameter instruction-tuned language model optimized for code-related tasks such as code generation, reasoning, and bug fixing. Based on the Qwen2.5 architecture, it incorporates enhancements like RoPE, SwiGLU, RMSNorm, and GQA attention with support for up to 128K tokens using YaRN-based extrapolation. It is trained on a large corpus of source code, synthetic data, and text-code grounding, providing robust performance across programming languages and agentic coding workflows.

This model is part of the Qwen2.5-Coder family and offers strong compatibility with tools like vLLM for efficient deployment. Released under the Apache 2.0 license.

Provider	Model ID	Context	Max Output	Input Cost	Output Cost	Throughput	Latency
Nebius AI Studio	nebiusAiStudio	33K	-	$0.01/M	$0.03/M	214.8 t/s	607 ms

Provider

Model ID

Context

Max Output

Input Cost

Output Cost

Throughput

Latency

Nebius AI Studio

nebiusAiStudio

33K

$0.01/M

$0.03/M

214.8 t/s

607 ms

Qwen: Qwen2.5 Coder 7B Instruct

32,768 Token Context

Advanced Coding

Agentic Workflows

Available On

Do Work. With AI.