Back
Magnum v4 72B
Qwen
Input: text
Output: text
Released: Oct 22, 2024•Updated: Mar 28, 2025
This is a series of models designed to replicate the prose quality of the Claude 3 models, specifically Sonnet(https://openrouter.ai/anthropic/claude-3.5-sonnet) and Opus(https://openrouter.ai/anthropic/claude-3-opus).
The model is fine-tuned on top of Qwen2.5 72B.
16,384 Token Context
Process and analyze large documents and conversations.
Advanced Coding
Improved capabilities in front-end development and full-stack updates.
Agentic Workflows
Autonomously navigate multi-step processes with improved reliability.
Available On
Provider | Model ID | Context | Max Output | Input Cost | Output Cost | Throughput | Latency |
---|---|---|---|---|---|---|---|
Mancer (private) | mancer (private) | 16K | 1K | $2.50/M | $3.00/M | 16.3 t/s | 586 ms |
Infermatic | infermatic | 33K | - | $3.00/M | $3.00/M | 17.4 t/s | 3264 ms |
Featherless | featherless | 16K | 4K | $4.00/M | $6.00/M | 13.8 t/s | 8830 ms |
Standard Pricing
Input Tokens
$0.0000025
per 1K tokens
Output Tokens
$0.000003
per 1K tokens