Back
Mistral: Pixtral 12B
Mistral
Input: text
Input: image
Output: text
Released: Sep 10, 2024•Updated: Mar 28, 2025
The first multi-modal, text+image-to-text model from Mistral AI. Its weights were launched via torrent: https://x.com/mistralai/status/1833758285167722836.
32,768 Token Context
Process and analyze large documents and conversations.
Advanced Coding
Improved capabilities in front-end development and full-stack updates.
Agentic Workflows
Autonomously navigate multi-step processes with improved reliability.
Vision Capabilities
Process and understand images alongside text inputs.
Available On
Provider | Model ID | Context | Max Output | Input Cost | Output Cost | Throughput | Latency |
---|---|---|---|---|---|---|---|
Hyperbolic | hyperbolic | 33K | - | $0.10/M | $0.10/M | 65.8 t/s | 1969 ms |
Mistral | mistral | 131K | - | $0.15/M | $0.15/M | 75.3 t/s | 642 ms |
Standard Pricing
Input Tokens
$0.0000001
per 1K tokens
Output Tokens
$0.0000001
per 1K tokens
Image Processing
$0.0001445
per image