Back

Mistral: Pixtral 12B

Mistral
Input: text
Input: image
Output: text
Released: Sep 10, 2024Updated: Mar 28, 2025

The first multi-modal, text+image-to-text model from Mistral AI. Its weights were launched via torrent: https://x.com/mistralai/status/1833758285167722836.

32,768 Token Context

Process and analyze large documents and conversations.

Advanced Coding

Improved capabilities in front-end development and full-stack updates.

Agentic Workflows

Autonomously navigate multi-step processes with improved reliability.

Vision Capabilities

Process and understand images alongside text inputs.

Available On

ProviderModel IDContextMax OutputInput CostOutput CostThroughputLatency
Hyperbolichyperbolic33K-$0.10/M$0.10/M65.8 t/s1969 ms
Mistralmistral131K-$0.15/M$0.15/M75.3 t/s642 ms
Standard Pricing
Input Tokens
$0.0000001

per 1K tokens

Output Tokens
$0.0000001

per 1K tokens

Image Processing
$0.0001445

per image

Do Work. With AI.