Back

Mistral: Pixtral 12B

Mistral
Input: text
Input: image
Output: text
Released: Sep 10, 2024Updated: Mar 28, 2025

The first multi-modal, text+image-to-text model from Mistral AI. Its weights were launched via torrent: https://x.com/mistralai/status/1833758285167722836.

32,768 Token Context

Process and analyze large documents and conversations.

Advanced Coding

Improved capabilities in front-end development and full-stack updates.

Agentic Workflows

Autonomously navigate multi-step processes with improved reliability.

Vision Capabilities

Process and understand images alongside text inputs.

Available On

ProviderModel IDContextMax OutputInput CostOutput CostThroughputLatency
Hyperbolichyperbolic33K-$0.10/M$0.10/M68.0 t/s1051 ms
Mistralmistral131K-$0.15/M$0.15/M76.7 t/s571 ms

Standard Pricing

Input Tokens

per 1M tokens

$0.10

Output Tokens

per 1M tokens

$0.10

Image Processing

per image

$0.0001445

Do Work. With AI.