Mistral: Pixtral 12B

Mistral

Input: text

Input: image

Output: text

Released: Sep 10, 2024•Updated: Mar 28, 2025

The first multi-modal, text+image-to-text model from Mistral AI. Its weights were launched via torrent: https://x.com/mistralai/status/1833758285167722836.

Process and analyze large documents and conversations.

Improved capabilities in front-end development and full-stack updates.

Autonomously navigate multi-step processes with improved reliability.

Process and understand images alongside text inputs.

Available On

Provider	Model ID	Context	Max Output	Input Cost	Output Cost	Throughput	Latency
Hyperbolic	hyperbolic	33K	-	$0.10/M	$0.10/M	68.0 t/s	1051 ms
Mistral	mistral	131K	-	$0.15/M	$0.15/M	76.7 t/s	571 ms

Input Tokens

per 1M tokens

$0.10

Output Tokens

per 1M tokens

$0.10

Image Processing

per image

$0.0001445