Anthropic: Claude 3 Haiku

Claude
Input: text
Input: image
Output: text
Released: Mar 13, 2024
Updated: Mar 28, 2025

Claude 3 Haiku is Anthropic's fastest and most compact model, built for
near-instant responsiveness. It delivers quick, accurate performance on targeted tasks.

See the launch announcement and benchmark results here

#multimodal

200,000 Token Context

Process and analyze large documents and conversations.

Advanced Coding

Improved capabilities in front-end development and full-stack updates.

Agentic Workflows

Autonomously navigate multi-step processes with improved reliability.

Vision Capabilities

Process and understand images alongside text inputs.

Available On

Provider   Model ID   Context  Max Output  Input Cost  Output Cost  Throughput  Latency
Anthropic  anthropic  200K     4K          $0.25/M     $1.25/M      144.9 t/s   924 ms
Google     vertex     200K     4K          $0.25/M     $1.25/M      156.2 t/s   1284 ms
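
The throughput and latency figures for each provider can be combined into a rough response-time estimate. The sketch below assumes that latency is the delay before the first token arrives and that the remaining tokens stream at the listed throughput; the page does not define either metric, so treat this as an approximation. The names `PROVIDERS` and `estimate_seconds` are illustrative and not part of any API.

```python
# Throughput (tokens/second) and latency (milliseconds) as listed above.
# Keys are the Model ID values from the same listing.
PROVIDERS = {
    "anthropic": {"throughput_tps": 144.9, "latency_ms": 924},
    "vertex": {"throughput_tps": 156.2, "latency_ms": 1284},
}


def estimate_seconds(output_tokens, throughput_tps, latency_ms):
    """Rough wall-clock estimate: initial latency plus streaming time.

    Assumption: latency is time to first token and throughput is steady.
    """
    return latency_ms / 1000 + output_tokens / throughput_tps


if __name__ == "__main__":
    for name, stats in PROVIDERS.items():
        seconds = estimate_seconds(500, **stats)
        print(f"{name}: about {seconds:.2f} s for 500 output tokens")
```

Under these assumptions, the provider with lower latency tends to finish short responses first, while the one with higher throughput catches up on long outputs.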

Standard Pricing

Input Tokens: $0.25 per 1M tokens
Output Tokens: $1.25 per 1M tokens
Image Processing: $0.0004 per image
Input Cache Read: $0.03 per 1M tokens
Input Cache Write: $0.30 per 1M tokens
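
To make the rates concrete, here is a minimal cost-estimation sketch using the standard prices listed above. It assumes the charges simply add up and that cached tokens are billed at the cache rates instead of the regular input rate; actual billing rules may differ by provider. The function name `estimate_cost` and its parameters are hypothetical.

```python
# Standard prices in USD, copied from the pricing list on this page.
PRICE_PER_M_INPUT = 0.25
PRICE_PER_M_OUTPUT = 1.25
PRICE_PER_M_CACHE_READ = 0.03
PRICE_PER_M_CACHE_WRITE = 0.30
PRICE_PER_IMAGE = 0.0004


def estimate_cost(input_tokens=0, output_tokens=0,
                  cache_read_tokens=0, cache_write_tokens=0, images=0):
    """Estimate the USD cost of one request.

    Assumption: every component is billed independently and summed.
    """
    token_cost = (
        input_tokens * PRICE_PER_M_INPUT
        + output_tokens * PRICE_PER_M_OUTPUT
        + cache_read_tokens * PRICE_PER_M_CACHE_READ
        + cache_write_tokens * PRICE_PER_M_CACHE_WRITE
    ) / 1_000_000
    return token_cost + images * PRICE_PER_IMAGE


if __name__ == "__main__":
    # Example: a 100,000-token document with three images and a 2,000-token reply.
    cost = estimate_cost(input_tokens=100_000, output_tokens=2_000, images=3)
    print(f"Estimated cost: ${cost:.4f}")
```

Because output tokens cost five times as much as input tokens, long generated responses tend to dominate the bill more than long prompts do.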