Anthropic: Claude 3.5 Haiku (2024-10-22)
Claude 3.5 Haiku brings improvements across every skill set, including coding, tool use, and reasoning. As the fastest model in the Anthropic lineup, it offers rapid response times suited to highly interactive, low-latency applications such as user-facing chatbots and on-the-fly code completion. It also excels at specialized tasks like data extraction and real-time content moderation, making it a versatile tool for a broad range of industries.
It does not support image inputs.
See the launch announcement for details and benchmark results.
200,000 Token Context
Process and analyze large documents and conversations.
Advanced Coding
Improved capabilities in front-end development and full-stack updates.
Agentic Workflows
Autonomously navigate multi-step processes with improved reliability.
Vision Capabilities
Process and understand images alongside text inputs.
Available On
Provider | Model ID | Context | Max Output | Input Cost | Output Cost | Throughput | Latency |
---|---|---|---|---|---|---|---|
Anthropic | anthropic | 200K | 8K | $0.80/M | $4.00/M | 52.9 t/s | 1442 ms |
Google Vertex | vertex | 200K | 8K | $0.80/M | $4.00/M | 62.1 t/s | 2073 ms |
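The per-million-token prices in the table can be turned into a rough per-request estimate. The sketch below assumes the listed rates ($0.80/M input, $4.00/M output) apply uniformly to every token; the function name `estimate_cost_usd` and the example token counts are hypothetical and only illustrate the arithmetic.

```python
# Prices copied from the table above, in US dollars per million tokens.
INPUT_USD_PER_MTOK = 0.80
OUTPUT_USD_PER_MTOK = 4.00


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost of one request at the listed per-million-token rates."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = input_tokens * INPUT_USD_PER_MTOK + output_tokens * OUTPUT_USD_PER_MTOK
    return total / 1_000_000


# Hypothetical request: a 50,000-token document summarized in 1,000 tokens.
cost = estimate_cost_usd(50_000, 1_000)
print(f"${cost:.4f}")  # prints "$0.0440"
```

Because output tokens cost five times as much as input tokens at these rates, long generations tend to dominate the bill more than long prompts do.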