Google: Gemini 1.5 Flash
Gemini 1.5 Flash is a foundation model that performs well at a variety of multimodal tasks such as visual understanding, classification, summarization, and creating content from image, audio and video. It's adept at processing visual and text inputs such as photographs, documents, infographics, and screenshots.
Gemini 1.5 Flash is designed for high-volume, high-frequency tasks where cost and latency matter. On most common tasks, Flash achieves quality comparable to Gemini Pro models at a significantly reduced cost. It is well suited to applications such as chat assistants and on-demand content generation, where speed and scale are the priority.
Usage of Gemini is subject to Google's Gemini Terms of Use.
#multimodal
1,000,000 Token Context
Process and analyze large documents and conversations.
Advanced Coding
Improved capabilities in front-end development and full-stack updates.
Agentic Workflows
Autonomously navigate multi-step processes with improved reliability.
Vision Capabilities
Process and understand images alongside text inputs.
Available On
Provider | Model ID | Context | Max Output | Input Cost | Output Cost | Throughput | Latency |
---|---|---|---|---|---|---|---|
Google Vertex | vertex | 1000K | 8K | $0.07/M | $0.30/M | 152.9 t/s | 343 ms |
Google AI Studio | | 1000K | 8K | $0.07/M | $0.30/M | 154.4 t/s | 443 ms |
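
The throughput and latency columns can be combined into a rough response-time estimate. The Python sketch below uses a hypothetical helper, `estimate_response_seconds`, and assumes that latency is the time to the first token and that throughput is a steady output rate. The page states neither, so treat the result as a ballpark figure only.

```python
def estimate_response_seconds(output_tokens: int,
                              throughput_tps: float,
                              latency_ms: float) -> float:
    """Rough wall-clock estimate for one response, in seconds.

    Assumed model (not stated on the page):
        total = time to first token + output tokens / tokens per second
    """
    if output_tokens < 0:
        raise ValueError("output_tokens must be non-negative")
    if throughput_tps <= 0:
        raise ValueError("throughput_tps must be positive")
    return latency_ms / 1000.0 + output_tokens / throughput_tps


if __name__ == "__main__":
    # Figures from the Google Vertex row: 152.9 t/s, 343 ms.
    seconds = estimate_response_seconds(500, throughput_tps=152.9, latency_ms=343)
    print(f"Estimated time for 500 output tokens: {seconds:.2f} s")
```

On this model the two providers differ mainly in the fixed latency term, so the gap matters most for short responses.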
Prompt threshold: 128,000 tokens
Above the threshold: Prompt $0.00000015 / Completion $0.0000006 per token ($0.15/M and $0.60/M)
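
The pricing figures above can be turned into a rough per-request cost estimate. The Python sketch below uses a hypothetical helper, `estimate_cost`, and rests on two assumptions the page does not spell out: that the threshold figures are per-token rates (which is what makes them line up with the per-million prices in the table), and that once a prompt exceeds 128,000 tokens the higher rates apply to the whole request.

```python
# Rates in USD per million tokens.
BASE_INPUT_PER_M = 0.07    # from the "Input Cost" column
BASE_OUTPUT_PER_M = 0.30   # from the "Output Cost" column

# Threshold rates, read as USD per token (an assumption) and
# restated per million tokens: 0.00000015 -> 0.15, 0.0000006 -> 0.60.
LONG_INPUT_PER_M = 0.15
LONG_OUTPUT_PER_M = 0.60

PROMPT_THRESHOLD = 128_000  # tokens


def estimate_cost(prompt_tokens: int, completion_tokens: int) -> float:
    """Return an estimated request cost in USD.

    Assumes the higher rates cover the entire request whenever the
    prompt is longer than PROMPT_THRESHOLD tokens.
    """
    if prompt_tokens < 0 or completion_tokens < 0:
        raise ValueError("token counts must be non-negative")

    if prompt_tokens > PROMPT_THRESHOLD:
        input_rate, output_rate = LONG_INPUT_PER_M, LONG_OUTPUT_PER_M
    else:
        input_rate, output_rate = BASE_INPUT_PER_M, BASE_OUTPUT_PER_M

    total = prompt_tokens * input_rate + completion_tokens * output_rate
    return total / 1_000_000


if __name__ == "__main__":
    for prompt, completion in [(100_000, 1_000), (200_000, 1_000)]:
        cost = estimate_cost(prompt, completion)
        print(f"{prompt:>7} prompt + {completion} completion tokens: ${cost:.4f}")
```

Because the table rounds prices to two decimals, estimates from this sketch may differ slightly from an actual bill.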