Back

OpenAI: GPT-4o (2024-08-06)

GPT
Input: text
Input: image
Input: file
Output: text
Released: Aug 6, 2024Updated: Apr 23, 2025

The 2024-08-06 version of GPT-4o offers improved performance in structured outputs, with the ability to supply a JSON schema in the respone_format. Read more here.

GPT-4o ("o" for "omni") is OpenAI's latest AI model, supporting both text and image inputs with text outputs. It maintains the intelligence level of GPT-4 Turbo while being twice as fast and 50% more cost-effective. GPT-4o also offers improved performance in processing non-English languages and enhanced visual capabilities.

For benchmarking against other models, it was briefly called "im-also-a-good-gpt2-chatbot"

128,000 Token Context

Process and analyze large documents and conversations.

Advanced Coding

Improved capabilities in front-end development and full-stack updates.

Agentic Workflows

Autonomously navigate multi-step processes with improved reliability.

Vision Capabilities

Process and understand images alongside text inputs.

Available On

ProviderModel IDContextMax OutputInput CostOutput CostThroughputLatency
Azureazure128K16K$2.50/M$10.00/M120.0 t/s1114 ms
OpenAIopenAi128K16K$2.50/M$10.00/M47.7 t/s548 ms

Standard Pricing

Input Tokens

per 1M tokens

$2.50

Output Tokens

per 1M tokens

$10.00

Image Processing

per image

$0.003613

Input Cache Read

per 1M tokens

$1.25

Variable Pricing Tiers

search threshold

Threshold: high

Request: $0.05

search threshold

Threshold: medium

Request: $0.035

search threshold

Threshold: low

Request: $0.03

Do Work. With AI.