OpenAI: GPT-4o (2024-08-06)
The 2024-08-06 version of GPT-4o offers improved performance in structured outputs, with the ability to supply a JSON schema in the respone_format. Read more here.
GPT-4o ("o" for "omni") is OpenAI's latest AI model, supporting both text and image inputs with text outputs. It maintains the intelligence level of GPT-4 Turbo while being twice as fast and 50% more cost-effective. GPT-4o also offers improved performance in processing non-English languages and enhanced visual capabilities.
For benchmarking against other models, it was briefly called "im-also-a-good-gpt2-chatbot"
128,000 Token Context
Process and analyze large documents and conversations.
Advanced Coding
Improved capabilities in front-end development and full-stack updates.
Agentic Workflows
Autonomously navigate multi-step processes with improved reliability.
Vision Capabilities
Process and understand images alongside text inputs.
Available On
Provider | Model ID | Context | Max Output | Input Cost | Output Cost | Throughput | Latency |
---|---|---|---|---|---|---|---|
OpenAI | openAi | 128K | 16K | $2.50/M | $10.00/M | 69.2 t/s | 589 ms |
Azure | azure | 128K | 16K | $2.50/M | $10.00/M | 129.0 t/s | 1237 ms |
per 1K tokens
per 1K tokens
per image
per 1K tokens
search threshold
Threshold: high
Request: $0.05
search threshold
Threshold: medium
Request: $0.035
search threshold
Threshold: low
Request: $0.03