Do Services-as-Software

GPT-4.1 is a flagship large language model optimized for advanced instruction following, real-world software engineering, and long-context reasoning. It supports a 1 million token context window and outperforms GPT-4o and GPT-4.5 across coding (54.6% SWE-bench Verified), instruction compliance (87.4% IFEval), and multimodal understanding benchmarks. It is tuned for precise code diffs, agent reliability, and high recall in large document contexts, making it ideal for agents, IDE tooling, and enterprise knowledge retrieval.

Provider	Model ID	Context	Max Output	Input Cost	Output Cost	Throughput	Latency
OpenAI	openAi	1048K	33K	$2.00/M	$8.00/M	51.4 t/s	660 ms

Provider

Model ID

Context

Max Output

Input Cost

Output Cost

Throughput

Latency

OpenAI

openAi

1048K

33K

$2.00/M

$8.00/M

51.4 t/s

660 ms

OpenAI: GPT-4.1

1,047,576 Token Context

Advanced Coding

Agentic Workflows

Vision Capabilities

Available On

Do Work. With AI.