openai: OpenAI GPT-4.1

gpt-4.1

Context: 1M

Max Output: 33k

Input: $2.00/1M

Output: $8.00/1M

GPT-4.1 is a flagship large language model optimized for advanced instruction following, real-world software engineering, and long-context reasoning. It supports a 1 million token context window and outperforms GPT-4o and GPT-4.5 across coding (54.6% SWE-bench Verified), instruction compliance (87.4% IFEval), and multimodal understanding benchmarks. It is tuned for precise code diffs, agent reliability, and high recall in large document contexts, making it ideal for agents, IDE tooling, and enterprise knowledge retrieval.

Input: Text, Image

Output: Text

Providers

openai

Credits

Context1M

Max Output33k

Input$2.00/1M

Output$8.00/1M

Cache Read$0.500/1M

Cache Write—

azure

Credits

Context1M

Max Output33k

Input$2.00/1M

Output$8.00/1M

Cache Read$0.500/1M

Cache Write—

helicone

Credits

Context1M

Max Output33k

Input$2.00/1M

Output$8.00/1M

Cache Read$0.500/1M

Cache Write—

openrouter

Credits

Context1M

Max Output33k

Input (Max)$2.11/1M

Output (Max)$8.44/1M

Cache Read—

Cache Write—

Quick Start

Use OpenAI GPT-4.1 through Helicone's AI Gateway with automatic logging and monitoring.