gpt-oss-20b is our medium-sized open-weight model for low latency, local, or specialized use-cases (21B parameters with 3.6B active parameters). Features permissive Apache 2.0 license, configurable reasoning effort (low, medium, high), full chain-of-thought access, fine-tunable parameters, and agentic capabilities including function calling, web browsing, Python code execution, and structured outputs.
Input: Text
Output: Text
Providers
novita
Credits
Context131k
Max Output131k
Input$0.050/1M
Output$0.200/1M
Cache Read—
Cache Write—
groq
Credits
Context131k
Max Output131k
Input$0.100/1M
Output$0.500/1M
Cache Read—
Cache Write—
openrouter
Credits
Context131k
Max Output131k
Input (Max)$0.110/1M
Output (Max)$0.530/1M
Cache Read—
Cache Write—
Quick Start
Use OpenAI GPT-OSS 20b through Helicone's AI Gateway with automatic logging and monitoring.