Qwen3-32B is a 32.8 billion parameter language model that uniquely supports seamless switching between thinking mode for complex reasoning tasks and non-thinking mode for efficient general dialogue within a single model. The model excels across 100+ languages with enhanced reasoning capabilities, superior human preference alignment, and strong agent-based task performance, supporting up to 131,072 tokens with YaRN extension.
Input: Text
Output: Text
Providers
groq
Credits
Context131k
Max Output41k
Input$0.290/1M
Output$0.590/1M
Cache Read—
Cache Write—
openrouter
Credits
Context131k
Max Output41k
Input (Max)$0.422/1M
Output (Max)$0.844/1M
Cache Read—
Cache Write—
Quick Start
Use Qwen3 32B through Helicone's AI Gateway with automatic logging and monitoring.