alibaba: Qwen3 32B

qwen3-32b
Context: 131k
Max Output: 41k
Input: $0.290/1M
Output: $0.590/1M
Qwen3-32B is a 32.8 billion parameter language model that uniquely supports seamless switching between thinking mode for complex reasoning tasks and non-thinking mode for efficient general dialogue within a single model. The model excels across 100+ languages with enhanced reasoning capabilities, superior human preference alignment, and strong agent-based task performance, supporting up to 131,072 tokens with YaRN extension.
Input: Text
Output: Text

Providers

groq
Credits
Context131k
Max Output41k
Input$0.290/1M
Output$0.590/1M
Cache Read
Cache Write
openrouter
Credits
Context131k
Max Output41k
Input (Max)$0.422/1M
Output (Max)$0.844/1M
Cache Read
Cache Write

Quick Start

Use Qwen3 32B through Helicone's AI Gateway with automatic logging and monitoring.