Grok 4 Fast is xAI's latest advancement in cost-efficient reasoning models. Built on xAI’s learnings from Grok 4, Grok 4 Fast delivers frontier-level performance across Enterprise and Consumer domains—with exceptional token efficiency. This model pushes the boundaries for smaller and faster AI, making high-quality reasoning accessible to more users and developers. Grok 4 Fast features state-of-the-art (SOTA) cost-efficiency, cutting-edge web and X search capabilities, a 2M token context window, and a unified architecture that blends reasoning and non-reasoning modes in one model.
Input: Text, Image
Output: Text
Providers
xai
Credits
Context2M
Max Output2M
Input$0.200/1M
Output$0.500/1M
Cache Read$0.050/1M
Cache Write—
helicone
Credits
Context2M
Max Output2M
Input$0.200/1M
Output$0.500/1M
Cache Read$0.050/1M
Cache Write—
Quick Start
Use xAI: Grok 4 Fast Reasoning through Helicone's AI Gateway with automatic logging and monitoring.