DeepSeek-V3.1 Terminus is an update to DeepSeek V3.1 that maintains the model's original capabilities while addressing issues reported by users, including language consistency and agent capabilities, further optimizing the model's performance in coding and search agents. It is a large hybrid reasoning model (671B parameters, 37B active) that supports both thinking and non-thinking modes. It extends the DeepSeek-V3 base with a two-phase long-context training process. Users can control the reasoning behaviour with the reasoning enabled boolean. The model improves tool use, code generation, and reasoning efficiency, achieving performance comparable to DeepSeek-R1 on difficult benchmarks while responding more quickly. It supports structured tool calling, code agents, and search agents, making it suitable for research, coding, and agentic workflows.
Input: Text
Output: Text
Providers
deepinfra
Credits
Context128k
Max Output16k
Input$0.270/1M
Output$1.00/1M
Cache Read$0.216/1M
Cache Write—
novita
Credits
Context128k
Max Output16k
Input$0.270/1M
Output$1.00/1M
Cache Read—
Cache Write—
Quick Start
Use DeepSeek V3.1 Terminus through Helicone's AI Gateway with automatic logging and monitoring.