meta-llama: Meta Llama 3.3 70B Instruct

llama-3.3-70b-instruct
Context: 128k
Max Output: 16k
Input: $0.130/1M
Output: $0.390/1M
The Meta Llama 3.3 multilingual large language model (LLM) is a pretrained and instruction tuned generative model in 70B (text in/text out). The Llama 3.3 instruction tuned text only model is optimized for multilingual dialogue use cases and outperforms many of the available open source and closed chat models on common industry benchmarks. Supported languages: English, German, French, Italian, Portuguese, Hindi, Spanish, and Thai.
Input: Text
Output: Text

Providers

novita
Credits
Context128k
Max Output16k
Input$0.130/1M
Output$0.390/1M
Cache Read
Cache Write
openrouter
Credits
Context128k
Max Output16k
Input (Max)$0.950/1M
Output (Max)$2.37/1M
Cache Read
Cache Write

Quick Start

Use Meta Llama 3.3 70B Instruct through Helicone's AI Gateway with automatic logging and monitoring.