G
Groq✓
API
Groq
LPU Inference Engine, world's fastest inference speed. Free plan supports Llama 3.1 8B (30RPM/14.4K RPD), Llama 3.3 70B (30RPM/1K RPD), Qwen3 32B (60RPM/1K RPD) and more. OpenAI-compatible API.
Free Limits
Llama 8B: 30RPM/14.4K RPD; Llama 70B: 30RPM/1K RPD; Qwen3: 60RPM/1K RPD
Community Votes
1,258
■ Available Models
Llama 3.1 8B InstantLlama 3.3 70B VersatileLlama 4 Scout 17BQwen3 32BOpenAI gpt-oss-120b
■ Tags
Free TierFastest InferenceOpenAI CompatibleNo Credit Card