Groq Models

Available models
Key features
Best use cases
Related models

Groq provides blazingly fast AI inference using custom LPU (Language Processing Unit) hardware, delivering the fastest response times available.

Available models

Llama 3 (via Groq)

Strengths: Extreme speed, low latencyFastest inference available

Mixtral (via Groq)

Strengths: Speed with capabilityFast mixture-of-experts model

Key features

Extreme speed: Fastest inference in the industry
Low latency: Sub-second response times
High throughput: Process many requests quickly
Competitive quality: Good model performance

Best use cases

Real-time applications
Interactive chat experiences
High-volume API processing
Latency-sensitive applications
Rapid prototyping

Groq is ideal when speed is the primary concern. Use for real-time applications where immediate responses matter.

All models

Browse all models

OpenAI

GPT models for capability

DeepSeek Models xAI (Grok) Models

Overview

Getting Started

Chat Interface

Projects & Teams

Tools & Extensions

Assistants

Prompts & Memory

Integrations & Connectors

Models & Providers

Account & Billing

Troubleshooting

Available models

Llama 3 (via Groq)

Mixtral (via Groq)

Key features

Best use cases

All models

OpenAI

Overview

Getting Started

Chat Interface

Projects & Teams

Tools & Extensions

Assistants

Prompts & Memory

Integrations & Connectors

Models & Providers

Account & Billing

Troubleshooting

Documentation Index

​Available models

Llama 3 (via Groq)

Mixtral (via Groq)

​Key features

​Best use cases

​Related models

All models

OpenAI

Available models

Key features

Best use cases

Related models