Skip to main content

Documentation Index

Fetch the complete documentation index at: https://reedai-07fa30f1.mintlify.app/llms.txt

Use this file to discover all available pages before exploring further.

Groq provides blazingly fast AI inference using custom LPU (Language Processing Unit) hardware, delivering the fastest response times available.

Available models

Llama 3 (via Groq)

Strengths: Extreme speed, low latencyFastest inference available

Mixtral (via Groq)

Strengths: Speed with capabilityFast mixture-of-experts model

Key features

  • Extreme speed: Fastest inference in the industry
  • Low latency: Sub-second response times
  • High throughput: Process many requests quickly
  • Competitive quality: Good model performance

Best use cases

  • Real-time applications
  • Interactive chat experiences
  • High-volume API processing
  • Latency-sensitive applications
  • Rapid prototyping
Groq is ideal when speed is the primary concern. Use for real-time applications where immediate responses matter.

All models

Browse all models

OpenAI

GPT models for capability