groq-reference-architecture

Installation
SKILL.md

Groq Reference Architecture

Overview

Production architecture for ultra-fast LLM inference with Groq LPU. Covers model routing by latency requirements, streaming pipelines, fallback strategies, and integration patterns for real-time AI applications.

Prerequisites

  • Groq API key
  • groq-sdk npm package
  • Understanding of model capabilities (Llama, Mixtral)
  • Monitoring for latency and token usage

Architecture Diagram

Installs
25
GitHub Stars
2.4K
First Seen
Jan 25, 2026
groq-reference-architecture — jeremylongshore/claude-code-plugins-plus-skills