mistral-rate-limits

Mistral Rate Limits

Overview

This skill covers rate limit management for the Mistral AI API. Mistral enforces per-workspace RPM (requests per minute) and TPM (tokens per minute) limits that vary by usage tier: the free Experiment tier or the pay-as-you-go Scale tier. View your workspace limits at admin.mistral.ai/plateforme/limits.

Prerequisites

  • Mistral API key configured
  • Understanding of workspace tier (Experiment vs Scale)
  • Application with retry infrastructure
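
As an illustration of the retry infrastructure mentioned in the last prerequisite, here is a minimal sketch of exponential backoff with jitter. `RateLimitError` and `call_with_backoff` are hypothetical names, not part of any Mistral SDK; in practice you would catch whatever exception your HTTP client raises on an HTTP 429 response.

```python
import random
import time


class RateLimitError(Exception):
    """Hypothetical stand-in for whatever your HTTP client raises on HTTP 429."""


def call_with_backoff(fn, max_retries=5, base_delay=1.0, max_delay=60.0,
                      sleep=time.sleep):
    """Call fn(); on RateLimitError, wait with backoff plus jitter, then retry."""
    for attempt in range(max_retries + 1):
        try:
            return fn()
        except RateLimitError:
            if attempt == max_retries:
                raise  # out of retries; surface the error to the caller
            # The delay ceiling doubles on each attempt, capped at max_delay.
            ceiling = min(max_delay, base_delay * (2 ** attempt))
            # Full jitter spreads retries so clients do not retry in lockstep.
            sleep(random.uniform(0, ceiling))
```

The `sleep` parameter is injectable only so the helper can be exercised in tests without real waiting. The default values are placeholders, not Mistral recommendations.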

Mistral Rate Limit Architecture

Limits are set at the workspace level, not per key. All API keys in a workspace share the same RPM/TPM budget.
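
Because the budget is shared, client-side throttling has to be keyed to the workspace rather than to an individual API key. The sketch below shows one way to do that with a single sliding 60-second window. `WorkspaceBudget` is a hypothetical helper, and the sliding window is an assumption: how Mistral measures the minute on the server side is not stated here.

```python
import time
from collections import deque


class WorkspaceBudget:
    """Sliding 60-second window of requests and tokens for one workspace."""

    def __init__(self, rpm, tpm, clock=time.monotonic):
        self.rpm = rpm
        self.tpm = tpm
        self.clock = clock
        self.events = deque()  # (timestamp, tokens) for each admitted request

    def _prune(self, now):
        # Drop events that have aged out of the 60-second window.
        while self.events and now - self.events[0][0] >= 60.0:
            self.events.popleft()

    def try_acquire(self, tokens):
        """Record the request and return True if it fits both budgets."""
        now = self.clock()
        self._prune(now)
        used_tokens = sum(t for _, t in self.events)
        if len(self.events) + 1 > self.rpm or used_tokens + tokens > self.tpm:
            return False
        self.events.append((now, tokens))
        return True
```

Every caller in the workspace should go through the same `WorkspaceBudget` instance, whichever API key it uses. This version is not thread-safe and only covers a single process; a multi-process deployment would need a shared store.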

Endpoint                What's limited
/v1/chat/completions    RPM + TPM (input + output)
/v1/embeddings          RPM + TPM (input only)
/v1/fim/completions     RPM + TPM
/v1/moderations         RPM
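
The table above can be encoded as a small lookup that estimates how many tokens a request draws from the TPM budget. This is a rough sketch: `TPM_COUNTED` and `tpm_cost` are hypothetical names, and the FIM entry is an assumption, since the table lists TPM for that endpoint without saying which tokens count.

```python
# Which token counts draw on the TPM budget, per the endpoint table.
TPM_COUNTED = {
    "/v1/chat/completions": ("input", "output"),
    "/v1/embeddings": ("input",),
    "/v1/fim/completions": ("input", "output"),  # assumed, not stated in the table
    "/v1/moderations": (),  # limited by RPM only
}


def tpm_cost(endpoint, input_tokens, output_tokens=0):
    """Estimate the tokens a single request charges against the TPM budget."""
    counted = TPM_COUNTED[endpoint]
    cost = 0
    if "input" in counted:
        cost += input_tokens
    if "output" in counted:
        cost += output_tokens
    return cost
```

Every request still counts once against RPM, whatever `tpm_cost` returns for it.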
First Seen
Jan 27, 2026