Rate Limit Strategist Protocol

This skill designs the throttling and quota mechanisms that protect an API from noisy neighbors, accidental infinite loops in client code, and malicious abuse. It shifts the focus from "how to code it" to "what the limits should actually be."

Core assumption: Without rate limits, your API will eventually be DDOSed by your own front-end bug.

1. Algorithm Selection (Static)

Select the right rate-limiting algorithm based on traffic characteristics:

Token Bucket / Leaky Bucket: Best for general APIs. Allows small bursts of traffic (e.g., a burst of 10 requests) but smooths out average flow.
Fixed Window: Simple to implement (e.g., reset at the top of the minute), but vulnerable to edge spikes (submitting 100 requests at 00:59 and 100 at 01:00).
Sliding Window Log/Counter: More accurate, prevents edge spikes. Best for strict, paid-tier APIs.

2. Granularity & Dimensions

Rate limits should rarely be global. Define multiple layers:

Layer 1: Global/IP (Infrastructure): Prevent DDOS (e.g., 500 req/sec per IP at Cloudflare/WAF).
Layer 2: User Level (Application): Prevent noisy neighbors (e.g., 100 req/min for User A, 1000 req/min for Enterprise User B).
Layer 3: Endpoint Level (Business Logic): Highly restrictive on expensive endpoints (e.g., /export-pdf limited to 1 req/min).

rate-limit-strategist

Rate Limit Strategist Protocol

1. Algorithm Selection (Static)

2. Granularity & Dimensions

More from fatih-developer/fth-skills

task-decomposer

multi-brain-debate

context-compressor

multi-brain-score

checkpoint-guardian

multi-brain