$5 free credits when you sign up
Tag

rate-limiting

4 posts tagged "rate-limiting".

Posts

Latest first

Guides

Budgets vs Rate Limits: Which Control to Reach For

Both return 429, but they solve different problems. Here is a decision guide for when to use a budget cap, when to use a rate limit, and why you almost always want both.

Nemo Team
7 min
Guides

Handling 429 and 402 Errors From an LLM Gateway

A 429 and a 402 mean different things and need different client logic. Here is how to handle rate limits, budget caps, and out-of-credits responses gracefully — with backoff, not blind retries.

Nemo Team
8 min
Engineering

RPM and TPM Rate Limiting Per Key, Team, and Org

Rate limits cap velocity, not total spend — and they're a security boundary, not a knob. Here is how RPM/TPM limits work per key, team, and org, and why the caller can never override them.

Nemo Team
8 min
Guides

The Four Ceilings Every LLM Request Passes

Before an LLM call leaves the gateway it clears four independent limits: credit balance, budget cap, rate limit, and guardrails. Here is what each one checks and why they are separate.

Nemo Team
8 min