Introduction
Get started with NemoRouter — your enterprise LLM gateway
Introduction
NemoRouter is an enterprise LLM gateway that gives your team one API key, one bill, and zero provider configuration. Access models from OpenAI, Anthropic, Google, AWS Bedrock, Azure OpenAI, Meta, Mistral, and every major provider through a single OpenAI-compatible endpoint.

Why NemoRouter?
Managing multiple LLM providers is painful. Different API keys, different billing dashboards, different SDKs, different rate limits. NemoRouter eliminates all of that.
| Without NemoRouter | With NemoRouter |
|---|---|
| Manage API keys for every provider | One NEMOROUTER_API_KEY for everything |
| Separate billing per provider | Single credit balance, one invoice |
| Different SDKs and endpoints | OpenAI-compatible — use any existing SDK |
| No visibility across providers | Unified analytics, spend tracking, and logs |
| No safety controls | Built-in guardrails, budget controls, and rate limits |
Key Features

One Key, One Bill
Buy credits, call any model. NemoRouter manages all provider keys behind the scenes. You never touch a provider API key.
OpenAI-Compatible API
Use any OpenAI SDK or HTTP client. Just change the base URL to https://api.nemorouter.ai/v1 and swap your API key. Your existing code works immediately.
Every Major LLM Provider
Access models from every major provider through a single endpoint:
- OpenAI — GPT-4o, GPT-4 Turbo, o1, o3
- Anthropic — Claude 4 Opus, Claude 4 Sonnet, Claude 3.5 Haiku
- Google — Gemini 2.5 Pro, Gemini 2.5 Flash
- Meta — Llama 3.3, Llama 4 Scout, Llama 4 Maverick
- Mistral — Mistral Large, Codestral, Mistral Small
- And many more
Enterprise Controls
- Guardrails — Content safety, PII detection, prompt injection prevention
- Budget controls — Spending limits per key, team, or organization
- Rate limits — RPM and TPM controls to prevent runaway costs
- Team management — Roles, permissions, and per-team budgets
- Analytics — Real-time spend tracking, usage breakdowns, and request logs
Getting Started
Getting up and running with NemoRouter takes three steps:
1. Create an Account
Sign up at nemorouter.ai and complete the onboarding flow. You'll set up your organization and choose a plan.
2. Get Your API Key
After onboarding, create an API key from the API Keys page in your dashboard. Your key will look like sk-nemo-xxxx. Copy it immediately — it's only shown once.
3. Make Your First Request
curl https://api.nemorouter.ai/v1/chat/completions \
-H "Authorization: Bearer $NEMOROUTER_API_KEY" \
-H "Content-Type: application/json" \
-d '{
"model": "gpt-4o",
"messages": [
{"role": "user", "content": "Hello, NemoRouter!"}
]
}'That's it. You're calling GPT-4o through NemoRouter. Change the model field to claude-4-sonnet or gemini-2.5-pro to use a different provider — no configuration needed.
How It Works
Your App → NemoRouter API → Provider (OpenAI, Anthropic, Google, etc.)NemoRouter sits between your application and the LLM providers. Every request flows through NemoRouter, which handles:
- Authentication — Validates your API key and checks permissions
- Guardrails — Applies any configured safety controls
- Routing — Sends the request to the right provider
- Cost tracking — Records token usage and spend against your credit balance
- Response — Returns the provider's response with NemoRouter headers for observability
Pricing

NemoRouter uses a credit-based system with a platform fee that decreases as you scale:
| Tier | Monthly Minimum | Platform Fee | Features |
|---|---|---|---|
| Pay As You Go | $0 | 4% | All features included |
| Tier 2 | $100/mo | 2% | All features included |
| Tier 3 | $1,200/yr | 0% | All features included |
| Enterprise | Contact sales | 0% | Custom SLAs + support |
All features are available on every tier. Higher tiers simply reduce the platform fee charged on top of your credit purchases.
Next Steps
- Quick Start — Make your first API call in under 2 minutes
- Authentication — Learn how API keys and auth work
- Chat Completions — Explore the Chat Completions API
- Python SDK — Get started with the Python SDK