New: 0% platform fee on Tier 2+. Upgrade from 4% Pay As You Go. Start now
Provider-agnostic routing

One API.
200+ models.

Call OpenAI, Anthropic, Google, Meta, and Mistral through a single endpoint. We manage every provider key. You just pick a model and send your request.

routing-demo.py
# Change the base_url and API key; everything else stays the same
from openai import OpenAI

client = OpenAI(
    api_key="sk-nemo-...",  # your Nemo key
    base_url="https://api.nemorouter.com/v1",
)

# Call any model from any provider
response = client.chat.completions.create(
    model="gpt-4o",  # or claude-3.5-sonnet, gemini-pro...
    messages=[{"role": "user", "content": "Hello!"}],
)
print(response.choices[0].message.content)
routed via load-balancer

Routing capabilities

Smart routing. Zero effort.

Smart Load Balancing

usage-based · latency-based

Routes requests across multiple deployments using configurable strategies. Distribute load by usage, optimize for latency, or minimize cost. Automatic health-aware routing skips unhealthy endpoints.

Request → Router → GPT-4o | Claude 3.5 | Gemini

strategy: usage-based · cooldown: 5s
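
To make the usage-based strategy and health-aware skipping concrete, here is a minimal Python sketch. It is an illustration only, not Nemo's implementation: the `Deployment` class, the `in_flight` usage counter, and both function names are assumptions made for this example. The 5-second cooldown mirrors the value shown above.

```python
import time
from dataclasses import dataclass


@dataclass
class Deployment:
    """One routable model deployment (hypothetical structure)."""
    name: str
    in_flight: int = 0            # current usage, e.g. active requests
    unhealthy_until: float = 0.0  # monotonic timestamp; 0.0 means healthy


def mark_unhealthy(dep, cooldown_s=5.0, now=None):
    """Take a deployment out of rotation for cooldown_s seconds."""
    now = time.monotonic() if now is None else now
    dep.unhealthy_until = now + cooldown_s


def pick_usage_based(deployments, now=None):
    """Return the healthy deployment with the lowest current usage."""
    now = time.monotonic() if now is None else now
    healthy = [d for d in deployments if d.unhealthy_until <= now]
    if not healthy:
        raise RuntimeError("no healthy deployments available")
    return min(healthy, key=lambda d: d.in_flight)
```

A latency-based strategy could follow the same shape by swapping the `min` key for a recent-latency measurement.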

Automatic Fallbacks

seamless retry

If the primary model fails or times out, seamlessly retry on a backup provider. Your users never see the error.

Primary → Fallback
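
The fallback behavior described above can be pictured roughly as follows. This is a sketch under stated assumptions, not the router's actual code: `complete_with_fallback` and the `send` callable are hypothetical names, and `send` stands in for whatever performs the real provider call.

```python
def complete_with_fallback(send, models, messages):
    """Try each model in order and return (model, response) for the first success.

    `send` is any callable taking (model, messages). It is a stand-in for a
    real provider call and is an assumption of this sketch.
    """
    errors = []
    for model in models:
        try:
            return model, send(model, messages)
        except Exception as exc:  # a real router would catch narrower errors
            errors.append((model, exc))
    raise RuntimeError(f"all models failed: {errors}")
```

The caller only sees an error if every model in the list fails, which is the sense in which the failure of the primary stays hidden.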

Retries & Timeouts

configurable

Set per-org retry count, timeout duration, and cooldown between attempts. Fine-tune reliability for your workload.

retries: 3
timeout: 30s
cooldown: 5s
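
As a rough illustration of how these three settings interact, the sketch below applies them in a client-side loop. It is not the per-org configuration API, which this page does not show. `RetryPolicy` and `call_with_retries` are hypothetical names, and treating `retries: 3` as three extra attempts after the first is an assumption.

```python
import time
from dataclasses import dataclass


@dataclass(frozen=True)
class RetryPolicy:
    retries: int = 3          # assumed: extra attempts after the first try
    timeout_s: float = 30.0   # per-attempt timeout handed to the call
    cooldown_s: float = 5.0   # pause between attempts


def call_with_retries(call, policy, sleep=time.sleep):
    """Run call(timeout_s) until it succeeds or attempts run out."""
    last_error = RuntimeError("no attempts were made")
    for attempt in range(policy.retries + 1):
        try:
            return call(policy.timeout_s)
        except Exception as exc:
            last_error = exc
            if attempt < policy.retries:
                sleep(policy.cooldown_s)  # cool down before the next attempt
    raise last_error
```

Passing `sleep` as a parameter keeps the helper easy to test without real delays.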

OpenAI-Compatible API

zero migration

Drop-in compatible with the OpenAI SDK. Change two lines — base URL and API key — and you're routing through 200+ models. Works with Python, Node.js, Go, Ruby, Java, C#, PHP, and Rust.

- base_url="https://api.openai.com/v1"
+ base_url="https://api.nemorouter.com/v1"
- api_key="sk-proj-..."
+ api_key="sk-nemo-..."

Model Catalog

200+ models from every major provider. New models added within hours of launch.

OpenAI · Anthropic · Google · Meta · Mistral · +4 more

Tag-Based Routing

Filter models by capability tags — vision, code, long-context, multilingual. Route requests to the right subset.

vision · code · long-context · multilingual
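
One way to picture tag filtering is a simple set check, sketched below. The catalog contents, the placeholder model names, and `models_with_tags` are all hypothetical. How tags are actually passed in a request is not documented on this page, so the sketch covers only the filtering step.

```python
# Hypothetical catalog: model names and tag assignments are illustrative only.
CATALOG = {
    "model-a": {"vision", "code"},
    "model-b": {"code", "long-context"},
    "model-c": {"vision", "long-context", "multilingual"},
}


def models_with_tags(catalog, required):
    """Return the sorted names of models that carry every required tag."""
    wanted = set(required)
    return sorted(name for name, tags in catalog.items() if wanted <= tags)
```

A router could then apply its load-balancing strategy to the filtered subset rather than to the full catalog.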

The routing pipeline

Every request flows through the same path. Guardrails, routing, and cost tracking happen transparently.

1. Your app: OpenAI SDK

2. Nemo Backend: Guardrails + config

3. Strategy select: Load balance / fallback

4. Provider: GPT-4o, Claude, etc.

5. Response: Post-scan + deliver
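
Read as code, the path above is a fixed sequence of stages. The sketch below shows only that ordering. `run_pipeline` and its four stage callables are hypothetical placeholders, and what each stage does internally is not specified on this page.

```python
def run_pipeline(request, pre_scan, select, call_provider, post_scan):
    """Pass a request through the stages in a fixed order."""
    checked = pre_scan(request)           # guardrails + config
    target = select(checked)              # strategy select
    raw = call_provider(target, checked)  # provider call
    return post_scan(raw)                 # post-scan + deliver
```

Because every request takes the same path, each stage can be swapped or tested in isolation.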

<1ms routing overhead · 200+ models · 15+ providers


Route with confidence

200+ models. Automatic fallbacks. Zero provider config. Your first request routes automatically.