How we cut LLM costs 60% without locking features behind an enterprise tier
Four levers — gateway switch, Tier 3 prepay, provider reservations, free per-team budgets — stacked, with a worked $40k/mo example. Every number cited.
Two ways an LLM gateway can charge you: feature gating or platform-fee gating. Four competitors do the first; NemoRouter does the second. Public pricing pages, cited row by row, with the math for a $5k/mo team.
The wedge claim: NemoRouter is the only LLM gateway that gives every customer all enterprise features — guardrails, A/B tests, prompt management, evals, budgets — free for life, with every major LLM provider behind one API key. Tiers vary the platform fee (4% / 2% / 0%); they never lock features.
There are two ways an LLM gateway can charge you. The first is feature gating — the routing is free or cheap, but guardrails, evals, A/B tests, prompt management, and per-team budgets sit one tier up, then another tier up, then "contact sales". The second is platform-fee gating — every feature is on every tier; what varies is the percentage on top of provider cost.
Four of the five gateways teams actually shortlist in 2026 do the first thing. NemoRouter does the second. This post is the audit: we walk the public pricing pages of Portkey, LiteLLM, Helicone, and OpenRouter, and we mark — feature by feature — which tier you have to be on to use that feature today. Every claim cites the vendor's own public docs or pricing page with a 2026-05-16 audit timestamp.
We are explicit about where NemoRouter sits — that is the wedge: every governance feature you would otherwise pay an enterprise contract for is included on Tier 1 (PAYG, 4% platform fee) the moment you sign up.
Five-minute path: the gating matrix, what each vendor actually charges for, the math for a $5k/mo team.
A modern LLM gateway has three layers:
/chat/completions, Anthropic-shaped /messages. Commodity. Every
gateway ships this.Every gateway gives you Layer 1 cheaply. The question is what each gateway does with Layer 2 — bundle it, or sell it as the upgrade path. NemoRouter bundles all of Layer 2 on Tier 1 and earns Layer 3 economics on Tier 3 ($1,200/yr prepay, 0% platform fee) as customers scale. Competitors monetize Layer 2 by tier. The matrix below is just that decision, drawn out.
Read this as: "on which tier of this vendor does this feature first appear without a sales call?" Where a vendor distributes a feature across multiple tiers, we cite the lowest tier on which it appears at all. Every cell is sourced from the vendor's own public pricing or documentation page; Sources section at the bottom carries the URLs and audit dates.
| Capability | OpenRouter | Portkey | LiteLLM | Helicone | NemoRouter |
|---|---|---|---|---|---|
| Routing & fallback | Routing-first product | Yes, Pro+ | Yes, OSS + Cloud | Limited; observability-first | Yes, every tier |
| Guardrails (PII / jailbreak / regex) | Not offered as a product feature | Pro / Enterprise | Self-host config only | Pro / Enterprise | Free, every tier |
| A/B test routing | Not offered | Enterprise | Self-host config only | Not offered | Free, every tier |
| Per-team / per-customer budgets + virtual keys | Not offered | Enterprise | Self-host config only | Not offered | Free, every tier |
| Eval pipelines | Not offered | Enterprise | Self-host config only | Pro / Enterprise | Free, every tier |
| Prompt management & versioning | Not offered | Pro / Enterprise | Partial, self-host | Limited | Free, every tier |
| Observability & attribution (per-team, per-customer) | Spend dashboard only | Pro+ | Yes, OSS + Cloud | Yes — primary product | Yes, every tier |
| Annual prepay → 0% platform fee | 5% credit fee, no annual tier | Annual contract / sales | Cloud Enterprise contract; self-host always free | Annual contract / sales | Tier 3, $1,200/yr |
| Provider reservation pooling (PTU / GSU / Bedrock PT) | Not documented | Not documented | Not at the gateway layer | Not at the gateway layer | Yes, post-$10k ARR |
OpenRouter, Portkey, LiteLLM, and Helicone are trademarks of their respective owners. NemoRouter is not affiliated with or endorsed by any of these vendors. Every row is sourced from each vendor's public pricing or documentation page on 2026-05-16; if any has changed, email hello@nemorouter.ai and we'll update.
Five rows of the matrix are governance features. On NemoRouter, all five are on Tier 1. On Portkey, Helicone, and LiteLLM Cloud, at least three of the five sit on a paid plan or a sales call. On OpenRouter, four of the five are not documented as product features at all — OpenRouter is, by its own framing, a routing product.
It is fair to ask: what is each gateway monetizing? The answer differs across the four — and that difference is what makes the comparison interesting.
OpenRouter charges a 5% platform fee on credit purchases on top of provider pass-through. The product surface is routing: model catalog, fallback chains, an OpenAI-shaped endpoint. Guardrails, evals, A/B testing, and per-team budgets are not documented as product features. If you need any of those, the answer in 2026 is to layer a second tool on top of OpenRouter — typically Portkey or Helicone — and pay for that separately.
Verdict: best fit for hobbyists and prototypes where governance is genuinely DEFER. Anything else, you are paying 5% to do routing and then paying again for the layer that controls cost and quality. The deeper head-to-head is in the OpenRouter alternative comparison.
Portkey runs a tiered subscription. On the free tier you get routing and some observability; guardrails, prompt management, evals, A/B testing, and per-team budgets are distributed across the Pro and Enterprise plans. Annual prepay is an Enterprise contract that goes through sales.
Verdict: strong feature set, but governance is the monetization lever. If you need three or four of the Layer-2 features, you are buying Pro or Enterprise. That is a fine business model — it is just not the same model as ours, and a $5k/mo team should price both before committing.
LiteLLM is genuinely open source. If you self-host the OSS proxy, you get routing, guardrails configuration, virtual keys, and observability for $0 — modulo the operational cost of running it. LiteLLM Cloud and Enterprise are the managed-service path, and that is where the gating lives: governance and SSO and team budgets at scale sit on the paid contract.
Verdict: the right answer for teams who want to self-host. It is the only credible OSS option in this comparison. The trade is operational: you run the gateway, you handle the upgrades, you write the alerting. For mid-market SaaS adding AI as a side feature, that operational tax usually outweighs the saved subscription. For platform/infra engineers who already run Kubernetes and want air-gap, it is the right call. See the buyer's guide § Buyer C for the full decision tree.
Helicone is the observability product in this matrix. It does routing, but the primary surface is per-request logs, traces, attribution, cost rollups, and dashboards. Guardrails and eval pipelines sit on the Pro and Enterprise tiers; per-team budgets and virtual keys at scale are an Enterprise concern.
Verdict: best fit when observability is your #1 problem and routing is #2. If guardrails, evals, and budgets all matter at the same time, you are buying the higher tier — at which point compare the annual cost line-for-line with Tier 3 below.
NemoRouter charges a platform fee — 4% on Tier 1 PAYG, 2% on Tier 2 ($100/mo minimum), 0% on Tier 3 ($1,200/yr prepay). All Layer-2 features (guardrails, A/B tests, prompt management, evals, per-team budgets, RLS multi-tenancy) ship on Tier 1. The tier you choose changes the percentage we take on top of provider cost; it does not change the feature set.
The Layer-3 economic — pooling customer volume into Azure PTU / Vertex GSU / Bedrock Provisioned Throughput reservations — engages after we cross $10k ARR, and powers the Tier 3 0% platform fee as we grow. Customers continue paying retail PAYG provider rates; the reservation spread is our margin. Provider reservation pricing is documented up to 70% off PAYG on annual commits (linked in Sources).
Verdict: best fit for teams that want every governance feature today and a credible annual path to 0% platform fee. The wedge is in the first sentence of this post; if it stops being true, we have failed.
Pricing-page tables hide the punchline. So here is the same scenario priced four ways. Inputs: a team spending $5,000/month on provider tokens, who needs guardrails and per-team budgets today. We are not pricing seats, dashboards, or SSO add-ons — only the gateway. Each row uses the lowest tier on each vendor's published page that covers the two MUSTs (guardrails + per-team budgets); where the vendor distributes those features across plans, we use the lowest plan that has both.
| Vendor | Lowest tier covering both MUSTs | Annual cost shape (gateway only) |
|---|---|---|
| OpenRouter | Cannot satisfy guardrails without a second vendor | 5% × $60k = $3,000 + a second tool |
| Portkey | Pro or Enterprise (varies by feature) | Per Portkey's published Pro/Enterprise tiers; verify on the live pricing page |
| LiteLLM Cloud | Cloud Enterprise (managed path) — OSS self-host is $0 + operational cost | OSS: $0 license + ops; Cloud Enterprise: per published contract |
| Helicone | Pro or Enterprise (varies by feature) | Per Helicone's published Pro/Enterprise tiers; verify on the live pricing page |
| NemoRouter Tier 2 | $100/mo minimum, 2% platform fee, all features on Tier 1 baseline | 12 × $100 + 2% × $60k = $2,400 |
| NemoRouter Tier 3 | $1,200/yr prepay, 0% platform fee, all features included | $1,200 flat |
We deliberately do not fill in live Portkey, Helicone, and LiteLLM Cloud numbers in the cells — those drift week to week and we will not invent them. Click the URLs in Sources, take 90 seconds, fill the row. The arithmetic is one cell each. If you would like us to run the exercise against your own invoices, that is what the 30-minute walk-through on /community is for; the full worked example is in the cost teardown.
Two observations that hold regardless of which numbers you write in:
We would rather you skip the switch than switch on a math error. Three cases where another option is the right answer in 2026:
Three cases where NemoRouter dominates regardless of which competitor we compare against:
$1,000.** Tier 1's 4% beats every paid plan we have seen at that spend — and Tier 3's 0% is uncontested at annual prepay.
If you are already on a gateway and reading this from the inside, run this before you switch anything:
/pricing on NemoRouter. Note that all of those features are on
Tier 1.If the savings clear the migration cost, you have a deal worth closing. If they don't, stay where you are with a clean conscience — that is the audit working as intended.
Migration into NemoRouter from an OpenAI-shaped gateway (which every competitor in this post exposes) is two lines for an existing OpenAI integration:
# Before — direct to OpenAI (or to any OpenAI-shaped gateway)
client = OpenAI(api_key=os.environ["OPENAI_API_KEY"])
# After — through NemoRouter
client = OpenAI(
api_key=os.environ["NEMOROUTER_API_KEY"],
base_url="https://api.nemorouter.ai/v1",
)The Anthropic SDK accepts the same shape via its base_url parameter.
Typical scoping: half a day for one engineer plus 24–72 hours of dual-run.
Migration of prompt templates and guardrail configs is JSON in / JSON out —
not a rewrite.
We auto-grant $5 in API credits on signup, no card required. Enough to route 5–10 production prompts across Tier 1, exercise the guardrails + per-team budgets UI on your real traffic, and decide whether the Tier 3 math holds before you sign anything.
→ Start free at nemorouter.ai/signup — Tier 1, $5 credit, no card. Mid-market SaaS or larger? Bring your last 90 days of LLM invoices to a 30-min walk-through (book through /community) and we'll redo the matrix against your actual spend.
Vendor pricing/docs pages, verified 2026-05-16:
Four levers — gateway switch, Tier 3 prepay, provider reservations, free per-team budgets — stacked, with a worked $40k/mo example. Every number cited.
Head-to-head comparison: NemoRouter vs OpenRouter, Portkey, LiteLLM, Helicone. Guardrails, A/B tests, prompt management, evals, budgets — free on every tier. 4% PAYG, 0% on Tier 3 annual prepay. One base-URL switch.