Engineering
Reading x-litellm-response-cost While Streaming
When a response streams token by token, the final cost arrives last. Here is how NemoRouter captures the authoritative per-call cost from the response header and settles credits exactly, without computing the cost itself.
Nemo Team
8 min