Auto-failover for long runs
Agents fan out dozens of LLM calls per task. One provider hiccup halfway through a run shouldn’t kill it. The fallback chain retries the next link transparently so the agent keeps going.
- Ordered fallback chain per model group
- Timeouts, 5xx, and circuit-breaks all trigger the next link
- Cross-provider failover for planning and tool-use calls alike
- Each fallback logged so you can see what degraded