Fallback Policies automatically retry your requests with different models if one fails, ensuring your application stays reliable even when individual providers have issues.Documentation Index
Fetch the complete documentation index at: https://docs.requesty.ai/llms.txt
Use this file to discover all available pages before exploring further.
Set up fallback policies in the Requesty Console.
How It Works
Automatic failover on failure
If it fails (timeout, rate limit, error, etc.), the router immediately tries the next model in the chain.
Benefits
Higher Success Rates
No more failed requests due to provider issues.
Zero Downtime
Automatic failover without code changes.
Cost Optimization
Start with cheaper models, fall back to premium ones only when needed.
No Stalled Workflows
Your users never see “model unavailable” errors.
Creating a Fallback Policy
Create the Policy
Go to Routing Policies, click Create Policy, and select Fallback Chain as the policy type.

Configure Your Fallback Chain
Set up your models in priority order. For example:
The router will try each model in order, retrying the configured number of times before moving to the next.
| Priority | Model | Retries |
|---|---|---|
| 1st | anthropic/claude-sonnet-4-5 | 1 retry |
| 2nd | bedrock/claude-sonnet-4-5@eu-central-1 | 1 retry |
Use Cases
Cost-Effective GPT Chain
Cost-Effective GPT Chain
Start with cheaper models, only use expensive ones if needed:
| Priority | Model | Retries |
|---|---|---|
| 1st | openai/gpt-4o-mini | 2 retries |
| 2nd | openai/gpt-4o | 1 retry |
| 3rd | openai/gpt-5.2 | 1 retry |
Multi-Provider Reliability
Multi-Provider Reliability
Distribute across providers for maximum uptime:
| Priority | Model | Retries |
|---|---|---|
| 1st | openai/gpt-5.2 | 1 retry |
| 2nd | anthropic/claude-sonnet-4-5 | 1 retry |
| 3rd | google/gemini-2.5-pro | 1 retry |
Regional Failover
Regional Failover
Try regional endpoints before falling back to global:
| Priority | Model | Retries |
|---|---|---|
| 1st | bedrock/claude-sonnet-4-5@eu-central-1 | 2 retries |
| 2nd | anthropic/claude-sonnet-4-5 | 2 retries |
How Retries Work
Each model in the chain can have 0 to 10 retries. The router uses:| Mechanism | Description |
|---|---|
| Exponential backoff | Wait time increases between retries (500ms → 1s → 2s → 4s) |
| Jitter | Random variation (±10%) to prevent thundering herd |
| Immediate failover | On non-retryable errors (invalid request, auth failure) |
Key Selection (BYOK)
For each model, you can choose which API key to use:| Option | Description |
|---|---|
| Requesty provided key | Use Requesty’s managed keys (default) |
| My own key | Use your BYOK credentials |
| Requesty first, then mine | Fallback to BYOK if Requesty key fails |
| Mine first, then Requesty | Prefer BYOK, fallback to Requesty |
Monitoring and Debugging
Open Analytics
Go to Analytics.
FAQ
What happens if all models in the chain fail?
What happens if all models in the chain fail?
The request returns an error with details about the last model attempted. You will see all the failures in your request logs.
Can I nest policies?
Can I nest policies?
Yes. A fallback policy can reference another policy as one of its fallback options. For example, your second priority could be
policy/multi-provider-backup instead of a single model.Do I get charged for failed attempts?
Do I get charged for failed attempts?
No. You only pay for successful requests that return tokens. Failed attempts do not incur costs.
How do I update a policy?
How do I update a policy?
Click the edit icon next to your policy in the Routing Policies page. Changes take effect immediately, no code deployment needed.