View cache analytics in the Requesty Console.
How Auto Cache Works
Theauto_cache flag is a boolean parameter sent within the requesty field in your request payload.
| Value | Behavior |
|---|---|
true | Instructs the router to cache the response from the provider |
false | Bypasses caching for this request (useful when cache writes have extra costs) |
| Not provided | Falls back to default behavior based on request origin (e.g., Cline and Roo Code default to caching) |
How to Use Auto Cache
Include theauto_cache flag within the requesty object in your request:
Important Notes
Provider Support: The
auto_cache flag is respected by providers where cache writes incur extra costs, including Anthropic and Gemini.- Explicit Control:
auto_cacheprovides explicit control. Set totrueto attempt caching,falseto prevent caching for providers where cache writes incur extra costs. - Default Behavior: If
auto_cacheis not specified, the caching behavior reverts to defaults based on request origin. - Cost Savings: Cache hits are billed at a fraction of the normal input token cost. This is especially effective for applications with large system prompts or knowledge bases.