Requesty tracks every dollar you spend on AI β broken down by model, provider, user, API key, and time period. See exactly where your budget goes and how much youβre saving through caching and routing optimization.Documentation Index
Fetch the complete documentation index at: https://docs.requesty.ai/llms.txt Use this file to discover all available pages before exploring further.
Cost Overview
The Analytics dashboard shows your spending in real-time:- Total Spend: Current cost for the selected time period
- Cost Over Time: Bar chart showing daily/weekly/monthly cost trends
- Cost by Model: See which models cost the most
- Cost by User: Track per-user spending across your organization
- Projected Spend: Estimated end-of-period costs based on current rate
Cost Breakdown
Group your costs by any dimension:| Group By | What you see |
|---|---|
| Model | Cost per model (e.g., anthropic/claude-sonnet-4-5: 180) |
| Provider | Total spend per provider (Anthropic, OpenAI, Google, etc.) |
| User | Per-user cost attribution |
| API Key | Cost per API key β useful for tracking per-application spend |
| Custom Field | Any Request Metadata field (e.g., environment, feature) |
Savings Tracking
The Savings tab in analytics shows how much youβre saving:- Cost Savings ($): Total dollars saved through prompt caching
- Cost Savings (%): Percentage saved vs. what you would have paid without caching β shown as a gauge chart
- Cache Hit Rate: What percentage of your requests hit cache
- Token Cache Rate: What percentage of tokens are served from cache
- Per-Model Savings: Table showing which models benefit most from caching
BYOK Cost Comparison
When using Bring Your Own Keys, the Cost Overview chart splits into:- Requesty Cost: What you pay through Requesty-managed keys
- Provider Cost: What you pay directly to providers through your own keys
Budget Controls
Control spending at multiple levels:- Project Limits: Set monthly spend caps per project β each userβs Private project can have its own limit
- API Key Limits: Set monthly spend caps per individual API key
- Spending Alerts: Get notified via email or webhook when spending exceeds configurable thresholds
- Group Budgets: Set spending limits for teams
Cost Optimization Tips
Use Cheaper Models Where Possible
Not every task needs a frontier model. Use the Advanced analytics tab to find tasks where a cheaper model would work:- Route simple classification tasks to
openai/gpt-4o-miniinstead ofgpt-4.1 - Use
anthropic/claude-haiku-4-5for summarization instead of Sonnet
Enable Caching
Auto Caching can reduce costs by 25-90% for workloads with repeated system prompts or similar queries. Check your Savings tab to see your current cache hit rate.Use Routing Policies
Fallback Policies can start with cheaper models and only fall back to expensive ones when needed β automatically reducing your average cost per request.Track with Metadata
Tag requests with Request Metadata to understand cost by feature, environment, or customer:feature, environment, or customer_id in the Advanced analytics tab to see cost breakdowns.
Export Cost Data
From the Advanced tab, export your cost data as CSV:- Set Metric to cost, Group By to model (or any dimension)
- Choose your time range
- Click Export CSV in the Data Summary table
Integration
- Usage Analytics β Full dashboard overview
- Performance Monitoring β Correlate cost with latency
- Spend Limits β Set budget caps per project or API key
- Spending Alerts β Automated notifications on spend thresholds
- Request Metadata β Custom cost attribution dimensions