Documentation Index
Fetch the complete documentation index at: https://docs.requesty.ai/llms.txt
Use this file to discover all available pages before exploring further.
Enforce SSO for your organization
Lock down access with Entra ID (Azure AD), Okta, or any OIDC/SAML provider. Members authenticate through your identity provider and land directly in Requesty.Set up SSO βRestrict models per API key with Access Lists
Create named model allow-lists and attach them to individual API keys or groups. Control exactly which models each key can call without touching your org-wide settings.Create an access list βUse the Responses API through Requesty
Route OpenAI/v1/responses calls through the gateway with full analytics, fallback, and cost tracking. Custom tool types are supported.Responses API reference βSee the cost of every request inline
API responses now include ausage.cost field with the exact dollar amount. For streaming, set stream_options.include_usage to get cost on the final chunk.Cost tracking docs βQuery organization-level usage
A new Management API endpoint returns aggregated spend and token counts across your entire org, with the same time filters available on key-level usage.Org Usage API βFilter models by deployment region
The/v1/models endpoint now returns geolocation data for each model. The Model Library shows EU/US region chips so you can pick the right model before routing.EU routing docs βRoute Pi through Requesty
Connect the Pi coding agent for model routing, cost tracking, and fallback policies across your coding workflows.Set up Pi βClearer error messages from every provider
Context length overflows, unsupported image formats, and other provider errors are now translated into plain, actionable messages instead of generic errors.Generate speech and transcribe audio
Two new endpoints:/v1/audio/speech for text-to-speech and /v1/audio/transcriptions for speech-to-text. Multiple providers including OpenAI and Mistral with automatic fallback.Speech API β Β· Transcription API βEdit images through the gateway
Send image edit requests through/v1/images/edits with the same multi-provider routing and fallback as generation.Image Edits API βTag traffic by app with analytics headers
PassHTTP-Referer and X-Title headers to label requests by app or site. Filter your analytics dashboard by these values to see cost and latency per integration.Analytics headers docs βGet alerted before budgets run out
Set dollar thresholds on API keys and receive Slack or Microsoft Teams webhooks when spend crosses them. Configurable trigger percentages give you time to act.Configure alerts βRoute Claude Cowork through your org
Use Requesty as the backend for the Claude Cowork desktop assistant. All traffic gets unified analytics, cost controls, and model policies.Set up Claude Cowork βConnect OpenCode with one-liner analytics
Route OpenCode terminal agent traffic through Requesty. A one-line installer adds analytics tracking to your setup.Set up OpenCode βControl guardrail actions per policy
The admin panel now lets you set each guardrail policy to Disabled, Report, or Mask individually. A new violations column and detail tab in logs shows exactly what fired.Guardrails docs βStructured outputs work with the Responses API
JSON Schema and json_object modes are now available on/v1/responses, matching the Chat Completions feature set.Structured outputs docs βAzure EU regions auto-detected
Azure deployments across European regions are now automatically recognized under the EU filter in the Model Library and routing engine.EU routing docs βRedesigned model management
Provider grouping with expand/collapse, region and capability filters, bulk approve/remove, preset quick-filters, and a βNewβ tab that surfaces recently released models per provider.Manage approved models βCompare models side by side in the playground
Pick two models, send the same prompt, and see which responds better. The redesigned chat playground also supports image attachments and markdown rendering.See service account details at a glance
Expandable table rows now show each service accountβs API keys, monthly spend, and creator at a glance.Service accounts docs βSend PDFs in your requests
The gateway extracts and formats PDF content across providers that support document input. Just include the file in your chat completions request.PDF support docs βPick a use case, skip the model selection
Dedicated model aliases likecoding/ select the right model, provider, and parameters for your workload automatically.Dedicated models docs β