Skip to main content

Documentation Index

Fetch the complete documentation index at: https://docs.requesty.ai/llms.txt

Use this file to discover all available pages before exploring further.

May 2026
NewAPISecurity

Enforce SSO for your organization

Lock down access with Entra ID (Azure AD), Okta, or any OIDC/SAML provider. Members authenticate through your identity provider and land directly in Requesty.Set up SSO β†’

Restrict models per API key with Access Lists

Create named model allow-lists and attach them to individual API keys or groups. Control exactly which models each key can call without touching your org-wide settings.Create an access list β†’

Use the Responses API through Requesty

Route OpenAI /v1/responses calls through the gateway with full analytics, fallback, and cost tracking. Custom tool types are supported.Responses API reference β†’

See the cost of every request inline

API responses now include a usage.cost field with the exact dollar amount. For streaming, set stream_options.include_usage to get cost on the final chunk.Cost tracking docs β†’

Query organization-level usage

A new Management API endpoint returns aggregated spend and token counts across your entire org, with the same time filters available on key-level usage.Org Usage API β†’

Filter models by deployment region

The /v1/models endpoint now returns geolocation data for each model. The Model Library shows EU/US region chips so you can pick the right model before routing.EU routing docs β†’

Route Pi through Requesty

Connect the Pi coding agent for model routing, cost tracking, and fallback policies across your coding workflows.Set up Pi β†’

Clearer error messages from every provider

Context length overflows, unsupported image formats, and other provider errors are now translated into plain, actionable messages instead of generic errors.
April 2026
NewAPIIntegrations

Generate speech and transcribe audio

Two new endpoints: /v1/audio/speech for text-to-speech and /v1/audio/transcriptions for speech-to-text. Multiple providers including OpenAI and Mistral with automatic fallback.Speech API β†’ Β· Transcription API β†’

Edit images through the gateway

Send image edit requests through /v1/images/edits with the same multi-provider routing and fallback as generation.Image Edits API β†’

Tag traffic by app with analytics headers

Pass HTTP-Referer and X-Title headers to label requests by app or site. Filter your analytics dashboard by these values to see cost and latency per integration.Analytics headers docs β†’

Get alerted before budgets run out

Set dollar thresholds on API keys and receive Slack or Microsoft Teams webhooks when spend crosses them. Configurable trigger percentages give you time to act.Configure alerts β†’

Route Claude Cowork through your org

Use Requesty as the backend for the Claude Cowork desktop assistant. All traffic gets unified analytics, cost controls, and model policies.Set up Claude Cowork β†’

Connect OpenCode with one-liner analytics

Route OpenCode terminal agent traffic through Requesty. A one-line installer adds analytics tracking to your setup.Set up OpenCode β†’

Control guardrail actions per policy

The admin panel now lets you set each guardrail policy to Disabled, Report, or Mask individually. A new violations column and detail tab in logs shows exactly what fired.Guardrails docs β†’

Structured outputs work with the Responses API

JSON Schema and json_object modes are now available on /v1/responses, matching the Chat Completions feature set.Structured outputs docs β†’

Azure EU regions auto-detected

Azure deployments across European regions are now automatically recognized under the EU filter in the Model Library and routing engine.EU routing docs β†’
March 2026
NewImproved

Redesigned model management

Provider grouping with expand/collapse, region and capability filters, bulk approve/remove, preset quick-filters, and a β€œNew” tab that surfaces recently released models per provider.Manage approved models β†’

Compare models side by side in the playground

Pick two models, send the same prompt, and see which responds better. The redesigned chat playground also supports image attachments and markdown rendering.

See service account details at a glance

Expandable table rows now show each service account’s API keys, monthly spend, and creator at a glance.Service accounts docs β†’

Send PDFs in your requests

The gateway extracts and formats PDF content across providers that support document input. Just include the file in your chat completions request.PDF support docs β†’

Pick a use case, skip the model selection

Dedicated model aliases like coding/ select the right model, provider, and parameters for your workload automatically.Dedicated models docs β†’

Google Gemini Embedding 2.0 support

Generate embeddings with Google’s latest Gemini Embedding 2.0 model through Requesty, with automatic provider selection.
February 2026
NewImprovedAnalytics

Deeper analytics with percentiles and pivot tables

The analytics dashboard now supports P95 and P99 latency percentiles, pivot tables for multi-dimensional breakdowns, and flexible time ranges including This Week, Month, Quarter, and Year.Usage analytics β†’ Β· Performance monitoring β†’

See which models support tool calling

The models list now shows which models support tool calling. Use this to filter models by capability before routing or to build smarter model selection.

Connect OpenClaw agents

Route OpenClaw autonomous agent workloads through Requesty for unified analytics and cost controls.Set up OpenClaw β†’

Restrict models per group

Groups can now have their own approved model list, independent of the org-wide setting. Regional model approval handles providers with location-specific deployments correctly.Groups docs β†’

Spending alerts with Slack webhooks

Set dollar thresholds on your organization and receive Slack notifications when spend crosses them.Alerts docs β†’

Polished API keys table

The API keys table is easier to scan with cleaner columns, hover tooltips for long values, and one-click copy. The same improved layout appears in both admin and user views.
January 2026
NewImprovedAnalytics

Compare two requests side by side

Select any two requests in the logs table and open a JSON diff viewer. Added, modified, and removed fields are highlighted with one-click filtering.

Filter traces by key, user, or ID

Filter the traces page by trace ID, API key name, or user email. Cached percentage is now visible per trace.Session reconstruction docs β†’

Manage group budgets inline

Groups now show spend percentage and budget overrides directly in the table. Admins can adjust limits without navigating away.Groups docs β†’

Update member roles from the dashboard

Org admins can change member roles at both the organization and group level. Safety checks prevent admins from accidentally demoting themselves.Users and roles docs β†’

Reasoning tokens visible in logs

A new column shows how many reasoning tokens each request consumed, giving visibility into model β€œthinking” costs.

Filter analytics by API key label

Scope cost, latency, and usage breakdowns to specific API key labels for more targeted reporting.Usage analytics β†’
Last modified on May 15, 2026