Inference API Spec
OpenAPI spec for inference endpoints (chat, embeddings, images, audio, models).
What’s included
The Inference API spec covers all endpoints onrouter.requesty.ai:
| Endpoint | Method | Description |
|---|---|---|
/v1/chat/completions | POST | Create chat completion |
/v1/messages | POST | Create message (Anthropic format) |
/v1/responses | POST | Create response (Responses API) |
/v1/embeddings | POST | Create embedding |
/v1/audio/speech | POST | Create speech (TTS) |
/v1/audio/transcriptions | POST | Create transcription (STT) |
/v1/images/generations | POST | Create image |
/v1/images/edits | POST | Edit image |
/v1/models | GET | List models |
Web Search in the spec
The spec documents web search support across all three chat endpoints:- Chat Completions (
Toolschema):"type": "web_search"alongside"function" - Responses API (
ResponsesToolschema):"type": "web_search"alongside"function" - Messages API (
AnthropicToolschema):"type": "web_search_20250305"withnameandmax_uses
Usage examples
Import the spec into popular tools:- Postman: Import > Link > paste the spec URL
- Insomnia: Application > Import > From URL
- OpenAPI Generator:
openapi-generator-cli generate -i <spec-url> -g <language> - Swagger UI: Paste the spec URL into the explore bar