Skip to main content

Documentation Index

Fetch the complete documentation index at: https://docs.wafer.ai/llms.txt

Use this file to discover all available pages before exploring further.

Use these endpoint-scoped APIs to:
  • list accessible endpoints
  • read JSON metrics
  • inspect recent requests
  • look up a request by x-request-id

What You Need

  • <API_KEY>
  • <ENDPOINT_HOST>

Base URLs

SurfaceBase URL
Inferencehttps://<ENDPOINT_HOST>/v1
Metrics and request inspectionhttps://api.wafer.ai/v1/endpoints
The current customer observability surface is JSON. A Prometheus export surface is not documented yet.

List Endpoints

curl -s "https://api.wafer.ai/v1/endpoints" \
  -H "Authorization: Bearer <API_KEY>"
{
  "endpoints": [
    {
      "endpoint": "<ENDPOINT_HOST>",
      "inference_engine": "<INFERENCE_ENGINE>"
    }
  ]
}

Routes

  • GET /v1/endpoints
  • GET /v1/endpoints/metrics?endpoint=<ENDPOINT_HOST>&range_minutes=<RANGE_MINUTES>
  • GET /v1/endpoints/requests?endpoint=<ENDPOINT_HOST>&limit=<LIMIT>&errors_only=<true|false>
  • GET /v1/endpoints/requests/<REQUEST_ID>?endpoint=<ENDPOINT_HOST>

Current Limits

  • GET /v1/endpoints: 20/minute
  • GET /v1/endpoints/metrics: 20/minute
  • GET /v1/endpoints/requests: 30/minute
  • GET /v1/endpoints/requests/{request_id}: 30/minute

Notes

  • The endpoint query parameter is required on endpoint-scoped routes.
  • A single API key may be authorized for more than one dedicated endpoint.
  • Use x-request-id to look up a single request.