Skip to main content

Documentation Index

Fetch the complete documentation index at: https://docs.wafer.ai/llms.txt

Use this file to discover all available pages before exploring further.

Wafer Serverless uses the same hosted inference endpoint as Wafer Pass, but bills per token instead of using a Pass request window. Use Serverless when you want pay-as-you-go access, top-up/prepaid billing, or access to models that are not included in a flat Wafer Pass plan.

Connection Details

OpenAI-compatible endpointhttps://pass.wafer.ai/v1
Anthropic-compatible endpointhttps://pass.wafer.ai/v1/messages
Send your API key asAuthorization: Bearer <key>
Request-scoped ZDRAdd Wafer-ZDR: required on direct API calls

API Capabilities

Serverless supports request-scoped privacy and advanced completion controls:

Models

Call GET https://pass.wafer.ai/v1/models to see the currently public Serverless catalog. Each model card includes whether the model supports ZDR.
curl -sS "https://pass.wafer.ai/v1/models" \
  -H "Authorization: Bearer <YOUR_WAFER_API_KEY>"
The public catalog only lists models that are ready for general use. Some routed models may be available by allowlist before they appear in the public catalog.