Wafer Serverless uses the same hosted inference endpoint as Wafer Pass, but bills per token instead of using a Pass request window. Use Serverless when you want pay-as-you-go access, top-up/prepaid billing, or access to models that are not included in a flat Wafer Pass plan.Documentation Index
Fetch the complete documentation index at: https://docs.wafer.ai/llms.txt
Use this file to discover all available pages before exploring further.
Connection Details
| OpenAI-compatible endpoint | https://pass.wafer.ai/v1 |
| Anthropic-compatible endpoint | https://pass.wafer.ai/v1/messages |
| Send your API key as | Authorization: Bearer <key> |
| Request-scoped ZDR | Add Wafer-ZDR: required on direct API calls |
API Capabilities
Serverless supports request-scoped privacy and advanced completion controls:Models
CallGET https://pass.wafer.ai/v1/models to see the currently public Serverless catalog. Each model card includes whether the model supports ZDR.