
Get your Wafer Pass: https://www.wafer.ai/pass
Wafer Pass is in early access. We’re onboarding developers in small batches. Features, availability, and pricing are subject to change.
Connection Details
Use the credentials from your Wafer access email with these values:| OpenAI-compatible endpoint | https://pass.wafer.ai/v1 |
| Anthropic-compatible endpoint | https://pass.wafer.ai/v1/messages |
| Model | Qwen3.5-397B-A17B |
| Authentication | API key |
| Concurrency | 1 inflight request per user |
Claude Code uses the Anthropic Messages endpoint. Set
ANTHROPIC_BASE_URL=https://pass.wafer.ai and ANTHROPIC_API_KEY to your Wafer key — Claude Code will hit /v1/messages automatically and Wafer routes all requests to the fastest available model regardless of what model name Claude Code sends. All other harnesses (OpenClaw, Cline, Roo Code, etc.) use the OpenAI-compatible endpoint at https://pass.wafer.ai/v1.What’s Included
With an active Wafer Pass subscription you get:- Qwen3.5-397B-A17B requests with zero per-token costs (optimized by Wafer Inference Engine, 128K context window)
- Access through a standard OpenAI-compatible API and an Anthropic-compatible Messages API using your Wafer API key
- Works with Claude Code, OpenClaw, Hermes Agent, Cline, Roo Code, Kilo Code, OpenHands, and other agent harnesses
- 1 concurrent request today, with higher inflight limits coming soon
- New fast models as we release them — same subscription, no price increase
Pricing
| Plan | Price | Requests / 5hr window | Overage (input) | Overage (output) |
|---|---|---|---|---|
| Starter | $40/mo | 1,000 | $0.60/M tokens | $4.00/M tokens |
| Pro | $100/mo | 5,000 | $0.40/M tokens | $2.60/M tokens |
| Max | $250/mo | 20,000 | $0.30/M tokens | $2.00/M tokens |
Getting Started
Apply for access
Go to wafer.ai/pass and pick your plan. We’re onboarding in small batches and will notify you when your spot opens.
Receive your access email
Once you’re approved, we’ll send you your Wafer endpoint, model ID, and API key.
Set Up Claude Code
Wafer exposes an Anthropic-compatible Messages endpoint athttps://pass.wafer.ai/v1/messages, so Claude Code can connect directly — no proxy needed.
Configure Wafer as the endpoint
Set these environment variables in your shell profile (Or add them to Replace
~/.zshrc, ~/.bashrc, etc.):~/.claude/settings.json for a persistent, per-user config:YOUR_WAFER_API_KEY with the key from your Wafer access email.Set Up OpenClaw
Set Up Hermes Agent
Set Up Cline
Install Cline
Install the Cline extension from the VS Code marketplace, or search “Cline” in VS Code Extensions.
Configure Wafer as a provider
- Open VS Code and click the Cline icon in the sidebar
- Click the settings gear icon in the Cline panel
- In the API Provider dropdown, select OpenAI Compatible
- Fill in these fields:
- Base URL:
https://pass.wafer.ai/v1 - API Key: your Wafer API key
- Model ID:
Qwen3.5-397B-A17B
Set model info (recommended)
Expand Model Configuration and set:
- Context Window Size:
131072 - Max Output Tokens:
32768 - Supports Images: unchecked
Set Up Roo Code
Install Roo Code
Install the Roo Code extension from the VS Code marketplace, or search “Roo Code” in VS Code Extensions.
Configure Wafer as a provider
- Open VS Code and click the Roo Code icon in the sidebar
- Click the settings gear icon in the Roo Code panel
- In the API Provider dropdown, select OpenAI Compatible
- Fill in these fields:
- Base URL:
https://pass.wafer.ai/v1 - API Key: your Wafer API key
- Model ID:
Qwen3.5-397B-A17B
Set model info (recommended)
Optionally configure:
- Context Window Size:
131072 - Max Output Tokens:
32768
Set Up Kilo Code
Install Kilo Code
Install the Kilo Code extension from the VS Code marketplace, or search “Kilo Code” in VS Code Extensions.
Configure Wafer as a provider
- Open Kilo Code and click the settings gear icon
- Go to the Providers tab
- Click Custom provider at the bottom
- Fill in the dialog:
- Provider ID:
wafer - Display Name:
Wafer - Base URL:
https://pass.wafer.ai/v1 - API Key: your Wafer API key
- Model:
Qwen3.5-397B-A17B
- Click Save
Set Up OpenHands
Install OpenHands
Follow the OpenHands installation guide. The quickest way:
Configure Wafer as the LLM (UI)
- Open the OpenHands UI (usually at
http://localhost:3000) - Click the settings gear icon
- Click Advanced to expand advanced options
- Set these fields:
- Custom Model:
openai/Qwen3.5-397B-A17B - Base URL:
https://pass.wafer.ai/v1 - API Key: your Wafer API key
Alternative: config.toml
If you prefer file-based config, create or edit
config.toml in the project root:Use Wafer with Other Harnesses
Most agent harnesses only need these settings: OpenAI-compatible harnesses (Cline, Roo Code, Kilo Code, OpenClaw, OpenHands, etc.):- Base URL:
https://pass.wafer.ai/v1 - Model:
Qwen3.5-397B-A17B - Authentication: your Wafer API key
- Compatibility mode: OpenAI-compatible / OpenAI API
- Base URL:
https://pass.wafer.ai(the tool appends/v1/messagesautomatically) - Authentication: your Wafer API key via
ANTHROPIC_API_KEY - Model: no override needed — Wafer routes all requests to the fastest available model
Wafer. If it asks whether your key is a bearer token or an API key, use the same Wafer key from your access email.
FAQ
What model do I get?
What model do I get?
Today Wafer Pass gives you access to
Qwen3.5-397B-A17B through the Wafer endpoint. More fast models are coming.Can I use Wafer Pass with any model?
Can I use Wafer Pass with any model?
Wafer Pass currently covers
Qwen3.5-397B-A17B only. We’re adding more models soon — same subscription, no price increase.Can I share my subscription?
Can I share my subscription?
How do I get access?
How do I get access?
Apply at wafer.ai/unlimited. We’re onboarding in small batches.
Do I need a special model ID?
Do I need a special model ID?
Yes. Use
Qwen3.5-397B-A17B with the https://pass.wafer.ai/v1 endpoint.How many requests can I run at once?
How many requests can I run at once?
Today each user gets 1 concurrent request. We expect to raise that limit over time.
Will more models be added?
Will more models be added?
Yes. We’re optimizing the best coding models and adding them to the plan. Price stays the same.