Kernel Development
Wafer provides a complete workflow for GPU kernel development—from understanding the baseline, through writing and testing custom kernels, to optimizing performance with AI assistance.Workflow
Tools
Evaluate
Test kernel correctness and benchmark performance.
Baseline
Discover what kernels PyTorch dispatches.
Roofline
Analyze performance against hardware limits.
Corpus
Download GPU documentation for reference.
Kernel Formats
Wafer supports two kernel formats:GPUMode Format
Simple function-based format:KernelBench Format
Class-based format compatible with KernelBench:AI-Assisted Development
Use the Wafer agent throughout your workflow:Quick Example
Complete workflow for optimizing a matrix multiplication:Remote Execution
Run on cloud GPUs or your own hardware:Next Steps
Evaluate Kernels
Start testing your kernels.
AI Agent
Get AI assistance.
NVIDIA Profiling
Profile NVIDIA kernels.
AMD Profiling
Profile AMD kernels.