AMD Profiling
Wafer integrates AMD’s profiling tools to help you analyze and optimize GPU performance on AMD hardware. From ISA analysis to system-wide profiling, these tools provide comprehensive visibility into your ROCm applications.Available Tools
ISA Analysis
Analyze GPU assembly, extract metrics, and understand instruction-level behavior.
ROCprofiler Compute
Kernel-level profiling with hardware counters, similar to NVIDIA NCU.
ROCprofiler SDK
Low-level profiling API for custom instrumentation.
ROCprofiler Systems
System-wide profiling for application analysis, similar to nsys.
Choosing a Tool
| Tool | Best For | Granularity |
|---|---|---|
| ISA Analysis | Assembly optimization, register analysis | Instruction-level |
| ROCprof Compute | Kernel metrics, roofline analysis | Per-kernel |
| ROCprof SDK | Custom profiling, counter collection | Flexible |
| ROCprof Systems | Timeline analysis, API tracing | System-wide |
Quick Commands
Analyze ISA:Requirements
| Tool | Requirement |
|---|---|
| ISA Analysis | ROCm installed, or Wafer server analysis |
| ROCprof Compute | ROCm with rocprofiler-compute |
| ROCprof SDK | ROCm with rocprofiler-sdk |
| ROCprof Systems | ROCm with rocprofiler-systems |
ISA analysis can run server-side without local AMD hardware. Upload your
.co, .s, .ll, or .ttgir files.Supported Hardware
Wafer supports profiling on:- MI300X — AMD Instinct data center GPU
- MI250X — AMD Instinct with CDNA 2 architecture
- MI210 — AMD Instinct for HPC
- MI100 — First-generation CDNA
- RX 7900 — RDNA 3 consumer GPUs
Typical Workflow
AMD vs NVIDIA Tool Mapping
| Purpose | AMD Tool | NVIDIA Tool |
|---|---|---|
| Kernel metrics | ROCprof Compute | NCU |
| System profiling | ROCprof Systems | Nsys |
| Assembly analysis | ISA Analysis | Nsight Compute SASS view |
| Counter collection | ROCprof SDK | CUPTI |