Profiling
Create profiling data using wafer-cli on a machine with ROCm installed.
Requirements
- ROCm 7.0+ with rocprof-compute 3.2+
- AMD GPU with ROCm support (MI300X, MI250, MI210, MI100)
- HIP application to profile
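As a quick check of the GPU requirement, the sketch below lists the architectures that rocminfo reports and maps them to the models named above. The helper name `gpu_model_for_arch` is hypothetical, and the mapping covers only the four listed models (gfx942 for MI300X, gfx90a for MI250 and MI210, gfx908 for MI100); anything else is reported as unlisted, which does not by itself mean unsupported.

```shell
#!/usr/bin/env bash
# Sketch only: gpu_model_for_arch is a hypothetical helper, not part of wafer-cli.

# Map a gfx architecture name to the GPU models listed in the requirements.
gpu_model_for_arch() {
  case "$1" in
    gfx942) echo "MI300X" ;;
    gfx90a) echo "MI250/MI210" ;;
    gfx908) echo "MI100" ;;
    *)      echo "unlisted" ;;
  esac
}

# Print each architecture rocminfo reports, if rocminfo is installed.
if command -v rocminfo >/dev/null 2>&1; then
  for arch in $(rocminfo | grep -o 'gfx[0-9a-f]*' | sort -u); do
    echo "$arch: $(gpu_model_for_arch "$arch")"
  done
else
  echo "rocminfo not found; is ROCm installed?" >&2
fi
```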
Profile an Application
Profiling with --name my_kernel creates a workloads/my_kernel/ directory with profiling data.
Analyze Results
Launch GUI
Command Options
Profile Options
| Option | Description |
|---|---|
| --name NAME | Workload directory name (required) |
| --no-roof | Skip roofline analysis (faster) |
| --roof-only | Roofline only, no detailed counters (fastest) |
| --block SQ,TCC,... | Profile specific hardware blocks |
| --kernel K1,K2,... | Filter by kernel name |
| --dispatch 0,1,2,... | Filter by dispatch ID |
| --path DIR | Output directory (default: workloads/) |
Analyze Options
| Option | Description |
|---|---|
| --gui | Launch interactive GUI |
| --port PORT | GUI server port (default: 8050) |
| --json | Output as JSON |
| --list-stats | List all kernels and dispatches |
Examples
Output Structure
Troubleshooting
rocprof-compute not found
Ensure ROCm is installed and rocprof-compute is on your PATH.
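One way to run this check is sketched below. It assumes ROCm is installed under the default /opt/rocm prefix (override with ROCM_PATH if yours differs); `ensure_on_path` is a hypothetical helper that prepends a directory to PATH only when the tool is not already found.

```shell
#!/usr/bin/env bash
# ASSUMPTION: ROCm lives under /opt/rocm unless ROCM_PATH says otherwise.
ROCM_PATH="${ROCM_PATH:-/opt/rocm}"

# ensure_on_path TOOL DIR
# Prepend DIR to PATH if TOOL is not found; succeed only if TOOL is then found.
ensure_on_path() {
  if ! command -v "$1" >/dev/null 2>&1; then
    PATH="$2:$PATH"
    export PATH
  fi
  command -v "$1" >/dev/null 2>&1
}

if ensure_on_path rocprof-compute "$ROCM_PATH/bin"; then
  rocprof-compute --version
else
  echo "rocprof-compute not found under $ROCM_PATH/bin; check the ROCm installation" >&2
fi
```

The PATH change lasts only for the current shell session; add the export to your shell profile to make it permanent.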
No profiling data generated
Check that your application actually launches GPU kernels. Try running rocprof-compute directly to see verbose output.
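A minimal sketch of that check follows. Both helper names are hypothetical, and my_kernel and ./my_app are placeholders for your own workload name and application. The first helper tests whether a workload directory contains any files; the second calls rocprof-compute profile directly, bypassing wafer-cli, so its output appears in the terminal.

```shell
#!/usr/bin/env bash
# Sketch only: helper names, "my_kernel", and ./my_app are placeholders.

# workload_has_data DIR
# Succeeds if DIR exists and contains at least one regular file at any depth.
workload_has_data() {
  [ -d "$1" ] && [ -n "$(find "$1" -type f 2>/dev/null | head -n 1)" ]
}

# profile_directly NAME CMD [ARGS...]
# Runs the profiler itself, without wafer-cli in between.
profile_directly() {
  local name="$1"
  shift
  rocprof-compute profile --name "$name" -- "$@"
}
```

For example, `workload_has_data workloads/my_kernel || profile_directly my_kernel ./my_app` re-runs the profiler only when the directory is missing or empty.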
0 kernels in analysis
Kernel summaries are generated by the analyze step, so run analyze on the workload first. The Wafer analyzer falls back to timing data if summaries aren't available.
GUI doesn't open
Check whether port 8050 is already in use. If it is, pass a different port with --port.
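The port check can be sketched in Bash as below. Both function names are hypothetical. The probe uses Bash's /dev/tcp redirection and only tests 127.0.0.1, so a server bound to another interface would not be detected.

```shell
#!/usr/bin/env bash
# Sketch only: port_in_use and first_free_port are hypothetical helpers.

# port_in_use PORT
# Succeeds if something accepts TCP connections on 127.0.0.1:PORT.
port_in_use() {
  (exec 3<>"/dev/tcp/127.0.0.1/$1") 2>/dev/null
}

# first_free_port START
# Prints the first port >= START that refuses connections on localhost.
first_free_port() {
  local port="$1"
  while [ "$port" -lt 65535 ] && port_in_use "$port"; do
    port=$((port + 1))
  done
  echo "$port"
}

echo "First free port from 8050: $(first_free_port 8050)"
```

Pass the printed value to --port when launching the GUI.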