ROCprofiler Systems
ROCprofiler Systems (formerly Omnitrace) provides system-wide profiling for AMD GPU applications. It captures CPU-GPU interaction, kernel timelines, memory transfers, and system events, similar to NVIDIA Nsight Systems.
Quick Start
Commands
wafer amd rocprof-systems check
Verify the installation.
wafer amd rocprof-systems run
Profile an application:
| Option | Description |
|---|---|
| --output, -o | Output directory |
| --trace | Trace types: hip, hsa, marker, kernel |
| --profile | Profile types: cpu, gpu, memory |
| --duration | Maximum profiling duration |
wafer amd rocprof-systems analyze
Analyze profiling output:
| Option | Description |
|---|---|
| --summary | Show summary statistics |
| --kernels | List kernel timings |
| --transfers | Memory transfer analysis |
| --json | Output as JSON |
wafer amd rocprof-systems sample
Sample-based CPU profiling with GPU correlation:
| Option | Description |
|---|---|
| --frequency | Sampling frequency in Hz (default: 1000) |
| --output, -o | Output directory |
wafer amd rocprof-systems instrument
Binary instrumentation for detailed tracing:
| Option | Description |
|---|---|
| --functions | Functions to instrument |
| --modules | Modules to instrument |
wafer amd rocprof-systems query
Query profiling data:
| Option | Description |
|---|---|
| --sql | SQL query for trace data |
| --kernel | Filter by kernel name |
What Gets Captured
ROCprofiler Systems traces:
- HIP API calls: hipMemcpy, hipLaunchKernel, etc.
- HSA events: Low-level GPU runtime events
- GPU kernels: Execution times, grid dimensions
- Memory transfers: H2D, D2H, D2D copies
- CPU activity: Thread scheduling, function calls
- Markers: Custom annotations (roctx)
- System events: Context switches, interrupts
Adding Markers
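Markers are emitted through the roctx API. As a rough sketch of what annotation might look like in C++, the helper below wraps `roctxRangePush` and `roctxRangePop` in an RAII class so that every range is closed when its scope ends. The names `ScopedRange`, `run_step`, and the `USE_ROCTX` macro are hypothetical and exist only for this illustration. The header path `<roctracer/roctx.h>` is an assumption that depends on your ROCm release. Without `USE_ROCTX` the roctx calls are replaced by no-op stand-ins, so the file compiles on machines without ROCm.

```cpp
// Sketch only: scoped roctx ranges via RAII.
// Build with -DUSE_ROCTX and link the roctx library to emit real markers;
// without that flag the calls below are no-op stand-ins.
#include <cassert>
#include <vector>

#ifdef USE_ROCTX
#include <roctracer/roctx.h>  // assumed header location; varies by ROCm release
#else
// Stand-ins with the same shape as the roctx calls used here.
inline int roctxRangePush(const char*) { return 0; }
inline int roctxRangePop() { return 0; }
inline void roctxMark(const char*) {}
#endif

// Opens a named range on construction and closes it on scope exit,
// so early returns and exceptions cannot leave a range unbalanced.
class ScopedRange {
public:
    explicit ScopedRange(const char* name) { roctxRangePush(name); }
    ~ScopedRange() { roctxRangePop(); }
    ScopedRange(const ScopedRange&) = delete;
    ScopedRange& operator=(const ScopedRange&) = delete;
};

// Hypothetical CPU-only workload standing in for HIP copies and kernels.
double run_step(std::vector<double>& data) {
    assert(!data.empty());
    ScopedRange step("step");  // outer range covering the whole step
    {
        ScopedRange scale("scale");  // nested range for the first phase
        for (double& x : data) x *= 2.0;
    }
    double sum = 0.0;
    {
        ScopedRange reduce("reduce");  // nested range for the second phase
        for (double x : data) sum += x;
    }
    roctxMark("step finished");  // instantaneous marker, not a range
    return sum;
}
```

Calling `run_step` from your own `main` produces one outer range with two nested ranges and a single point marker. In a real application the loop bodies would be replaced by HIP calls, and the ranges should then appear alongside them when marker tracing is enabled.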
Annotate the phases of your code that matter with roctx markers so they are easy to find in the timeline.
Output Formats
ROCprofiler Systems generates multiple output formats:
| Format | File | Use |
|---|---|---|
| Perfetto | .proto | Chrome trace viewer |
| JSON | .json | Custom analysis |
| Text | .txt | Quick summary |
Comparison with Nsys
| Feature | ROCprof Systems | Nsys |
|---|---|---|
| GPU Runtime | HIP/HSA | CUDA |
| CPU Profiling | Yes | Yes |
| Markers | roctx | NVTX |
| Output | Perfetto/JSON | nsys-rep |
| Timeline View | Perfetto UI | Nsys UI |
Troubleshooting
rocprof-sys not found
Install ROCprofiler Systems from the ROCm packages, or build it from source.
No GPU events captured
Ensure HIP tracing is enabled by including hip among the --trace types passed to the run command.
Output too large
Limit the trace scope with a narrower set of --trace types, or cap the run length with --duration.