This repository has been archived by the owner on Jan 26, 2024. It is now read-only.
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
SWDEV-362046 - Report HIP_OPS activities using the ROCr driver_node_i…
…d instead of the device's index The ROCclr assigns zero-based IDs to GPUs in the order they are discovered. That zero-based ID is what is used to identify the GPU on which the HIP_OPS activity took place. When multiple ranks are used, each rank's first logical device always has GPU ID 0, regardless of which physical device is selected with CUDA_VISIBLE_DEVICES. Because of this, when merging trace files from multiple ranks, GPU IDs from different processes may overlap. The long term solution is to use the KFD's gpu_id which is stable across APIs and processes. Unfortunately the gpu_id is not yet exposed by the ROCr, so for now use the driver's node id. Change-Id: Ib78854527d600d175bb76e2df0747c33f898c615 (cherry picked from commit 7de8e6b)
- Loading branch information