Skip to content

rocWMMA 0.8 for ROCm 5.3.0

Compare
Choose a tag to compare
@lawruble13 lawruble13 released this 30 Sep 19:27

Added

  • Added runtime checks to disable tests on non-target GPUS
  • Added workgroup aware gemm kernels
  • Added workgroup aware validation and benchmark test suite
  • Added warmup run to existing tests

Changed

  • Refactored lds_mapping_util into gemm global, local mapping, gemm driver, gemm config and scheduling classes
  • Modified resource allocation and tracking of gemm and dlrm buffers
  • Improved low-level data loading patterns
  • Reduced branching on cooperative load and store
  • Updated gemv sample
  • Updated gemm sample