Skip to content

Releases: ROCm/rocWMMA

rocWMMA 1.6.0 for ROCm 6.3.1

20 Dec 16:13
ba38cf3
Compare
Choose a tag to compare

rocWMMA code for ROCm 6.3.1 did not change. The library was rebuilt for the updated ROCm 6.3.1 stack.

rocWMMA 1.6.0 for ROCm 6.3.0

03 Dec 19:49
ba38cf3
Compare
Choose a tag to compare

Added

  • Added OCP f8/bf8 datatype support
  • Added support for gfx12 architecture targets

Changed

  • Optimized some aos<->soa transforms with half-rotation offsets
  • Refactored the rocBLAS reference entry point for validation and benchmarking
  • ROCWMMA_* preprocessor configurations are now all assigned values
  • Updated the default architecture targets for ASAN builds
  • Updated the actor-critic implementation

Resolved issues

  • Fixed a bug in f64 validation due to faulty typecasting
  • Fixed a bug causing runtime compilation errors with hipRTC
  • Various documentation updates and fixes

rocWMMA 1.5.0 for ROCm 6.2.4

06 Nov 19:56
677b441
Compare
Choose a tag to compare

rocWMMA code for ROCm 6.2.4 did not change. The library was rebuilt for the updated ROCm 6.2.4 stack.

rocWMMA 1.5.0 for ROCm 6.2.2

27 Sep 16:01
677b441
Compare
Choose a tag to compare

rocWMMA code for ROCm 6.2.2 did not change. The library was rebuilt for the updated ROCm 6.2.2 stack.

rocWMMA 1.5.0 for ROCm 6.2.1

20 Sep 19:58
677b441
Compare
Choose a tag to compare

rocWMMA code for ROCm 6.2.1 did not change. The library was rebuilt for the updated ROCm 6.2.1 stack.

rocWMMA 1.5.0 for ROCm 6.2.0

02 Aug 16:15
677b441
Compare
Choose a tag to compare

Additions

  • Added internal utilities for element-wise vector transforms
  • Added internal utilities for cross-lane vector transforms
  • Implemented internal aos<->soa transforms for block sizes of 16, 32, 64, 128 and 256 and vector widths of 2, 4, 8 and 16
  • Added tests for new internal transforms

Changes

  • Improved loading layouts by increasing vector width for fragments with blockDim > 32
  • API applyDataLayout transform now accepts WaveCount template argument for cooperative fragments
  • API applyDataLayout transform now physically applies aos<->soa transform as necessary
  • Refactored entry-point of std library usage to improve hipRTC support
  • Documentation updates for installation, programmer's guide and API reference

Fixes

  • Fixed some header includes ordering to improve portability

rocWMMA 1.4.0 for ROCm 6.1.2

04 Jun 16:53
7dbd524
Compare
Choose a tag to compare

rocWMMA code for ROCm 6.1.2 did not change. The library was rebuilt for the updated ROCm 6.1.2 stack.

rocWMMA 1.4.0 for ROCm 6.1.1

08 May 18:00
7dbd524
Compare
Choose a tag to compare

rocWMMA code for ROCm 6.1.1 did not change. The library was rebuilt for the updated ROCm 6.1.1 stack.

rocWMMA 1.4.0 for ROCm 6.1.0

16 Apr 19:11
7dbd524
Compare
Choose a tag to compare

Additions

  • Added bf16 support for hipRTC sample

Changes

  • Changed Clang C++ version to C++17
  • Updated rocwmma_coop API
  • Linked rocWMMA to hiprtc

Fixes

  • Fixed compile/runtime arch checks
  • Built all test in large code model
  • Removed inefficient branching in layout loop unrolling

rocWMMA 1.3.0 for ROCm 6.0.2

31 Jan 20:13
4b10c7e
Compare
Choose a tag to compare

rocWMMA code for ROCm 6.0.2 did not change. The library was rebuilt for the updated ROCm 6.0.2 stack.