Releases: ROCm/rocWMMA
Releases · ROCm/rocWMMA
rocWMMA 1.6.0 for ROCm 6.3.1
rocWMMA code for ROCm 6.3.1 did not change. The library was rebuilt for the updated ROCm 6.3.1 stack.
rocWMMA 1.6.0 for ROCm 6.3.0
Added
- Added OCP f8/bf8 datatype support
- Added support for gfx12 architecture targets
Changed
- Optimized some aos<->soa transforms with half-rotation offsets
- Refactored the rocBLAS reference entry point for validation and benchmarking
- ROCWMMA_* preprocessor configurations are now all assigned values
- Updated the default architecture targets for ASAN builds
- Updated the actor-critic implementation
Resolved issues
- Fixed a bug in f64 validation due to faulty typecasting
- Fixed a bug causing runtime compilation errors with hipRTC
- Various documentation updates and fixes
rocWMMA 1.5.0 for ROCm 6.2.4
rocWMMA code for ROCm 6.2.4 did not change. The library was rebuilt for the updated ROCm 6.2.4 stack.
rocWMMA 1.5.0 for ROCm 6.2.2
rocWMMA code for ROCm 6.2.2 did not change. The library was rebuilt for the updated ROCm 6.2.2 stack.
rocWMMA 1.5.0 for ROCm 6.2.1
rocWMMA code for ROCm 6.2.1 did not change. The library was rebuilt for the updated ROCm 6.2.1 stack.
rocWMMA 1.5.0 for ROCm 6.2.0
Additions
- Added internal utilities for element-wise vector transforms
- Added internal utilities for cross-lane vector transforms
- Implemented internal aos<->soa transforms for block sizes of 16, 32, 64, 128 and 256 and vector widths of 2, 4, 8 and 16
- Added tests for new internal transforms
Changes
- Improved loading layouts by increasing vector width for fragments with blockDim > 32
- API applyDataLayout transform now accepts WaveCount template argument for cooperative fragments
- API applyDataLayout transform now physically applies aos<->soa transform as necessary
- Refactored entry-point of std library usage to improve hipRTC support
- Documentation updates for installation, programmer's guide and API reference
Fixes
- Fixed some header includes ordering to improve portability
rocWMMA 1.4.0 for ROCm 6.1.2
rocWMMA code for ROCm 6.1.2 did not change. The library was rebuilt for the updated ROCm 6.1.2 stack.
rocWMMA 1.4.0 for ROCm 6.1.1
rocWMMA code for ROCm 6.1.1 did not change. The library was rebuilt for the updated ROCm 6.1.1 stack.
rocWMMA 1.4.0 for ROCm 6.1.0
Additions
- Added bf16 support for hipRTC sample
Changes
- Changed Clang C++ version to C++17
- Updated rocwmma_coop API
- Linked rocWMMA to hiprtc
Fixes
- Fixed compile/runtime arch checks
- Built all test in large code model
- Removed inefficient branching in layout loop unrolling
rocWMMA 1.3.0 for ROCm 6.0.2
rocWMMA code for ROCm 6.0.2 did not change. The library was rebuilt for the updated ROCm 6.0.2 stack.