Actions: ROCm/triton

AMD Perf Kernel Post-Merge Tests

50 workflow runs

Use numpy 1.26 in CI
AMD Perf Kernel Post-Merge Tests #51: Commit 16ce746 pushed by vgokhale
December 23, 2024 22:37 1h 32m 0s main_perf
stream-k v0.4 (#689)
AMD Perf Kernel Post-Merge Tests #50: Commit 80626a9 pushed by xiaohuguo2023
December 23, 2024 15:19 29m 7s main_perf
Add gemm tuning configs for weekly tuning CI (#662)
AMD Perf Kernel Post-Merge Tests #49: Commit 4a7afd2 pushed by AlexAUT
December 20, 2024 08:38 49s main_perf
Merge pull request #670 from ROCm/jukorhon/atomic-counter
AMD Perf Kernel Post-Merge Tests #48: Commit d31692c pushed by juuso-oskari
December 20, 2024 08:36 37s main_perf
enable stream pipeline for persistent rmsnorm kernel (#686)
AMD Perf Kernel Post-Merge Tests #47: Commit e1245da pushed by xiaohuguo2023
December 19, 2024 17:06 53m 17s main_perf
Load scales instead of constexpr (#684)
AMD Perf Kernel Post-Merge Tests #46: Commit c086d08 pushed by vgokhale
December 18, 2024 17:31 2h 26m 28s main_perf
Merge pull request #669 from ROCm/tianxing/FA-int8
AMD Perf Kernel Post-Merge Tests #45: Commit cd6f51b pushed by Chi-Chu319
December 18, 2024 11:52 54m 4s main_perf
implement persistent loop based rmsnorm kernel (#676)
AMD Perf Kernel Post-Merge Tests #44: Commit 9cdcf1d pushed by xiaohuguo2023
December 16, 2024 17:15 43m 44s main_perf
Query lds size for tune_gemm and tune_streamk (#680)
AMD Perf Kernel Post-Merge Tests #43: Commit 40a9963 pushed by AlexAUT
December 12, 2024 09:14 1h 31m 23s main_perf
Update CI to use pytorch:latest (#679)
AMD Perf Kernel Post-Merge Tests #42: Commit 83bd6b0 pushed by vgokhale
December 11, 2024 15:57 2h 24m 54s main_perf
Add scaling support for 8-bit types
AMD Perf Kernel Post-Merge Tests #41: Commit c4d9d9d pushed by vgokhale
December 10, 2024 17:02 55m 29s main_perf
add blocked version to address performance issue of when N is large (…
AMD Perf Kernel Post-Merge Tests #40: Commit 736071f pushed by xiaohuguo2023
December 6, 2024 22:18 1h 11m 35s main_perf
use gfx942 (#673)
AMD Perf Kernel Post-Merge Tests #39: Commit 27a1b5b pushed by micmelesse
December 5, 2024 15:22 1h 9m 47s main_perf
Streamk v0.3 (#660)
AMD Perf Kernel Post-Merge Tests #38: Commit 5e49eae pushed by zhanglx13
December 4, 2024 20:37 40m 19s main_perf
rmsnorm optimization for M = 1 (#668)
AMD Perf Kernel Post-Merge Tests #37: Commit fc558e7 pushed by vgokhale
December 2, 2024 20:39 40m 20s main_perf
Merge pull request #649 from ROCm/ravil/main_perf
AMD Perf Kernel Post-Merge Tests #36: Commit 6e7ad94 pushed by ravil-mobile
November 26, 2024 17:53 54m 58s main_perf
Test chained dot (FP8 case, shuffle conversion) (#665)
AMD Perf Kernel Post-Merge Tests #35: Commit 94961d9 pushed by zhanglx13
November 21, 2024 06:03 40m 15s main_perf
[tuner] dump outputs to tune_gemm/output (#663)
AMD Perf Kernel Post-Merge Tests #34: Commit db2ca01 pushed by zhanglx13
November 13, 2024 20:22 42m 15s main_perf
[tune_gemm] Update the filter for LDS usage for stream-pipelineV2 (#661)
AMD Perf Kernel Post-Merge Tests #33: Commit 279cfa7 pushed by zhanglx13
November 13, 2024 03:26 39m 22s main_perf
Set num_stage to 2 from 0 in tune_gemm (#658)
AMD Perf Kernel Post-Merge Tests #32: Commit 086312b pushed by zhanglx13
November 6, 2024 15:08 40m 21s main_perf
Cleanup RMSNorm (#656)
AMD Perf Kernel Post-Merge Tests #31: Commit 7c07f4a pushed by vgokhale
November 5, 2024 16:00 40m 28s main_perf
Merge pull request #655 from ROCm/fixPipelineNumStages
AMD Perf Kernel Post-Merge Tests #30: Commit 1fe4e73 pushed by sjw36
November 1, 2024 15:30 39m 59s main_perf
add stream-k v0.2 (#652)
AMD Perf Kernel Post-Merge Tests #29: Commit 1d60b05 pushed by xiaohuguo2023
October 31, 2024 19:45 40m 4s main_perf
Update num_stages to 2 from 0 in perf regression test (#653)
AMD Perf Kernel Post-Merge Tests #28: Commit ab7f8f8 pushed by zhanglx13
October 30, 2024 13:23 40m 12s main_perf
REBASE fixes: disable mismatching fwd_bias config
AMD Perf Kernel Post-Merge Tests #27: Commit 628e09b pushed by micmelesse
October 28, 2024 15:11 39m 41s main_perf