0
I Use This!
Very High Activity

Commits : Listings

Analyzed about 4 hours ago. based on code collected about 5 hours ago.
Jun 19, 2024 — Jun 19, 2025
Commit Message Contributor Files Modified Lines Added Lines Removed Code Location Date
[AMD] Support [b]f16 <-> OCP [b]f8 conversions on gfx950 (#6185) More... 3 months ago
[AMD] Update the GPU product names (#6193) More... 3 months ago
Fixed `grid` Lambda Function in `LayerNorm` and type anotation of `broadcast_impl_shape` and `get_block_shapes` (#6189) More... 3 months ago
[NVIDIA] Propagate TMA encoding from MMA operand (#6045) More... 3 months ago
[AMD] Allow tl.assume in the main loop body when pipelining (#6162) More... 3 months ago
Disable multithreaded MLIR compilation with MLIR_DISABLE_MULTITHREADING (#6184) More... 3 months ago
[AMD] Enable LDS transpose load for fp8 types on gfx950 (#6159) More... 3 months ago
[BACKEND] Replacing IRRewriterBase with OpBuilder in couple of places (#6187) More... 3 months ago
Update ptxas to version 12.8.93 (#6149) More... 3 months ago
[AMD] Fix one cluster pingpong with extra ops after local load (#6183) More... 3 months ago
[AMD] Disable loop pipelining when there is assert or print (#6180) More... 3 months ago
[TritonGPU] Add `ttng.arrive_barrier` (#6174) More... 3 months ago
[TritonGPU] Fix extra space in local_alloc asm format (NFC) (#6173) More... 3 months ago
[BACKEND] Fix clang warnings (#6163) More... 3 months ago
[LAYOUTS] Kill getThreadsPerWarp & ...WithUniqueData (#6170) More... 3 months ago
[LAYOUTS] Cache LinearEncoding creation (#6169) More... 3 months ago
[AMD] Error out if exceeding max threads per block (#6125) More... 3 months ago
[OPTIMIZER] Customize LICM to hoist loads from loops (#6051) More... 3 months ago
[BACKEND] Bump to llvm/llvm-project@2619c2ed58 (#6148) More... 3 months ago
[PIPELINE] AssignLatencies for mmav5 (#6077) More... 3 months ago
Annotations for elementary dtypes and pointers (#6152) More... 3 months ago
[AMD] Use pointee type for buffer op alignment in AxisAnalysis (#6145) More... 3 months ago
[AMD] Enable subview for amd rotating shared attribute (#6160) More... 3 months ago
[AMD] Support ConvertLayout in CanonicalizePointers (#6142) More... 3 months ago
[LAYOUTS] Get warp number and thread number from Module (#6068) More... 3 months ago
[AMD] Test transposed B for scaled dot fp8/bf8 types (#6078) More... 3 months ago
[PROTON] Get the context depth of profiling sessions (#6158) More... 3 months ago
[AMD] Only consider memory ops feeding to dot in ping-pong pass (#6108) More... 3 months ago
[BACKEND] Enable vectorized fp8 cast on Ada GPUs (#6156) More... 3 months ago
Enabling subtiling for Hopper WGMMA (#6130) More... 3 months ago