0
I Use This!
Very High Activity

Commits : Listings

Analyzed 1 day ago. based on code collected 1 day ago.
Jun 25, 2024 — Jun 25, 2025
Commit Message Contributor Files Modified Lines Added Lines Removed Code Location Date
CUDA: fix Volta FlashAttention logic (llama/11615) More... 5 months ago
HIP: fix flash_attn_stream_k_fixup warning (llama/11604) More... 5 months ago
CUDA/HIP: add support for selectable warp size to mmv (llama/11519) More... 5 months ago
HIP: add GGML_CUDA_CC_IS_* for amd familys as increasing cc archtectures for amd gpus are not supersets of eatch other (llama/11601) More... 5 months ago
CUDA: use mma PTX instructions for FlashAttention (llama/11583) More... 5 months ago
`ci`: use sccache on windows instead of ccache (llama/11545) More... 5 months ago
HIP: require at least HIP 5.5 More... 5 months ago
HIP: Prepare reduction operators for wave 64 More... 5 months ago
vulkan: implement initial support for IQ2 and IQ3 quantizations (llama/11360) More... 5 months ago
CUDA/HIP: add warp_size to cuda_device_info More... 5 months ago
vulkan: Catch pipeline creation failure and print an error message (llama/11436) More... 5 months ago
HIP: Supress transformation warning in softmax.cu More... 5 months ago
HIP: Only call rocblas_initialize on rocblas versions with the multiple instantation bug (llama/11080) More... 5 months ago
cmake : don't fail on `GGML_CPU=OFF` (llama/11457) More... 5 months ago
SYCL : SOFTMAX F16 mask support and other fixes (llama/11261) More... 5 months ago
AMD: parse the architecture as supplied by gcnArchName (llama/11244) More... 5 months ago
metal: Handle null returned from MTLCreateSystemDefaultDevice() (llama/11441) More... 5 months ago
metal : use residency sets (llama/11427) More... 5 months ago
cmake: add ggml find package (llama/11369) More... 5 months ago
vulkan: compile shaders on-demand (llama/11406) More... 5 months ago
Hip: disable VMM on hip as it seams that it dosent work in some configurations (llama/11420) More... 5 months ago
hip : Add hipGraph and VMM support to ROCM (llama/11362) More... 5 months ago
CUDA: fix FP16 cuBLAS GEMM (llama/11396) More... 5 months ago
rocBLAS: Avoid fp32->fp16->fp32 conversion on cdna (llama/11356) More... 5 months ago
CPU/CUDA: fix (GQA) mul mat back, add CUDA support (llama/11380) More... 5 months ago
cmake : avoid -march=native when reproducible build is wanted (llama/11366) More... 5 months ago
ggml : add option to not print stack on abort (ggml/1081) More... 5 months ago
Vulkan-run-test: fix mmq_wg_denoms (llama/11343) More... 5 months ago
vulkan: sort shaders for more deterministic binary (llama/11315) More... 5 months ago
vulkan: fix diag_mask_inf (llama/11323) More... 5 months ago