0
I Use This!
Moderate Activity

Commits : Listings

Analyzed about 5 hours ago. based on code collected about 5 hours ago.
Jun 25, 2024 — Jun 25, 2025
Commit Message Contributor Files Modified Lines Added Lines Removed Code Location Date
Rename AMD sources: .cpp -> .hip, add preprocessing for .hip sources More... 8 months ago
Add preprocessing for .cu sources More... 8 months ago
Minor renaming: useGPU -> do_useGPU in elpa1_template.F90 More... 8 months ago
Save-1. Hipifly elpa_pxgemm_multiply kernels -- create a universal cuda/hip interface More... 8 months ago
Split elpa_multiply_a_b_cuda.cu to elpa_hermitian_multiply_cuda.cu and elpa_pxgemm_multiply_cuda.cu (and same for hip) More... 8 months ago
Enable switching on/off GPU branches for individual steps in ELPA1 (tridiag, solve_tridi, trans_ev) More... 8 months ago
Add new key-value pair (runtime option) pxgemm_for_generalized, with default value 1 More... 8 months ago
Bugfix for elpa_hermitian_multiply, CPU-codepath More... 8 months ago
Bugfixes for elpa_pxgemm_multiply complex datatype More... 8 months ago
Add explicit interfaces and delete unused parameters uplo_a, uplo_c for elpa_pxgemm_multiply More... 8 months ago
Add tests for non-hermitian (non-symmetric) matrices for elpa_pxgemm More... 8 months ago
Bugfix: pxgemm, square-grid GPU codepath More... 8 months ago
Add pxgemm HIP kernels More... 8 months ago
Add HIP, SYCL, OpenMP offload fortran interfaces for pxgemm More... 8 months ago
Switch on square-grid codepaths in pxgemm More... 8 months ago
Switch on elpa_pxgemm_multiply instead of elpa_hermitian_multiply for generalized eigenproblem More... 8 months ago
Create d_ptr codepath for pxgemm-NN,TT More... 8 months ago
Cleanup NVTX labels in multiply_a_b/elpa_pxgemm_a_b_template.F90 More... 8 months ago
pxgemm-TN,NT: cleanup and optimize d_ptr codepath More... 8 months ago
Bugfix pxgemm: introduce CPU buffer for no-send op in elpa_transpose_row_or_col, non-NCCL More... 8 months ago
Bugfix pxgemm: reduce max number of threads to 512 in cuda_update_c_tn_nt_kernel More... 8 months ago
Port pxgemm-TN,NT nonsquare-grid codepath to NCCL More... 8 months ago
Port update_c to GPU in pxgemm More... 8 months ago
Merge branch 'origin/master_pre_stage' into peter_nonhermitian_multiply More... 9 months ago
Initial port of pxgemm-TN,NT nonsquare-grid coepath to GPU More... 9 months ago
Merge branch 'peter_ubuntu_cholesky_fix' into 'master_pre_stage' More... 9 months ago
Fix single-precision case More... 9 months ago
Increase the error check limit for single precision Cholesky from 0.09 to 0.3 to silence the problem for builds with Reference LAPACK on Ubuntu More... 9 months ago
Implement pxgemm-NT nonsquare-grid CPU codepath More... 9 months ago
Fix error in non-stream case More... 9 months ago