0
I Use This!
Moderate Activity

Commits : Listings

Analyzed about 3 hours ago. based on code collected about 3 hours ago.
Jul 02, 2024 — Jul 02, 2025
Commit Message Contributor Files Modified Lines Added Lines Removed Code Location Date
Merge branch 'gpu_cholesky' into 'master_pre_stage' More... over 1 year ago
Allow tuning of the tile-size in cholesky More... over 1 year ago
Fix non-mpi case in multiply_a_b GPU More... over 1 year ago
Cleanup of cannon_forw_template.c More... over 1 year ago
optimization_cholesky_7 add cusolverXpotrf,cuda_malloc_host_intptr interfaces More... over 1 year ago
Fix configure More... over 1 year ago
FIX GPU version of hermitian_multiply More... over 1 year ago
Split cudaFunctions_template.h, create cusolverFunctions_template.h More... over 1 year ago
Minor fixes in man pages More... over 1 year ago
optimization_cholesky_6 add gpu_accumulate_device_info More... over 1 year ago
optimization_cholesky_5d add hip_get_last_error More... over 1 year ago
Add hip_check_device_info More... over 1 year ago
optimization_cholesky_5d add get_last_error More... over 1 year ago
Adjust configure of rocsolver similarly to cusolver More... over 1 year ago
optimization_cholesky_5c cleanup rename vendor_agnostic_layer_utilities More... over 1 year ago
optimization_cholesky_5b cleanup - delete unused vendor_agnostic_layer_template.F90 More... over 1 year ago
optimization_cholesky_5a cleanup nccl_group_start_end, add gpu_stream_synchronize after nccl_bcast More... over 1 year ago
optimization_cholesky_5 mov _nccl_group_star out of the loop More... over 1 year ago
optimization_cholesky_4 eliminate memcpy tmp1_dev to tmp1 More... over 1 year ago
Merge branch 'master_pre_stage' into 'gpu_cholesky' More... over 1 year ago
optimization_cholesky_3 eliminate memcpy of info in cublasDpotrf, empty kernel_check_info More... over 1 year ago
optimization_cholesky_2 eliminate memcpy tmatc_dev -> tmatc More... over 1 year ago
Merge branch 'peter_fix_cublas_caching' into 'master_pre_stage' More... over 1 year ago
Fix --with-cusolver in configure.ac More... over 1 year ago
Fix usage of elpa_gpu_ccl_transpose_vectors in cholesky More... over 1 year ago
Fix hermitian_multiply 'L' and 'U' case on GPU More... over 1 year ago
Add elpa_gpu_ccl_transpose_vectors in cholesky More... over 1 year ago
Add linking to cublasLt for cublasLtHeuristicsCacheSetCapacity() More... over 1 year ago
Add headers for cublas caching fix to test/shared/GPU/CUDA/test_cudaFunctions.cu More... over 1 year ago
Change permissions of run_ci_tests_nccl_very_large.sh to 755 More... over 1 year ago