Releases · eth-cscs/DLA-Future

Changes

Introduced an option (*) for forcing contiguous GPU communication buffers. (#1096)
Introduced an option (*) for enabling GPU aware MPI communication. (#1102)
Removed special handling of Intel MKL, as it could lead to broken installations. (#1149)
- Spack installations: spack will set the correct variables.
- Manual installations: the user is responsible to correctly set variables (see BUILD.md).

(*) These options are available as spack variants.

Performance improvements

Don't communicate in algorithms when using single rank communicators. (#1097)
Fixed slow performance of local version of bt_band_to_tridiagonal (#1144)

Bug fixes

Implemented a workaround for hipMemcpyDefault 2D memcpys, due to bugs in HIP. (#1106)
Miniapps initialize HIP before MPI, as on older Cray MPICH versions initializing HIP after MPI leads to HIP not seeing any devices. (#1090)

Changes:

Modified CommunicatorGrid to avoid blocking calls to MPI_Comm_dup. It now returns communicator pipelines. (#993)
Added support for Intel oneMKL and the intel-oneapi-mkl spack package. (#1073) (*)

Performance improvements:

Reduced the size of the matrix-matrix multiplications in the tridiagonal eigensolver to cover only the non deflated part of the eigenvectors. (#951 #967 #996 #997 #998)
Introduced stackless threads where appropriate. (#1037)

Bug fixes:

Use drop_operation_state to avoid stack overflows. (#1004)

Notes:

(*) At the time of the release the spack spec blaspp~openmp ^intel-oneapi-mkl threads=openmp doesn't build. If you rely on multithreaded BLAS we suggest to use blaspp+openmp ^intel-oneapi-mkl threads=openmp until spack/spack#42087 gets merged.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Bug fixes

Bug fixes

Changes

Performance improvements

Bug fixes

Changes

Performance improvements

Bug fixes

Changes

Performance improvements

Bug fixes

Bug fixes

Changes:

Performance improvements:

Bug fixes:

Notes:

Bugfix:

Changes:

Performance improvements:

Bugfix:

Releases: eth-cscs/DLA-Future

DLA-Future 0.7.3

Bug fixes

DLA-Future 0.7.1

Bug fixes

DLA-Future 0.7.0

Changes

Performance improvements

Bug fixes

DLA-Future 0.6.0

Changes

Performance improvements

Bug fixes

DLA-Future 0.5.0

Changes

Performance improvements

Bug fixes

DLA-Future 0.4.1

Bug fixes

DLA-Future 0.4.0

Changes:

Performance improvements:

Bug fixes:

Notes:

DLA-Future 0.3.1

Bugfix:

DLA-Future 0.3.0

Changes:

Performance improvements:

DLA-Future 0.2.1

Bugfix: