Support for partition_spaces and separate execution space instances (GPU streams) in Kokkos Kernels. #1119

wlruys · 2021-09-29T19:55:55Z

Opening up an issue for this after a conversation on the Slack. (feature-request)

Now that CUDA/HIP/SYCL stream support and partition_spaces are developed and more stable in Kokkos Core, it would be great to have this support in Kokkos Kernels as well.

This would allow dispatching BLAS and other kernels of 'medium' size, that are too large for a single block thread team and too small to be worth locking the whole device.

For instance something like:

ExecSpace spaces[N];
partition_space(ExecSpace(),N,spaces);
KokkosBlas::GEMM(spaces[0], "N", "N", one, A0, B0, one, C0);
KokkosBlas::GEMM(spaces[1], "N", "N", one, A1, B1, one, C1);

to dispatch the two kernels asynchronously.

The text was updated successfully, but these errors were encountered:

lucbv · 2021-10-25T16:28:02Z

@dialecticDolt
I merged the work on this feature in PR #1131 let me know if that meets your requirements?
If so we can probably close this issue, otherwise let's discuss what more is needed.

lucbv mentioned this issue Oct 6, 2021

Stream interface: adding stream support in GEMV and GEMM #1131

Merged

brian-kelley mentioned this issue Mar 20, 2023

Add exec instance support to sort/sort_and_merge utils #1744

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Support for partition_spaces and separate execution space instances (GPU streams) in Kokkos Kernels. #1119

Support for partition_spaces and separate execution space instances (GPU streams) in Kokkos Kernels. #1119

wlruys commented Sep 29, 2021 •

edited

Loading

lucbv commented Oct 25, 2021

Support for partition_spaces and separate execution space instances (GPU streams) in Kokkos Kernels. #1119

Support for partition_spaces and separate execution space instances (GPU streams) in Kokkos Kernels. #1119

Comments

wlruys commented Sep 29, 2021 • edited Loading

lucbv commented Oct 25, 2021

wlruys commented Sep 29, 2021 •

edited

Loading