Several common methods of matrix multiplication are implemented on CPU and Nvidia GPU using C++11 and CUDA.
cpu cuda tiling cublas cpp11 nvidia shared-memory reordering naive strassen kahan coppersmith-winograd matrix-multiply
-
Updated
Feb 8, 2023 - C++