An extension library for the WMMA API (Tensor Core API)
Updated Jul 12, 2024 - Cuda
FP64-equivalent GEMM on Int8 Tensor Cores using the Ozaki scheme
Artifact for USENIX ATC'23: TC-GNN: Bridging Sparse GNN Computation and Dense Tensor Cores on GPUs.
SParse AcceleRation on Tensor Architecture
Fast SGEMM emulation on Tensor Cores
An extension library for the WMMA API that performs single-precision matrix operations on Tensor Cores using an error-correction technique