Skip to content

Commit

Permalink
Update README.md
Browse files Browse the repository at this point in the history
  • Loading branch information
YichengDWu authored May 2, 2024
1 parent 130a240 commit 7e5d46b
Showing 1 changed file with 7 additions and 0 deletions.
7 changes: 7 additions & 0 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -11,3 +11,10 @@ The primary purpose of developing this library is my desire to learn CuTe.
The name **Mo Ye** is derived from an ancient Chinese [legend of swordsmiths](https://en.wikipedia.org/wiki/Gan_Jiang_and_Mo_Ye).

The documentation is mostly my learning notes. Please refer to CuTe's documentation for more details.

GEMM essentially faces two main performance hurdles:

1. Swizzling to prevent bank conflicts.
2. An efficient epilogue, which involves transferring data from registers to shared memory, followed by a vectorized transfer back to global memory.

Since I've sold my computer, I no longer have access to an NVIDIA GPU, thus the development of this library will be put on hold indefinitely.

0 comments on commit 7e5d46b

Please sign in to comment.