Commit

Apply 2D blocking to all kernels (#156)
This change improves tinyBLAS so that Windows users (who
aren't using cuBLAS) can expect a 13% boost in both token
generation speed and batch prompt / image processing.
ahgamut authored Jan 3, 2024
1 parent d802427 commit c0589f0
Showing 1 changed file with 214 additions and 154 deletions.