jondeaton / fht-jax Public


Fast Hadamard Transform CUDA bindings for JAX

fwht-jax

Fast Walsh-Hadamard Transform CUDA bindings for JAX

Credit to Tri Dao for the CUDA kernel implementation from HazyResearch/structured-nets

pip install .

TODO

Write a simple reference implementation (for non-GPU backends)
Benchmark the fused CUDA kernel against the simple implementation
Add vmap rules
Only float32 is supported; can we also support bfloat16?
Use an async stream for better performance?
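
As a starting point for the first TODO item, a simple non-GPU implementation could look roughly like the sketch below. It is written in plain `jax.numpy` and is not part of this package: the name `fwht_reference` is hypothetical, and the choices made here (unnormalized transform, natural Hadamard ordering, transform over the last axis, power-of-two length) are assumptions that would need to be checked against the CUDA kernel's conventions.

```python
import jax
import jax.numpy as jnp


def fwht_reference(x: jax.Array) -> jax.Array:
    """Unnormalized fast Walsh-Hadamard transform over the last axis.

    Hypothetical reference helper, not this package's API.
    Assumes the last dimension is a power of two.
    """
    n = x.shape[-1]
    if n < 1 or n & (n - 1) != 0:
        raise ValueError(f"last dimension must be a power of two, got {n}")
    batch_shape = x.shape[:-1]
    h = 1
    while h < n:
        # View the axis as blocks of length 2*h, each split into two halves.
        x = x.reshape(*batch_shape, n // (2 * h), 2, h)
        a, b = x[..., 0, :], x[..., 1, :]
        # Butterfly step: (a, b) -> (a + b, a - b), then flatten back.
        x = jnp.stack([a + b, a - b], axis=-2).reshape(*batch_shape, n)
        h *= 2
    return x


# Small example: an 8-point transform.
x = jnp.array([1.0, 0.0, 1.0, 0.0, 0.0, 1.0, 1.0, 0.0])
print(fwht_reference(x))  # [ 4.  2.  0. -2.  0.  2.  0.  2.]
```

Because the loop runs over static shapes in Python, the function can be wrapped in `jax.jit`, and `jax.vmap` applies to it without custom batching rules. Applying the transform twice returns the input scaled by `n`, which gives a quick sanity check and a baseline for comparing against the fused kernel.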
