GitHub - calgaryml/SQuAsh: SQuAsh: Sparse, Quantized fine-tuning using Adapters at home

This repo is a work in progress. Fixed fan-in HIP kernels have been implemented but require further tuning as you can see below:

    docker build -t squash .

Check if amd GPU is renderD128 and renderD129 if you have 2 GPUs (or more), in my case it is renderD128

docker run -itd --device /dev/kfd --device /dev/dri/renderD128 -v $(pwd):/workspace squash

Usage steps:

Run python3 ./squash/trainer.py to run some quick unit tests.
Run python3 ./squash/benchmark.py 0.9 to benchmark 90% sparse fixed-fan in kernels on OPT-350M vs. dense benchmarks.
To reproduce plot, run python3 ./squash/benchmark.py 0.9 && python3 ./squash/benchmark.py 0.95 && python3 ./squash/benchmark.py 0.99 and then run all cells in ./notebooks/plots.ipynb.

Name		Name	Last commit message	Last commit date
Latest commit History 36 Commits
.vscode		.vscode
notebooks		notebooks
rocm_torch_extension		rocm_torch_extension
squash		squash
third-party		third-party
.gitignore		.gitignore
.gitmodules		.gitmodules
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
setup.py		setup.py

Provide feedback