Activity
Adding new env to lit tests
Adding env var for enabling 2 dots pipelining, disabled by default
Update the comment
Fix for extra barrier
Fix for colliding tmem allocation boundaries
Fix for colliding tmem allocation boundaries
Tweak assign latencies heuristic to include data fed into tmem_store.…
Disable mma pipelining when there is >1 dot in the loop
Fixing buffering issue on hopper
Actually adding the files...
Bringing back some deleted files to make merging easier after landin…
Merge branch 'main' into pawel/lower_mmav5
[TUTORIALS] Bypass tma_ws in the tutorial when the GPU doesn't supp…
Reapply "Removing keep_tmem_in_acc"
Merge branch 'main' into pawel/lower_mmav5
Fixing num stages for tma descriptor lowering
Merge branch 'main' into pawel/lower_mmav5
[LAYOUTS] Kill getWarpsPerCTA(Attribute) and prefer LinearLayout-base…
Fixing number of tma descriptor buffers
removing debug change in tests
Cleanup around pipeline utils
Removing keep_tmem_in_acc