Skip to content

Activity

Adding new env to lit tests

pawelszczerbukpushed 1 commit to pawel/mmav5_2dots_2 • d1120b0…24438f2 • 
4 days ago

[AMD] NFC: Replace intrinsics for ballot and readlane with ROCDL ops (t…

pawelszczerbukpushed 7 commits to main • 6fa33ef…15a4d66 • 
4 days ago

Adding env var for enabling 2 dots pipelining, disabled by default

pawelszczerbukpushed 1 commit to pawel/mmav5_2dots_2 • 622e66a…d1120b0 • 
4 days ago

Update the comment

pawelszczerbukpushed 1 commit to pawel/mmav5_2dots_2 • 58816c7…622e66a • 
4 days ago

Fix for extra barrier

pawelszczerbukpushed 1 commit to pawel/mmav5_2dots_2 • 6f2bd71…58816c7 • 
4 days ago

lit tests updated

pawelszczerbukcreated pawel/mmav5_2dots_2 • 6f2bd71 • 
5 days ago

[TEST] force num_warps to be an int (triton-lang#6319)

pawelszczerbukpushed 5 commits to main • 5ce3754…6fa33ef • 
5 days ago

Fix for colliding tmem allocation boundaries

pawelszczerbukcreated pawel/mmav5_2dots • 4869842 • 
5 days ago

Fix for colliding tmem allocation boundaries

pawelszczerbukcreated pawel/tmem_alloc_collission • 78361f4 • 
6 days ago

Fix typing of do_not_specialize kwarg of triton.jit (triton-lang#…

pawelszczerbukpushed 13 commits to main • 3121ad5…5ce3754 • 
6 days ago

Merge cleanup

pawelszczerbukpushed 11 commits to pawel/lower_mmav5 • 77918ac…90bfd50 • 
7 days ago

[AMD] Enable packed Bf8/Fp8->Bf16 conversions for gfx950 (triton-lang…

pawelszczerbukpushed 9 commits to main • 484b9c6…3121ad5 • 
7 days ago

Tweak assign latencies heuristic to include data fed into tmem_store.…

pawelszczerbukpushed 2 commits to pawel/lower_mmav5 • dc2e9bd…77918ac • 
7 days ago

Disable mma pipelining when there is >1 dot in the loop

pawelszczerbukpushed 2 commits to pawel/lower_mmav5 • da42599…dc2e9bd • 
8 days ago

Fixing buffering issue on hopper

pawelszczerbukpushed 1 commit to pawel/lower_mmav5 • 4f6a3dd…da42599 • 
8 days ago

Actually adding the files...

pawelszczerbukpushed 1 commit to pawel/lower_mmav5 • e8bb223…4f6a3dd • 
8 days ago

Bringing back some deleted files to make mergeing easier after landin…

pawelszczerbukpushed 2 commits to pawel/lower_mmav5 • 8cf8c08…e8bb223 • 
8 days ago

Merge branch 'main' into pawel/lower_mmav5

pawelszczerbukpushed 10 commits to pawel/lower_mmav5 • 0a5f2cb…8cf8c08 • 
8 days ago

[TUTORIALS] Bypass tma_ws in the tutorial when the GPU doesn't supp…

pawelszczerbukpushed 12 commits to main • 593a1b5…484b9c6 • 
8 days ago

Reapply "Removing keep_tmem_in_acc"

pawelszczerbukpushed 5 commits to pawel/lower_mmav5 • df1a473…0a5f2cb • 
8 days ago

Merge branch 'main' into pawel/lower_mmav5

pawelszczerbukpushed 4 commits to pawel/lower_mmav5 • ae5f15c…df1a473 • 
11 days ago

Fixing num stages for tma descriptor lowering

pawelszczerbukpushed 1 commit to pawel/lower_mmav5 • 23698af…ae5f15c • 
11 days ago

Merge branch 'main' into pawel/lower_mmav5

pawelszczerbukpushed 11 commits to pawel/lower_mmav5 • 0d44115…23698af • 
11 days ago

[LAYOUTS] Kill getWarpsPerCTA(Attribute) and prefer LinearLayout-base…

pawelszczerbukpushed 10 commits to main • 4904034…593a1b5 • 
11 days ago

fixing lit

pawelszczerbukpushed 1 commit to pawel/lower_mmav5 • 38133d6…0d44115 • 
11 days ago

Fixing number of tma descriptor buffers

pawelszczerbukpushed 1 commit to pawel/lower_mmav5 • 5eafb1d…38133d6 • 
12 days ago

removing debug change in tests

pawelszczerbukpushed 1 commit to pawel/lower_mmav5 • 75ab8bc…5eafb1d • 
12 days ago

Cleanup around pipeline utils

pawelszczerbukpushed 1 commit to pawel/lower_mmav5 • 22d04a2…75ab8bc • 
12 days ago

Stale env var

pawelszczerbukpushed 1 commit to pawel/lower_mmav5 • 4e0c2a2…22d04a2 • 
12 days ago

Removing keep_tmem_in_acc

pawelszczerbukpushed 1 commit to pawel/lower_mmav5 • 65f0a1c…4e0c2a2 • 
12 days ago