Fix a bug in flash attention where kv_seq_len should divide block_k_major. #10961
Job | Run time |
---|---|
2s | |
1h 11m 31s | |
1h 14m 2s | |
1m 44s | |
8m 55s | |
18m 15s | |
5m 49s | |
11m 16s | |
12m 47s | |
5m 42s | |
6m 6s | |
3h 36m 9s |
Job | Run time |
---|---|
2s | |
1h 11m 31s | |
1h 14m 2s | |
1m 44s | |
8m 55s | |
18m 15s | |
5m 49s | |
11m 16s | |
12m 47s | |
5m 42s | |
6m 6s | |
3h 36m 9s |