Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Kernel] Correctly invoke prefill & decode kernels for cross-attention (towards eventual encoder/decoder model support) #4888
[Kernel] Correctly invoke prefill & decode kernels for cross-attention (towards eventual encoder/decoder model support) #4888
Changes from 250 commits
2d7e081
845f040
849e49c
a80325d
8b38776
f39c313
90b5a0e
9ee2582
5d0ac23
1bcc949
5ae5969
1bece71
1d882ca
dfd9469
a2e5465
3c3687e
6f07c77
64e71e1
481c646
9c8e19d
ed17ee3
d630aa8
b664806
611df43
8ee49dd
6f4b49e
039c25e
8e9ef5b
2b59ddc
471569f
0ad9d6a
6f9ab7d
19d1ca5
700b6dc
76c639a
882640e
584297e
1de7077
a89c7c6
0bbd0db
af998ca
78c678a
0b6b2e9
641f431
c7f5490
9c78f85
bf93a9e
cd759f2
1af3625
eb5cf0c
afcb42e
d13e08e
ab92fb0
582a0f5
a20be6d
a9a162d
6e3cfe1
622ce09
4d88a89
3dfcb55
6960723
104a8aa
31275cc
7b2374c
a643436
a0e1a2a
2a1d84a
f6e0310
60c01c3
5c94166
eaa627f
9c597c4
9831ce6
180ba26
b7c1e84
faf9118
61d63bd
b9b6048
e2e2082
b223873
2ea335c
02875ab
79f307d
e790a00
aae601b
d871c9f
90d5c0d
a973c2b
d8d284e
27dc095
2445905
8dabdc2
c6200e6
0dc197b
3d3c04f
c132caa
39ee51a
c41917b
e738fb4
dfe9c10
8221758
98fcb64
db2b2d2
cbb89b1
33b598a
c9ce86b
ca570e7
bf88882
b2e131f
ed8f8b3
c944cc1
bea9e01
8abe51c
fec833e
8bd5280
da1b648
eefd588
60a21e3
306ea5b
eda2273
9425f0c
62fb8d1
2730daa
5face2a
f39155a
51144ad
3f87f37
c7edbc6
b023557
9a359b3
af0c0b9
50bca08
90610da
a006cc8
20b95b0
6d52d60
9d7ebb3
a81712b
d35ea41
27782df
c3a2e7a
8babfda
ce2422b
0045051
91eb067
a6aee80
083c205
ea09789
ee51260
50a45cc
cd0a1aa
ec5977d
196d4b1
1f7b2eb
76b0b9e
aa5363a
5351416
8d390a0
68b6d4b
5923002
c3e5d2a
739ab3c
a2a451f
2265224
d72aaa9
1c19d36
e10340d
67ab576
4ae1b8a
dc7d3c8
bab33c3
e9c2a85
5791028
28a9e76
0a65267
f0cd5ea
2a7fd86
286489c
8e1daa1
a2a7ac5
dfc96c5
d357568
0285548
97cad0b
29fa1af
488c0fa
c9f11ff
196e671
03e5d81
1f3874d
528b4a7
708a4b3
f06c687
b3c3411
4dccd51
e229e00
90aec38
4758680
addde7d
d0fd9e1
7b9cb7f
c3f7da7
5f8c7f6
525303c
91cbaa6
ea37e17
67ed419
e9d7ede
ca68c63
ce88fa3
5ce2dd0
125e5dc
597526a
a178b7a
06c7f75
47c9f39
2f0b05b
d23c284
1a6e5a3
e2a46e3
4dabe19
7ca0d7a
c24697f
75756b9
bcccc34
c8f8d59
a501849
83d474e
64981b5
8d36458
5ff9c76
2828aa7
65e47db
2f0eb9b
d81662c
13f5b50
5dbebbc
07df0e1
7e0bc57
e837a73
7ce9a51
9ae6728
a1bf652
4f27946
5ee30fe
45fc9f7
097aff2
d8a692b
5df73fc
6cd595c
File filter
Filter by extension
Conversations
Jump to
There are no files selected for viewing