[Kernel] Add back batch size 1536 and 3072 to MoE tuning #5242

WoosukKwon · 2024-06-04T08:07:48Z

This PR adds back batch sizes 1536 and 3072 to MoE tuning. They were mistakenly omitted during the refactoring in #4921.
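To illustrate why the two missing sizes matter, here is a minimal sketch. It assumes tuned kernel configurations are picked by the nearest tuned batch size, which this PR does not state. The helper `nearest_tuned_batch_size` and both lists are hypothetical and are not copied from the vLLM source.

```python
# Hypothetical sketch: the helper and the lists below are illustrative only.
# Assumption: a tuned config is selected by the nearest tuned batch size.

def nearest_tuned_batch_size(tuned_sizes, m):
    """Return the tuned batch size closest to the runtime batch size m."""
    return min(tuned_sizes, key=lambda s: abs(s - m))

# Illustrative subsets of a tuning sweep, without and with the restored sizes.
WITHOUT_FIX = [1024, 2048, 4096]
WITH_FIX = [1024, 1536, 2048, 3072, 4096]

for m in (1500, 3000):
    before = nearest_tuned_batch_size(WITHOUT_FIX, m)
    after = nearest_tuned_batch_size(WITH_FIX, m)
    print(f"batch {m}: nearest tuned size {before} without fix, {after} with fix")
```

Under that assumption, a runtime batch of 1500 or 3000 would fall back to a config tuned for a noticeably different size unless 1536 and 3072 are part of the sweep.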

pcmoritz

Thanks for the fix!

[Kernel] Add back batch size 1536 and 3072 to MoE tuning

47069e5

WoosukKwon marked this pull request as ready for review June 4, 2024 08:07

WoosukKwon mentioned this pull request Jun 4, 2024

[Kernel] Re-tune Mixtral MoE configurations for FP8 on H100 #5238

Merged

WoosukKwon requested a review from pcmoritz June 4, 2024 08:13

comaniac approved these changes Jun 4, 2024

pcmoritz approved these changes Jun 4, 2024

WoosukKwon merged commit 27208be into main Jun 4, 2024
52 of 69 checks passed

WoosukKwon deleted the missed-batch-sizes branch June 4, 2024 16:58

blinkbear pushed a commit to blinkbear/vllm that referenced this pull request Jun 6, 2024

[Kernel] Add back batch size 1536 and 3072 to MoE tuning (vllm-projec…

73eaaf6

…t#5242)

robertgshaw2-neuralmagic pushed a commit to neuralmagic/nm-vllm that referenced this pull request Jun 11, 2024

[Kernel] Add back batch size 1536 and 3072 to MoE tuning (vllm-projec…

789553f

…t#5242)

joerunde pushed a commit to joerunde/vllm that referenced this pull request Jun 17, 2024

[Kernel] Add back batch size 1536 and 3072 to MoE tuning (vllm-projec…

bdbb931

…t#5242)

xjpang pushed a commit to xjpang/vllm that referenced this pull request Jun 27, 2024

[Kernel] Add back batch size 1536 and 3072 to MoE tuning (vllm-projec…

6a592bd

…t#5242)

xjpang pushed a commit to xjpang/vllm that referenced this pull request Jul 8, 2024

[Kernel] Add back batch size 1536 and 3072 to MoE tuning (vllm-projec…

dd18fb3

…t#5242)

xjpang pushed a commit to xjpang/vllm that referenced this pull request Jul 24, 2024

[Kernel] Add back batch size 1536 and 3072 to MoE tuning (vllm-projec…

8bf1dc7

…t#5242)

Temirulan pushed a commit to Temirulan/vllm-whisper that referenced this pull request Sep 6, 2024

[Kernel] Add back batch size 1536 and 3072 to MoE tuning (vllm-projec…

cacb366

…t#5242)
