[Feature] support ep for DeepSeek V3 #2740
Comments
Hi @zhyncs, I'd be more than happy to take a look! :)
Thanks!!
@zhyncs, when can this task be completed? Is there an approximate timeline?
Hi @zhyncs, we're working on it too. We'll release the available code ASAP and create a PR.
Hi @xinji1, thanks! Please join the Slack channel: https://slack.sglang.ai
FYI: The president of Meituan, @sleepcoo, will take over this feature. Cheers!
Looking forward to this feature.
It's on the way.
Checklist
Motivation
The code for expert parallelism (EP) and the code for the block-wise FP8 required by DeepSeek V3 both exist, but separately. The task is to integrate block-wise FP8 into the current DeepSeek V2 EP implementation, based on the previous integration of block-wise FP8 into Fused MoE.
ref
https://github.com/sgl-project/sglang/tree/main/python/sglang/srt/layers/moe/ep_moe
#2575
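For readers unfamiliar with the term, here is a minimal sketch of what "block-wise" scaling means for an FP8 weight matrix: one scale per tile instead of one per tensor. It is an illustration only, not the sglang implementation. The function names (`blockwise_scales`, `quantize`, `dequantize`) and the tiny block size are hypothetical, and rounding to the actual FP8 grid is left out, so only the scale layout is shown.

```python
FP8_E4M3_MAX = 448.0  # largest finite value representable in FP8 E4M3


def blockwise_scales(weight, block=2):
    """Return one scale per (block x block) tile of a 2-D list of floats.

    `block=2` is an illustrative assumption to keep the example small.
    """
    rows, cols = len(weight), len(weight[0])
    scales = []
    for r0 in range(0, rows, block):
        row_scales = []
        for c0 in range(0, cols, block):
            # Largest magnitude inside this tile.
            amax = max(
                abs(weight[r][c])
                for r in range(r0, min(r0 + block, rows))
                for c in range(c0, min(c0 + block, cols))
            )
            # An all-zero tile gets a neutral scale to avoid dividing by zero.
            row_scales.append(amax / FP8_E4M3_MAX if amax > 0 else 1.0)
        scales.append(row_scales)
    return scales


def quantize(weight, scales, block=2):
    """Scale every element into the FP8 range (rounding to FP8 is omitted)."""
    return [
        [
            max(-FP8_E4M3_MAX, min(FP8_E4M3_MAX, w / scales[r // block][c // block]))
            for c, w in enumerate(row)
        ]
        for r, row in enumerate(weight)
    ]


def dequantize(q, scales, block=2):
    """Multiply each element by the scale of the tile it belongs to."""
    return [
        [v * scales[r // block][c // block] for c, v in enumerate(row)]
        for r, row in enumerate(q)
    ]


# Hypothetical 4x4 weight for a single expert.
weight = [
    [1.0, -2.0, 100.0, 50.0],
    [0.5, 4.0, -25.0, 10.0],
    [0.0, 0.0, 8.0, -16.0],
    [0.0, 0.0, 2.0, 1.0],
]
scales = blockwise_scales(weight)
q = quantize(weight, scales)
restored = dequantize(q, scales)
```

In an EP setting each expert's weight would carry its own grid of scales like this, so the scales have to be sharded and passed to the grouped GEMM together with the weights they belong to.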
Related resources
No response