Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

how deepspeed can avoid doing all_reduce? #6690

Closed
luuck opened this issue Oct 30, 2024 · 1 comment
Closed

how deepspeed can avoid doing all_reduce? #6690

luuck opened this issue Oct 30, 2024 · 1 comment
Assignees

Comments

@luuck
Copy link

luuck commented Oct 30, 2024

I set something in DeepSpeed in order to bypass being all-reduced on all model parameters.

In order words, below is exactly the parameter list I need to bypass doing allreduce:
bypass_list = [x for x in model.parameters() if getattr(param, '_tutel_expert', False) == True]

how DeepSpeed can avoid doing all_reduce for bypass_list above?

@jomayeri
Copy link
Contributor

There is an outstanding PR that may be of use: #6675

@jomayeri jomayeri self-assigned this Nov 14, 2024
@jomayeri jomayeri closed this as completed Dec 6, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants