Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Native multi-GPU training requires pytorch>=1.9.1 #1199

Closed
sailist opened this issue Mar 16, 2023 · 3 comments · Fixed by #1203
Closed

Native multi-GPU training requires pytorch>=1.9.1 #1199

sailist opened this issue Mar 16, 2023 · 3 comments · Fixed by #1203

Comments

@sailist
Copy link

sailist commented Mar 16, 2023

System Info

I encountered an error while running a multi-GPU training program using accelerate. The `accelerate launch scripts.py` raise an NotImplementException


Native multi-GPU training requires pytorch>=1.9.1
  • Have the developers abandoned support for PyTorch with version lower than that?
  • If there are any workarounds that can be used to continue using PyTorch 1.8 with accelerate to use multi-GPU training.
torch                    1.8.1+cu101
accelerate               0.16.0


### Information

- [ ] The official example scripts
- [X] My own modified scripts

### Tasks

- [ ] One of the scripts in the examples/ folder of Accelerate or an officially supported `no_trainer` script in the `examples` folder of the `transformers` repo (such as `run_no_trainer_glue.py`)
- [ ] My own task or dataset (give details below)

### Reproduction

In any environment with pytorch version lower than 1.9.1

### Expected behavior

```Shell
Run without any exception.
@muellerzr
Copy link
Collaborator

Hi, it's the CLI that requires >=1.9.1, please use torch.distributed.launch directly through the CLI to launch the code using Accelerate

@Chen-Suyi
Copy link

Hi, it's the CLI that requires >=1.9.1, please use torch.distributed.launch directly through the CLI to launch the code using Accelerate

Does it mean accelerate does not support torch with version < 1.9.1?

@muellerzr
Copy link
Collaborator

Just accelerate launch specifically on multi GPU. You can launch with torchrun and use accelerate code that way

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging a pull request may close this issue.

3 participants