Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

fix: crash when output directory doesn't exist #364

Merged

Conversation

HarikrishnanBalagopal
Copy link
Contributor

Description of the change

Fixes crash in #359

Related issue number

#359

How to verify the PR

Run a multi GPU training with a non-existent output dir.

Was the PR tested

  • I have added >=1 unit test(s) for every new method I have added.
  • I have ensured all unit tests pass

Copy link

github-actions bot commented Oct 2, 2024

Thanks for making a pull request! 😃
One of the maintainers will review and advise on the next steps.

@kmehant
Copy link
Collaborator

kmehant commented Oct 3, 2024

FYA - @anhuong #359 (comment)

Signed-off-by: Harikrishnan Balagopal <harikrishmenon@gmail.com>
@Abhishek-TAMU
Copy link
Collaborator

If race condition is already tested as per this, then it looks good to me.

@anhuong
Copy link
Collaborator

anhuong commented Oct 4, 2024

Abhishek additionally tested this in the image with the accelerate_launch.py script which worked nicely as well

@anhuong anhuong merged commit 63d0c5d into foundation-model-stack:main Oct 4, 2024
8 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants