Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Spot launch doesn't respect .gitignore #1898

Closed
keirp opened this issue Apr 25, 2023 · 6 comments · Fixed by #2018
Closed

Spot launch doesn't respect .gitignore #1898

keirp opened this issue Apr 25, 2023 · 6 comments · Fixed by #2018
Assignees
Labels
enhancement New feature or request feature-request
Milestone

Comments

@keirp
Copy link

keirp commented Apr 25, 2023

When launching an instance with sky launch, files included in the my .gitignore file are ignored during the rsync. However, when I launch a job with sky spot launch, it seems as though the files are included. Is this intended behavior?

@Michaelvll
Copy link
Collaborator

Michaelvll commented Apr 25, 2023 via email

@concretevitamin
Copy link
Member

cc @romilbhardwaj

Another idea is to read .gitignore and pass each line as an --exclude arg to the aws s3 sync command: https://github.com/skypilot-org/skypilot/blob/master/sky/data/storage.py#L1711

@concretevitamin concretevitamin added this to the Storage milestone Apr 26, 2023
@concretevitamin
Copy link
Member

Hit this myself. Current workdir has a bunch of "results" non-code files that should not be uploaded. Now every spot launch is taking quite a while just for uploading these. An important UX issue.

@landscapepainter
Copy link
Collaborator

I'll take on this one.

@landscapepainter
Copy link
Collaborator

@concretevitamin Would you mind elaborating on what you meant by 'bunch of "results" non-code files'? I haven't encountered anything odd that got uploaded to the spot instance while running spot launch

@romilbhardwaj
Copy link
Collaborator

I think he was just providing an example - he has a some non-code files and dirs (e.g., experiment results) that are large and included in his gitignore, but they still get uploaded when running sky spot launch. If we parse .gitignore correctly when uploading storage objects, these files should not get uploaded.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment