-
Notifications
You must be signed in to change notification settings - Fork 107
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Appears to be a tmp folder persisting? #401
Comments
@Desperate-Dan can you send the pangolin command line that you're running? Where is the tmp folder created? And can you send the last ~10 lines of pangolin output? Thanks. |
So that looks like everything that would be output from the preprossessing.smk pipeline, I can't seem to replicate this as the directory is cleared as expected on my system. If you try running in verbose mode and see where it says your tempdir lives it might give a clue? |
I have the same issue and worked around it by clearing at end of pipeline. This is the case for all types of runs, including successful one. I can confirm it fills the disk if no manual cleanup is done. I run pangolin with |
That's very interesting @rgerhards, everything should get wrtten to a tempdir that gets cleared automatically upon completion (uses python tempdir module). Are you still seeing all the intermediate files? |
I need to re-run as soon as the current run is through (as I said, the pipeline clears temp files out). Will happily do that, but will take me at least until tomorrow morning to provide more info. |
I can confirm the issue. file system after run:
one log file:
pangolin call:
Versions:
I'll update to 4.0.x soon. |
Update: same with 4.0.2 I now also have a scorpio log file (with no helpful info as far as I can see). |
Apologies for the delay, I've done some tweaking and I believe my issue was being caused by running pangolin in the /tmp directory within a docker container. Switching to another directory in my container for running pangolin means that the pangolin /tmpXXXXX dir is produced in the container's /tmp dir, so isn't maintained when the container shuts down. If I keep the container running though I can go to /tmp dir in the container and see the pangolin /tmpXXXXX dir so it is persisting after the command. This isn't really an issue for me as my container is continually restarted, but it is something that's changed since the update. Perhaps this is now a permissions issue within my container? I'll investigate further. I've also attached the full output from the pangolin command @AngieHinrichs. Thanks! |
For me, it is a regular VM install, so no container involved. I was a bit lazy and just did a "The user of mkdtemp() is responsible for deleting the temporary directory and its contents when done with it." Source: https://docs.python.org/3/library/tempfile.html#tempfile.mkdtemp Might this be the root issue? My apologies if I did look at the wrong spots (a full code review is unfortunately out of scope for me). Thanks again for all your great work! |
Oh, indeded. see commit: d30284b As far as I read it, it changed Edit info: used wrong sample, but commit was OK, so collapsing this to relevant piece of info. |
I think that is the source of the issue you're right, I've added in a cleanup step to the current dev branch and will just see if it passes tests! |
Resolved in pangolin v4.0.3! |
Hello! I've run pangolin a couple of times since the update and there appears to be a tmp folder that persists after completion that contains a lot of what look to me like intermediate files:
It's always called tmpXXXXXXXX which changes each time (eg above is tmprk5ptk5p) so potentially just a missing / somewhere?
The text was updated successfully, but these errors were encountered: