You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Currently, the default value of clean_workdir is set to True for all workflow protocols. I'm wondering if we should use False as the default instead (which is also the default used when specifying the input). Although it is often desirable to clean the working directory, the fact that is the default may be confusing and/or frustrating for new users. Example use cases:
Use case A
A user has run an scf calculation using the PwBaseWorkChain (as should be recommended), and wants to run a bands calculation on top. Putting the scfremote_folder as the parent_folder of the bands run won't work, since the default is to clean the working directory and the charge density is (sensibly) not retrieved. The user might even try this, but it will fail because the directory is not there.
Use case B
The user has run a bunch of scf calculations using the PwBaseWorkChain, and now wants to do some manual operations, e.g.:
Explore some of the output files after having learned of verdi calcjob gotocomputer.
Store some non-retrieved files (e.g. charge densities, wave functions) on a persistent partition of the computer.
Since the default is to clean the working directory, these files are lost, and the user might not immediately even understand why.
Conclusion
I think when setting defaults, it's best to err on the side of caution. Unless the user is running a lot of calculations, not cleaning the working directory has little side effects and can be performed manually. So I would set the clean_workdir default to False in the protocols.
The text was updated successfully, but these errors were encountered:
Currently, the default value of
clean_workdir
is set toTrue
for all workflow protocols. I'm wondering if we should useFalse
as the default instead (which is also the default used when specifying theinput
). Although it is often desirable to clean the working directory, the fact that is the default may be confusing and/or frustrating for new users. Example use cases:Use case A
A user has run an
scf
calculation using thePwBaseWorkChain
(as should be recommended), and wants to run abands
calculation on top. Putting thescf
remote_folder
as theparent_folder
of thebands
run won't work, since the default is to clean the working directory and the charge density is (sensibly) not retrieved. The user might even try this, but it will fail because the directory is not there.Use case B
The user has run a bunch of
scf
calculations using thePwBaseWorkChain
, and now wants to do some manual operations, e.g.:verdi calcjob gotocomputer
.Since the default is to clean the working directory, these files are lost, and the user might not immediately even understand why.
Conclusion
I think when setting defaults, it's best to err on the side of caution. Unless the user is running a lot of calculations, not cleaning the working directory has little side effects and can be performed manually. So I would set the
clean_workdir
default toFalse
in the protocols.The text was updated successfully, but these errors were encountered: