Defect: Improve the caching policy when the RemoteData is cleaned up #13

Open
unkcpz opened this issue Mar 7, 2023 · 0 comments
Labels
roadmap/proposed A roadmap item that has been proposed but not yet processed

Motivation

I ran my calculations on Eiger, which has a limited remote file quota of 50K files per user. My calculations generate many small files very quickly, so just five work chains are enough to exhaust the quota. To submit more work chains, I need to use the clearworkdir function provided by AiiDA to clear the remote folder dynamically once a calculation has finished.

This approach works well as long as caching is disabled. With caching enabled, however, workflows fail unexpectedly.

Desired Outcome

Users can enable caching and clean the remote work folders of their calculations whenever they want, without making elaborate changes to their workflow logic to work around the issue.

Impact

Users who clean the remote work directory in their workflows while caching is enabled will benefit from this improvement.

Complexity

The following issues need to be solved:

  • Shallow copy on RemoteData.clone()
  • Invalidate cache after RemoteData.clean()
  • Hash calculation of RemoteData
  • Prospective workchain caching
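The second item can be illustrated with a toy model. This is only a sketch in plain Python, not AiiDA code: `ToyRemoteFolder`, `ToyCalc`, `ToyCache` and `clean_and_invalidate` are hypothetical names invented for the illustration, and the `is_valid_cache` flag merely mirrors the name of the attribute mentioned under Progress. It shows how a cached calculation whose remote folder was cleaned can still be returned as a cache hit, and how invalidating it at cleaning time would avoid that.

```python
from dataclasses import dataclass, field
from typing import Dict, Optional


@dataclass
class ToyRemoteFolder:
    """Hypothetical stand-in for a remote work directory (not RemoteData)."""
    files: Dict[str, str] = field(default_factory=dict)

    def clean(self) -> None:
        # Remove every file, as a cleanup of the remote folder would.
        self.files.clear()


@dataclass
class ToyCalc:
    """Hypothetical stand-in for a finished calculation."""
    input_hash: str
    remote: ToyRemoteFolder
    is_valid_cache: bool = True


class ToyCache:
    """Hypothetical cache that looks up finished calculations by input hash."""

    def __init__(self) -> None:
        self._by_hash: Dict[str, ToyCalc] = {}

    def store(self, calc: ToyCalc) -> None:
        self._by_hash[calc.input_hash] = calc

    def lookup(self, input_hash: str) -> Optional[ToyCalc]:
        calc = self._by_hash.get(input_hash)
        # Only calculations still flagged as valid count as cache hits.
        if calc is None or not calc.is_valid_cache:
            return None
        return calc


def clean_and_invalidate(calc: ToyCalc) -> None:
    """Clean the remote folder and stop the calculation from being reused."""
    calc.remote.clean()
    calc.is_valid_cache = False
```

In this model, calling `calc.remote.clean()` alone leaves `lookup` returning a calculation whose folder is empty, so anything that expects the remote files would fail. Calling `clean_and_invalidate(calc)` makes `lookup` return `None`, so the calculation would be rerun instead.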

By design, only calcjobs are cached; caching of workflows is not allowed. What we actually want is to allow work chains to be cached at some point. If the design needs to change, about one month of work is required to tackle the problem properly.

Progress

The AEP is proposed in aiidateam/AEP#35, which is still a draft because more details need to be discussed.

aiida-core 2.x supports controlling caching for a single process by setting the attribute Node.is_valid_cache, which was introduced in aiidateam/aiida-core#5207.
