Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Error in large parallel runs #171

Closed
jryates opened this issue Mar 23, 2018 · 3 comments
Closed

Error in large parallel runs #171

jryates opened this issue Mar 23, 2018 · 3 comments
Assignees

Comments

@jryates
Copy link
Member

jryates commented Mar 23, 2018

Error reported to developers whereby Wannier90 crashes in wannierise routines when run in parallel for largest (ish) systems

@jryates jryates self-assigned this Mar 23, 2018
@hjunlee
Copy link
Contributor

hjunlee commented Mar 27, 2018

Dear Prof. Jonathan R. Yates:

This has already been discussed in my previous commit:
#120

The main source of this problem is that currently most of the data (large matrices) is not distributed. In my personal version, I worked around this problem partially.

If you want, I will align the part of my modified subroutines to the current version of Wannier90 and submit them.

Sincerely,

Hyungjun Lee
EPFL

@jryates
Copy link
Member Author

jryates commented Mar 27, 2018

I think your earlier comment referred to postw90, and this is the main wannier90 - but I wouldn't be surprised if this was the same issue. I know that I need to take the time to think about how memory is managed within the code.
As well as other changes, it would make a lot of sense to make use of shared memory using something like Ian Bush's FreeIPC.f90
If you have time to update your code to the latest version, I would be happy to take a look.
Thanks!

@hjunlee
Copy link
Contributor

hjunlee commented Mar 27, 2018

Dear Prof. Jonathan R. Yates:

I submitted the part of modifications done in my personal version (#173).
They are related to the distribution of large matrices in W90.
u_matrix and u_matrix_opt are also possible, but I didn't apply in this commit since it is more complex to align it into the recent version of W90 than I expected.

I am not sure whether this can solve the problem you mentioned since I don't know the problem.
But, this can solve some cases which lead to the memory shortage.

PS)

As well as other changes, it would make a lot of sense to make use of shared memory using something like Ian Bush's FreeIPC.f90

It is very interesting! Thank you for the info.

Sincerely,

Hyungjun Lee
EPFL

@jryates jryates closed this as completed in da4b9bd Apr 3, 2018
manxkim pushed a commit to manxkim/wannier90 that referenced this issue Jan 10, 2021
distribution of some large matrices in parallel run.  Fixes wannier-developers#171
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants