memory available in computer nodes #4297

cpignedoli · 2020-08-10T16:42:50Z

in apps/workchains where teh number of computer nodes to be used for a calculation
is determined automatically, it could be useful to know the memory available per node

add memory per node or memory per core in the definition of a new computer

greschd · 2020-08-10T17:03:33Z

One complication here are heterogeneous clusters: memory per node is not always a constant across a single computer.

cpignedoli · 2020-08-10T17:11:00Z

Dear Dominik you are right. Despite in many cases nodes belong to different “queues” according to the characteristics of the node, this cannot be assumed as general case. The only workaround I see to this problem that I did not have in mind is then to define, for a computer, max and min possible memory per node. Cheers Carlo

…

On 10 Aug 2020, at 19:03, Dominik Gresch ***@***.***> wrote: One complication here are heterogeneous clusters: memory per node is not always a constant across a single computer. — You are receiving this because you authored the thread. Reply to this email directly, view it on GitHub <#4297 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/AFPEIONZLCBLWDNDIPUGBUDSAAR7HANCNFSM4P2ERDFA>.

greschd · 2020-08-10T17:32:04Z

Yeah, there might absolutely be value in allowing for this even if we don't have a complete solution for the heterogeneous case. For "num_mpiprocs_per_machine" we can also currently only set one default.

I think there are two separate but related issues here:

How to specify resources for the heterogeneous case. It probably makes sense to solve this is a consistent way for memory + CPU. The queue seems like the most straightforward way to distinguish them, but I don't know if there might be cases where even a single queue can have different nodes.
Adding memory per node: While the previous point is unresolved, a single value per computer still seems valuable. We should just make sure we don't make it harder than necessary to later extend the functionality.

This is a temporary fix that let's correctly estimate the number of nodes for the CSCS supercomputers. The issue here is different nodes have more or less the same amount of RAM, so the number of CPUs can't be used as a criterion to select the number of nodes. Once the following issue is fixed: aiidateam/aiida-core#4297, we can implement a better mechanism to auto-select the number of nodes. Co-authored-by: Carlo Pignedoli <carlo.pignedoli@empa.ch>

cpignedoli added the type/feature request status undecided label Aug 10, 2020

greschd added the topic/computers label Aug 12, 2020

yakutovicha mentioned this issue Aug 23, 2020

Fix nnodes estimation. nanotech-empa/aiida-nanotech-empa#5

Merged

yakutovicha closed this as completed in nanotech-empa/aiida-nanotech-empa#5 Aug 23, 2020

yakutovicha reopened this Aug 23, 2020

yakutovicha added this to the v2.0.0 milestone May 5, 2021

ramirezfranciscof added the priority/nice-to-have label Aug 10, 2021

sphuber modified the milestones: v2.0.0, Post 2.0 Sep 22, 2021

yakutovicha mentioned this issue Dec 8, 2021

Add default_memory_per_machine attribute to Computer. #5260

Merged

yakutovicha self-assigned this Dec 8, 2021

sphuber closed this as completed in #5260 Dec 15, 2021

sphuber modified the milestones: Post 2.0, v2.0.0 Dec 15, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

memory available in computer nodes #4297

memory available in computer nodes #4297

cpignedoli commented Aug 10, 2020

greschd commented Aug 10, 2020

cpignedoli commented Aug 10, 2020 via email

greschd commented Aug 10, 2020

memory available in computer nodes #4297

memory available in computer nodes #4297

Comments

cpignedoli commented Aug 10, 2020

greschd commented Aug 10, 2020

cpignedoli commented Aug 10, 2020 via email

greschd commented Aug 10, 2020