#1668: initialize collection stats phase #2185

Merged
merged 8 commits into from
Oct 18, 2023

Conversation

nmm0
Collaborator

@nmm0 nmm0 commented Aug 15, 2023

Fixes #1668

@github-actions

github-actions bot commented Aug 15, 2023

Pipelines results

PR tests (gcc-12, ubuntu, mpich)

Build for 2b39b4a (2023-10-18 13:26:11 UTC)

Compilation - successful

Testing - passed

Build log


PR tests (clang-9, ubuntu, mpich)

Build for 2b39b4a (2023-10-18 13:26:11 UTC)

Compilation - successful

Testing - passed

Build log


PR tests (gcc-9, ubuntu, mpich, zoltan, json schema test)

Build for 2b39b4a (2023-10-18 13:26:11 UTC)

Compilation - successful

Testing - passed

Build log


PR tests (clang-12, ubuntu, mpich)

Build for 2b39b4a (2023-10-18 13:26:11 UTC)

Compilation - successful

Testing - passed

Build log


PR tests (clang-13, ubuntu, mpich)

Build for 2b39b4a (2023-10-18 13:26:11 UTC)

Compilation - successful

Testing - passed

Build log


PR tests (gcc-10, ubuntu, openmpi, no LB)

Build for 2b39b4a (2023-10-18 13:26:11 UTC)

Compilation - successful

Testing - passed

Build log


PR tests (clang-11, ubuntu, mpich)

Build for 2b39b4a (2023-10-18 13:26:11 UTC)

Compilation - successful

Testing - passed

Build log


PR tests (clang-14, ubuntu, mpich)

Build for 2b39b4a (2023-10-18 13:26:11 UTC)

Compilation - successful

Testing - passed

Build log


PR tests (gcc-8, ubuntu, mpich, address sanitizer)

Build for 2b39b4a (2023-10-18 13:26:11 UTC)

Compilation - successful

Testing - passed

Build log


PR tests (clang-10, ubuntu, mpich)

Build for 2b39b4a (2023-10-18 13:26:11 UTC)

Compilation - successful

Testing - passed

Build log


PR tests (gcc-11, ubuntu, mpich, trace runtime, coverage)

Build for 2b39b4a (2023-10-18 13:26:11 UTC)

Compilation - successful

Testing - passed

Build log


PR tests (intel icpc, ubuntu, mpich)

Build for 2b39b4a (2023-10-18 13:26:11 UTC)

Compilation - successful

Testing - passed

Build log


PR tests (nvidia cuda 11.2, gcc-9, ubuntu, mpich)

Build for 2b39b4a (2023-10-18 13:26:11 UTC)

/vt/src/vt/pipe/pipe_manager.impl.h(133): warning: missing return statement at end of non-void function "vt::pipe::PipeManager::makeSend<f,Target>(Target) [with f=&vt::vrt::collection::lb::GreedyLB::collectHandler, Target=vt::objgroup::proxy::ProxyElm<vt::vrt::collection::lb::GreedyLB>]"
          detected during:
            instantiation of "auto vt::pipe::PipeManager::makeSend<f,Target>(Target) [with f=&vt::vrt::collection::lb::GreedyLB::collectHandler, Target=vt::objgroup::proxy::ProxyElm<vt::vrt::collection::lb::GreedyLB>]" 
/vt/src/vt/objgroup/proxy/proxy_objgroup.impl.h(154): here
            instantiation of "vt::objgroup::proxy::Proxy<ObjT>::PendingSendType vt::objgroup::proxy::Proxy<ObjT>::reduce<f,Op,Target,Args...>(Target, Args &&...) const [with ObjT=vt::vrt::collection::lb::GreedyLB, f=&vt::vrt::collection::lb::GreedyLB::collectHandler, Op=vt::collective::PlusOp, Target=vt::objgroup::proxy::ProxyElm<vt::vrt::collection::lb::GreedyLB>, Args=<vt::vrt::collection::lb::GreedyPayload>]" 
/vt/src/vt/vrt/collection/balance/greedylb/greedylb.cc(222): here

/vt/src/vt/pipe/pipe_manager.impl.h(133): warning: missing return statement at end of non-void function "vt::pipe::PipeManager::makeSend<f,Target>(Target) [with f=&MyObj::handler, Target=vt::objgroup::proxy::ProxyElm<MyObj>]"
          detected during instantiation of "auto vt::pipe::PipeManager::makeSend<f,Target>(Target) [with f=&MyObj::handler, Target=vt::objgroup::proxy::ProxyElm<MyObj>]" 
/vt/examples/callback/callback.cc(147): here

/vt/src/vt/pipe/pipe_manager.impl.h(133): warning: missing return statement at end of non-void function "vt::pipe::PipeManager::makeSend<f,Target>(Target) [with f=&colHan, Target=vt::vrt::collection::VrtElmProxy<MyCol, vt::Index1D>]"
          detected during instantiation of "auto vt::pipe::PipeManager::makeSend<f,Target>(Target) [with f=&colHan, Target=vt::vrt::collection::VrtElmProxy<MyCol, vt::Index1D>]" 
/vt/examples/callback/callback.cc(153): here

/vt/src/vt/pipe/pipe_manager.impl.h(133): warning: missing return statement at end of non-void function "vt::pipe::PipeManager::makeSend<f,Target>(Target) [with f=&MyObj::handler, Target=vt::objgroup::proxy::ProxyElm<MyObj>]"
          detected during instantiation of "auto vt::pipe::PipeManager::makeSend<f,Target>(Target) [with f=&MyObj::handler, Target=vt::objgroup::proxy::ProxyElm<MyObj>]" 
/vt/examples/callback/callback.cc(147): here

/vt/src/vt/pipe/pipe_manager.impl.h(133): warning: missing return statement at end of non-void function "vt::pipe::PipeManager::makeSend<f,Target>(Target) [with f=&colHan, Target=vt::vrt::collection::VrtElmProxy<MyCol, vt::Index1D>]"
          detected during instantiation of "auto vt::pipe::PipeManager::makeSend<f,Target>(Target) [with f=&colHan, Target=vt::vrt::collection::VrtElmProxy<MyCol, vt::Index1D>]" 
/vt/examples/callback/callback.cc(153

==> And there is more. Read log. <==

Build log


@nmm0
Collaborator Author

nmm0 commented Sep 21, 2023

There appear to be some issues with this implementation that are not caught by new or existing tests, but show up in one of my application runs. So I'm investigating what the cause is.

@thearusable thearusable force-pushed the 1668-initialize-collection-stats-phase branch from 367b88e to cf8cc15 Compare October 10, 2023 14:20
@thearusable thearusable marked this pull request as ready for review October 10, 2023 14:41
@thearusable
Contributor

> There appear to be some issues with this implementation that are not caught by new or existing tests, but show up in one of my application runs. So I'm investigating what the cause is.

@nmm0 Were you able to identify the issue?

@thearusable thearusable marked this pull request as draft October 10, 2023 16:25
@nmm0
Collaborator Author

nmm0 commented Oct 11, 2023

> @nmm0 Were you able to identify the issue?

Hey @thearusable, sorry, I haven't had a chance; I've been out sick for a little while. I'll be looking into it, but the main thing is that one of the NimbleSM application runs eventually fails with a mismatched phase number. I'm not sure what the exact cause is, but I think it may be an inserted element not getting its phase number set correctly (I did remove that logic from the insert function, but thought it would be covered by the other function it got moved to).
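For illustration only, the suspected failure mode can be sketched with a toy model. This is not code from this PR or from vt: `Collection`, `Element`, `insert`, `nextPhase`, and `insertLateAndCheck` are all hypothetical names, and the sketch simply assumes each element keeps its own phase counter that must agree with a collection-wide counter.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical stand-ins for illustration; none of these names come from vt.
using Phase = std::size_t;

struct Element {
  Phase phase = 0;  // the element's own view of the current phase
};

struct Collection {
  Phase current_phase = 0;
  std::vector<Element> elements;

  // Insert a new element. If sync_phase is false, the element keeps its
  // default phase of 0, which models the suspected bug.
  void insert(bool sync_phase) {
    Element e;
    if (sync_phase) {
      e.phase = current_phase;
    }
    elements.push_back(e);
  }

  // Advance the collection and every existing element by one phase.
  void nextPhase() {
    ++current_phase;
    for (auto& e : elements) {
      ++e.phase;
    }
  }

  // True when every element agrees with the collection-wide phase.
  bool phasesConsistent() const {
    for (auto const& e : elements) {
      if (e.phase != current_phase) {
        return false;
      }
    }
    return true;
  }
};

// Insert one element at phase 0, advance two phases, insert a second
// element mid-run, advance once more, and report whether phases agree.
inline bool insertLateAndCheck(bool sync_phase) {
  Collection c;
  c.insert(true);
  c.nextPhase();
  c.nextPhase();
  c.insert(sync_phase);
  c.nextPhase();
  return c.phasesConsistent();
}
```

In this toy model, `insertLateAndCheck(true)` stays consistent, while `insertLateAndCheck(false)` leaves the late element two phases behind. The mismatch only appears when an element is inserted after the first phase, which may be one reason a test that inserts everything up front would not catch it.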

@thearusable thearusable self-assigned this Oct 18, 2023
@thearusable thearusable marked this pull request as ready for review October 18, 2023 13:25
@thearusable thearusable force-pushed the 1668-initialize-collection-stats-phase branch from 3177b3d to 2b39b4a Compare October 18, 2023 13:26
@lifflander lifflander merged commit 6833a5f into develop Oct 18, 2023
Development

Successfully merging this pull request may close these issues.

collection stats phase not initialized correctly
3 participants