You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
What version of Open MPI are you using? (e.g., v3.0.5, v4.0.2, git branch name and hash, etc.)
OpenMPI main branch
Describe how Open MPI was installed (e.g., from a source/distribution tarball, from a git clone, from an operating system distribution package, etc.)
Built from current main branch (3/22/22)
If you are building/installing from a git clone, please copy-n-paste the output from git submodule status.
git submodule status
1b86a35db2816ee9c0f3a41988005a2ba7d29adb 3rd-party/openpmix (v1.1.3-3481-g1b86a35d)
91f791e209ccbdfb4b8647900d292ef51d52f37d 3rd-party/prrte (psrvr-v2.0.0rc1-4319-g91f791e2)
Please describe the system on which you are running
Operating system/version:
RHEL 8.4
Computer hardware:
Single Power8 node
Network type:
Localhost
Details of the problem
I ran the set of self-checking tests from ompi-tests-public/collective-big-count with collective components specified as --mca coll ftagree,basic,sm,self,inter,libnbc
The following testcases had failures. The remaining testcases were successful:
test_allgather_uniform_count
test-alltoall-uniform_count
test-gather-uniform-count
test-scatter-uniform-count
The tests were compiled by running make in the directory containing the source files
The following environment variables were set for all tests:
These errors look like the same errors I saw with the tuned collective component (#10190). I don't know if these are failures are in common code or whether they are triggered by same or similar problems in the different component's code
This command fails with a self-check error message followed by a SIGSEGV in MPI_Wait
Thank you for taking the time to submit an issue!
Background information
What version of Open MPI are you using? (e.g., v3.0.5, v4.0.2, git branch name and hash, etc.)
OpenMPI main branch
Describe how Open MPI was installed (e.g., from a source/distribution tarball, from a git clone, from an operating system distribution package, etc.)
Built from current main branch (3/22/22)
If you are building/installing from a git clone, please copy-n-paste the output from
git submodule status
.Please describe the system on which you are running
Details of the problem
I ran the set of self-checking tests from ompi-tests-public/collective-big-count with collective components specified as --mca coll ftagree,basic,sm,self,inter,libnbc
The following testcases had failures. The remaining testcases were successful:
The tests were compiled by running make in the directory containing the source files
The following environment variables were set for all tests:
These errors look like the same errors I saw with the tuned collective component (#10190). I don't know if these are failures are in common code or whether they are triggered by same or similar problems in the different component's code
This command fails with a self-check error message followed by a SIGSEGV in MPI_Wait
This is the self-check error message and traceback
This command fails with a self-check error message followed by a SIGSEGV in MPI_Wait
This is the self-check error message and traceback
This command fails with a self-check error message followed by a double free or storage
corruption.
This is the self-check error message and the traceback
This is the self-check error message
The text was updated successfully, but these errors were encountered: