Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

segfaults in ThreadSafeLogMessageLoggerScribe on slc7_ppc64le_gcc9 #33636

Open
dan131riley opened this issue May 5, 2021 · 36 comments
Open

segfaults in ThreadSafeLogMessageLoggerScribe on slc7_ppc64le_gcc9 #33636

dan131riley opened this issue May 5, 2021 · 36 comments

Comments

@dan131riley
Copy link

@gartung following the update to TBB 2021.2.0 in #33474 and cms-sw/cmsdist#6792 we're seeing what looks like a race condition in the message logger on ppc:

A fatal system signal has occurred: segmentation violation
The following is the call stack containing the origin of the signal.

Wed May  5 11:04:16 CEST 2021
TThread 5 (Thread 0x100264b88390 (LWP 128681)):
#0  0x0000100002b4eb88 in nanosleep () from /lib64/libc.so.6
#1  0x0000100002b4e8bc in sleep () from /lib64/libc.so.6
#2  0x000010000a3629ac in sig_pause_for_stacktrace () from /cvmfs/cms-ib.cern.ch/nweek-02679/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-04-2300/lib/slc7_ppc64le_gcc9/pluginFWCoreServicesPlugins.so
#3  <signal handler called>
#4  operator delete (ptr=0x1003358c9a80) at src/jemalloc_cpp.cpp:106
#5  0x000010000030ad8c in std::_Hashtable<unsigned int, unsigned int, std::allocator<unsigned int>, std::__detail::_Identity, std::equal_to<unsigned int>, std::hash<unsigned int>, std::__detail::_Mod_range_hashing, std::__detail::_Default_ranged_hash, std::__detail::_Prime_rehash_policy, std::__detail::_Hashtable_traits<false, true, true> >::~_Hashtable() () from /cvmfs/cms-ib.cern.ch/nweek-02679/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-04-2300/lib/slc7_ppc64le_gcc9/libFWCoreFramework.so
#6  0x00001002efcdc380 in HGCalTriggerGeometryV9Imp2::getModulePosition(unsigned int) const () from /cvmfs/cms-ib.cern.ch/nweek-02679/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-04-2300/lib/slc7_ppc64le_gcc9/pluginL1TriggerL1THGCalPlugins_geometries.so
#7  0x000010022a3ac638 in HGCalConcentratorTrigSumImpl::doSum(unsigned int, std::vector<l1t::HGCalTriggerCell, std::allocator<l1t::HGCalTriggerCell> > const&, std::vector<l1t::HGCalTriggerSums, std::allocator<l1t::HGCalTriggerSums> >&) const () from /cvmfs/cms-ib.cern.ch/nweek-02679/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-04-2300/lib/slc7_ppc64le_gcc9/libL1TriggerL1THGCal.so
#8  0x0000100285e69af0 in HGCalConcentratorProcessorSelection::run(edm::Handle<BXVector<l1t::HGCalTriggerCell> > const&, std::pair<BXVector<l1t::HGCalTriggerCell>, BXVector<l1t::HGCalTriggerSums> >&, edm::EventSetup const&) () from /cvmfs/cms-ib.cern.ch/nweek-02679/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-04-2300/lib/slc7_ppc64le_gcc9/pluginL1TriggerL1THGCalPlugins_fe_be.so
#9  0x000010022a2d8a84 in HGCalConcentratorProducer::produce(edm::Event&, edm::EventSetup const&) () from /cvmfs/cms-ib.cern.ch/nweek-02679/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-04-2300/lib/slc7_ppc64le_gcc9/pluginL1TriggerL1THGCalPlugins.so
#10 0x0000100000427fb8 in edm::stream::EDProducerAdaptorBase::doEvent(edm::EventTransitionInfo const&, edm::ActivityRegistry*, edm::ModuleCallingContext const*) () from /cvmfs/cms-ib.cern.ch/nweek-02679/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-04-2300/lib/slc7_ppc64le_gcc9/libFWCoreFramework.so
#11 0x00001000003ef814 in edm::WorkerT<edm::stream::EDProducerAdaptorBase>::implDo(edm::EventTransitionInfo const&, edm::ModuleCallingContext const*) () from /cvmfs/cms-ib.cern.ch/nweek-02679/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-04-2300/lib/slc7_ppc64le_gcc9/libFWCoreFramework.so
#12 0x0000100000300558 in decltype ({parm#1}()) edm::convertException::wrap<edm::Worker::runModule<edm::OccurrenceTraits<edm::EventPrincipal, (edm::BranchActionType)1> >(edm::OccurrenceTraits<edm::EventPrincipal, (edm::BranchActionType)1>::TransitionInfoType const&, edm::StreamID, edm::ParentContext const&, edm::OccurrenceTraits<edm::EventPrincipal, (edm::BranchActionType)1>::Context const*)::{lambda()#1}>(edm::Worker::runModule<edm::OccurrenceTraits<edm::EventPrincipal, (edm::BranchActionType)1> >(edm::OccurrenceTraits<edm::EventPrincipal, (edm::BranchActionType)1>::TransitionInfoType const&, edm::StreamID, edm::ParentContext const&, edm::OccurrenceTraits<edm::EventPrincipal, (edm::BranchActionType)1>::Context const*)::{lambda()#1}) () from /cvmfs/cms-ib.cern.ch/nweek-02679/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-04-2300/lib/slc7_ppc64le_gcc9/libFWCoreFramework.so
#13 0x00001000003007e0 in bool edm::Worker::runModule<edm::OccurrenceTraits<edm::EventPrincipal, (edm::BranchActionType)1> >(edm::OccurrenceTraits<edm::EventPrincipal, (edm::BranchActionType)1>::TransitionInfoType const&, edm::StreamID, edm::ParentContext const&, edm::OccurrenceTraits<edm::EventPrincipal, (edm::BranchActionType)1>::Context const*) () from /cvmfs/cms-ib.cern.ch/nweek-02679/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-04-2300/lib/slc7_ppc64le_gcc9/libFWCoreFramework.so
#14 0x0000100000300c58 in std::__exception_ptr::exception_ptr edm::Worker::runModuleAfterAsyncPrefetch<edm::OccurrenceTraits<edm::EventPrincipal, (edm::BranchActionType)1> >(std::__exception_ptr::exception_ptr const*, edm::OccurrenceTraits<edm::EventPrincipal, (edm::BranchActionType)1>::TransitionInfoType const&, edm::StreamID, edm::ParentContext const&, edm::OccurrenceTraits<edm::EventPrincipal, (edm::BranchActionType)1>::Context const*) () from /cvmfs/cms-ib.cern.ch/nweek-02679/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-04-2300/lib/slc7_ppc64le_gcc9/libFWCoreFramework.so
#15 0x0000100000303470 in edm::Worker::RunModuleTask<edm::OccurrenceTraits<edm::EventPrincipal, (edm::BranchActionType)1> >::execute() () from /cvmfs/cms-ib.cern.ch/nweek-02679/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-04-2300/lib/slc7_ppc64le_gcc9/libFWCoreFramework.so
#16 0x0000100000a0961c in tbb::detail::d1::function_task<edm::WaitingTaskList::announce()::{lambda()#1}>::execute(tbb::detail::d1::execution_data&) () from /cvmfs/cms-ib.cern.ch/nweek-02679/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-04-2300/lib/slc7_ppc64le_gcc9/libFWCoreConcurrency.so
#17 0x00001000023a6924 in tbb::detail::r1::task_dispatcher::local_wait_for_all<false, tbb::detail::r1::outermost_worker_waiter> (this=0x1000037bff00, t=0x1000036e7800, waiter=...) at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/BUILD/slc7_ppc64le_gcc9/external/tbb/v2021.2.0-753991d8cffbab375ded80d936d79020/tbb-v2021.2.0/src/tbb/../../include/oneapi/tbb/detail/_machine.h:356
#18 0x00001000023a12a0 in tbb::detail::r1::task_dispatcher::local_wait_for_all<tbb::detail::r1::outermost_worker_waiter> (waiter=..., t=0x0, this=<optimized out>) at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/BUILD/slc7_ppc64le_gcc9/external/tbb/v2021.2.0-753991d8cffbab375ded80d936d79020/tbb-v2021.2.0/src/tbb/governor.h:147
#19 tbb::detail::r1::arena::process (this=0x1000037bf780, tls=...) at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/BUILD/slc7_ppc64le_gcc9/external/tbb/v2021.2.0-753991d8cffbab375ded80d936d79020/tbb-v2021.2.0/src/tbb/arena.cpp:133
#20 0x00001000023b79e4 in tbb::detail::r1::market::process (this=0x1000037bab00, j=...) at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/BUILD/slc7_ppc64le_gcc9/external/tbb/v2021.2.0-753991d8cffbab375ded80d936d79020/tbb-v2021.2.0/src/tbb/market.cpp:593
#21 0x00001000023bde9c in tbb::detail::r1::rml::private_worker::run (this=0x1000097d0000) at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/BUILD/slc7_ppc64le_gcc9/external/tbb/v2021.2.0-753991d8cffbab375ded80d936d79020/tbb-v2021.2.0/src/tbb/private_server.cpp:266
#22 0x00001000023be248 in tbb::detail::r1::rml::private_worker::thread_routine (arg=<optimized out>) at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/BUILD/slc7_ppc64le_gcc9/external/tbb/v2021.2.0-753991d8cffbab375ded80d936d79020/tbb-v2021.2.0/src/tbb/private_server.cpp:220
#23 0x0000100002a38cd4 in start_thread () from /lib64/libpthread.so.0
#24 0x0000100002b97f14 in clone () from /lib64/libc.so.6

Thread 4 (Thread 0x100264178390 (LWP 128680)):
#0  0x0000100002b4eb88 in nanosleep () from /lib64/libc.so.6
#1  0x0000100002b4e8bc in sleep () from /lib64/libc.so.6
#2  0x000010000a3629ac in sig_pause_for_stacktrace () from /cvmfs/cms-ib.cern.ch/nweek-02679/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-04-2300/lib/slc7_ppc64le_gcc9/pluginFWCoreServicesPlugins.so
#3  <signal handler called>
#4  operator delete (ptr=0x1002f3fa8860) at src/jemalloc_cpp.cpp:106
#5  0x00001000008bbfa8 in std::_Sp_counted_deleter<edm::ErrorObj*, edm::MessageSender::ErrorObjDeleter, std::allocator<void>, (__gnu_cxx::_Lock_policy)2>::_M_destroy() () from /cvmfs/cms-ib.cern.ch/nweek-02679/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-04-2300/lib/slc7_ppc64le_gcc9/libFWCoreMessageLogger.so
#6  0x00001000008b6804 in edm::MessageSender::~MessageSender() () from /cvmfs/cms-ib.cern.ch/nweek-02679/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-04-2300/lib/slc7_ppc64le_gcc9/libFWCoreMessageLogger.so
#7  0x00001002259ffb08 in HGCalDDDConstants::isValidCell8(int, int, int, int, int, int) const () from /cvmfs/cms-ib.cern.ch/nweek-02679/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-04-2300/lib/slc7_ppc64le_gcc9/libGeometryHGCalCommonData.so
#8  0x0000100225a0069c in HGCalDDDConstants::isValidHex8(int, int, int, int, int, bool) const () from /cvmfs/cms-ib.cern.ch/nweek-02679/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-04-2300/lib/slc7_ppc64le_gcc9/libGeometryHGCalCommonData.so
#9  0x0000100225519488 in HGCalTopology::valid(DetId const&) const () from /cvmfs/cms-ib.cern.ch/nweek-02679/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-04-2300/lib/slc7_ppc64le_gcc9/libGeometryCaloTopology.so
#10 0x00001002efcd8608 in HGCalTriggerGeometryV9Imp2::validCellId(unsigned int, unsigned int) const () from /cvmfs/cms-ib.cern.ch/nweek-02679/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-04-2300/lib/slc7_ppc64le_gcc9/pluginL1TriggerL1THGCalPlugins_geometries.so
#11 0x00001002efcdae3c in HGCalTriggerGeometryV9Imp2::validTriggerCellFromCells(unsigned int) const () from /cvmfs/cms-ib.cern.ch/nweek-02679/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-04-2300/lib/slc7_ppc64le_gcc9/pluginL1TriggerL1THGCalPlugins_geometries.so
#12 0x00001002efcdb9f8 in HGCalTriggerGeometryV9Imp2::getTriggerCellsFromModule(unsigned int) const () from /cvmfs/cms-ib.cern.ch/nweek-02679/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-04-2300/lib/slc7_ppc64le_gcc9/pluginL1TriggerL1THGCalPlugins_geometries.so
#13 0x00001002efcdbfb8 in HGCalTriggerGeometryV9Imp2::getCellsFromModule(unsigned int) const () from /cvmfs/cms-ib.cern.ch/nweek-02679/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-04-2300/lib/slc7_ppc64le_gcc9/pluginL1TriggerL1THGCalPlugins_geometries.so
#14 0x00001002efcdc22c in HGCalTriggerGeometryV9Imp2::getModulePosition(unsigned int) const () from /cvmfs/cms-ib.cern.ch/nweek-02679/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-04-2300/lib/slc7_ppc64le_gcc9/pluginL1TriggerL1THGCalPlugins_geometries.so
#15 0x000010022a3ac638 in HGCalConcentratorTrigSumImpl::doSum(unsigned int, std::vector<l1t::HGCalTriggerCell, std::allocator<l1t::HGCalTriggerCell> > const&, std::vector<l1t::HGCalTriggerSums, std::allocator<l1t::HGCalTriggerSums> >&) const () from /cvmfs/cms-ib.cern.ch/nweek-02679/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-04-2300/lib/slc7_ppc64le_gcc9/libL1TriggerL1THGCal.so
#16 0x0000100285e69af0 in HGCalConcentratorProcessorSelection::run(edm::Handle<BXVector<l1t::HGCalTriggerCell> > const&, std::pair<BXVector<l1t::HGCalTriggerCell>, BXVector<l1t::HGCalTriggerSums> >&, edm::EventSetup const&) () from /cvmfs/cms-ib.cern.ch/nweek-02679/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-04-2300/lib/slc7_ppc64le_gcc9/pluginL1TriggerL1THGCalPlugins_fe_be.so
#17 0x000010022a2d8a84 in HGCalConcentratorProducer::produce(edm::Event&, edm::EventSetup const&) () from /cvmfs/cms-ib.cern.ch/nweek-02679/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-04-2300/lib/slc7_ppc64le_gcc9/pluginL1TriggerL1THGCalPlugins.so
#18 0x0000100000427fb8 in edm::stream::EDProducerAdaptorBase::doEvent(edm::EventTransitionInfo const&, edm::ActivityRegistry*, edm::ModuleCallingContext const*) () from /cvmfs/cms-ib.cern.ch/nweek-02679/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-04-2300/lib/slc7_ppc64le_gcc9/libFWCoreFramework.so
#19 0x00001000003ef814 in edm::WorkerT<edm::stream::EDProducerAdaptorBase>::implDo(edm::EventTransitionInfo const&, edm::ModuleCallingContext const*) () from /cvmfs/cms-ib.cern.ch/nweek-02679/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-04-2300/lib/slc7_ppc64le_gcc9/libFWCoreFramework.so
#20 0x0000100000300558 in decltype ({parm#1}()) edm::convertException::wrap<edm::Worker::runModule<edm::OccurrenceTraits<edm::EventPrincipal, (edm::BranchActionType)1> >(edm::OccurrenceTraits<edm::EventPrincipal, (edm::BranchActionType)1>::TransitionInfoType const&, edm::StreamID, edm::ParentContext const&, edm::OccurrenceTraits<edm::EventPrincipal, (edm::BranchActionType)1>::Context const*)::{lambda()#1}>(edm::Worker::runModule<edm::OccurrenceTraits<edm::EventPrincipal, (edm::BranchActionType)1> >(edm::OccurrenceTraits<edm::EventPrincipal, (edm::BranchActionType)1>::TransitionInfoType const&, edm::StreamID, edm::ParentContext const&, edm::OccurrenceTraits<edm::EventPrincipal, (edm::BranchActionType)1>::Context const*)::{lambda()#1}) () from /cvmfs/cms-ib.cern.ch/nweek-02679/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-04-2300/lib/slc7_ppc64le_gcc9/libFWCoreFramework.so
#21 0x00001000003007e0 in bool edm::Worker::runModule<edm::OccurrenceTraits<edm::EventPrincipal, (edm::BranchActionType)1> >(edm::OccurrenceTraits<edm::EventPrincipal, (edm::BranchActionType)1>::TransitionInfoType const&, edm::StreamID, edm::ParentContext const&, edm::OccurrenceTraits<edm::EventPrincipal, (edm::BranchActionType)1>::Context const*) () from /cvmfs/cms-ib.cern.ch/nweek-02679/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-04-2300/lib/slc7_ppc64le_gcc9/libFWCoreFramework.so
#22 0x0000100000300c58 in std::__exception_ptr::exception_ptr edm::Worker::runModuleAfterAsyncPrefetch<edm::OccurrenceTraits<edm::EventPrincipal, (edm::BranchActionType)1> >(std::__exception_ptr::exception_ptr const*, edm::OccurrenceTraits<edm::EventPrincipal, (edm::BranchActionType)1>::TransitionInfoType const&, edm::StreamID, edm::ParentContext const&, edm::OccurrenceTraits<edm::EventPrincipal, (edm::BranchActionType)1>::Context const*) () from /cvmfs/cms-ib.cern.ch/nweek-02679/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-04-2300/lib/slc7_ppc64le_gcc9/libFWCoreFramework.so
#23 0x0000100000303470 in edm::Worker::RunModuleTask<edm::OccurrenceTraits<edm::EventPrincipal, (edm::BranchActionType)1> >::execute() () from /cvmfs/cms-ib.cern.ch/nweek-02679/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-04-2300/lib/slc7_ppc64le_gcc9/libFWCoreFramework.so
#24 0x0000100000a0961c in tbb::detail::d1::function_task<edm::WaitingTaskList::announce()::{lambda()#1}>::execute(tbb::detail::d1::execution_data&) () from /cvmfs/cms-ib.cern.ch/nweek-02679/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-04-2300/lib/slc7_ppc64le_gcc9/libFWCoreConcurrency.so
#25 0x00001000023a6924 in tbb::detail::r1::task_dispatcher::local_wait_for_all<false, tbb::detail::r1::outermost_worker_waiter> (this=0x1000037bfe00, t=0x100003705200, waiter=...) at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/BUILD/slc7_ppc64le_gcc9/external/tbb/v2021.2.0-753991d8cffbab375ded80d936d79020/tbb-v2021.2.0/src/tbb/../../include/oneapi/tbb/detail/_machine.h:356
#26 0x00001000023a12a0 in tbb::detail::r1::task_dispatcher::local_wait_for_all<tbb::detail::r1::outermost_worker_waiter> (waiter=..., t=0x0, this=<optimized out>) at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/BUILD/slc7_ppc64le_gcc9/external/tbb/v2021.2.0-753991d8cffbab375ded80d936d79020/tbb-v2021.2.0/src/tbb/governor.h:147
#27 tbb::detail::r1::arena::process (this=0x1000037bf780, tls=...) at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/BUILD/slc7_ppc64le_gcc9/external/tbb/v2021.2.0-753991d8cffbab375ded80d936d79020/tbb-v2021.2.0/src/tbb/arena.cpp:133
#28 0x00001000023b79e4 in tbb::detail::r1::market::process (this=0x1000037bab00, j=...) at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/BUILD/slc7_ppc64le_gcc9/external/tbb/v2021.2.0-753991d8cffbab375ded80d936d79020/tbb-v2021.2.0/src/tbb/market.cpp:593
#29 0x00001000023bde9c in tbb::detail::r1::rml::private_worker::run (this=0x1000097d0100) at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/BUILD/slc7_ppc64le_gcc9/external/tbb/v2021.2.0-753991d8cffbab375ded80d936d79020/tbb-v2021.2.0/src/tbb/private_server.cpp:266
#30 0x00001000023be248 in tbb::detail::r1::rml::private_worker::thread_routine (arg=<optimized out>) at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/BUILD/slc7_ppc64le_gcc9/external/tbb/v2021.2.0-753991d8cffbab375ded80d936d79020/tbb-v2021.2.0/src/tbb/private_server.cpp:220
#31 0x0000100002a38cd4 in start_thread () from /lib64/libpthread.so.0
#32 0x0000100002b97f14 in clone () from /lib64/libc.so.6

Thread 3 (Thread 0x100263768390 (LWP 128679)):
#0  0x0000100002b86df8 in poll () from /lib64/libc.so.6
#1  0x000010000a363acc in full_read.constprop () from /cvmfs/cms-ib.cern.ch/nweek-02679/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-04-2300/lib/slc7_ppc64le_gcc9/pluginFWCoreServicesPlugins.so
#2  0x000010000a364918 in edm::service::InitRootHandlers::stacktraceFromThread() () from /cvmfs/cms-ib.cern.ch/nweek-02679/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-04-2300/lib/slc7_ppc64le_gcc9/pluginFWCoreServicesPlugins.so
#3  0x000010000a368a0c in sig_dostack_then_abort () from /cvmfs/cms-ib.cern.ch/nweek-02679/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-04-2300/lib/slc7_ppc64le_gcc9/pluginFWCoreServicesPlugins.so
#4  <signal handler called>
#5  0x000010000387bf68 in edm::service::ThreadSafeLogMessageLoggerScribe::log(edm::ErrorObj*) () from /cvmfs/cms-ib.cern.ch/nweek-02679/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-04-2300/lib/slc7_ppc64le_gcc9/libFWCoreMessageService.so
#6  0x0000100003886efc in edm::service::ThreadSafeLogMessageLoggerScribe::runCommand(edm::MessageLoggerQ::OpCode, void*) () from /cvmfs/cms-ib.cern.ch/nweek-02679/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-04-2300/lib/slc7_ppc64le_gcc9/libFWCoreMessageService.so
#7  0x00001000008b496c in edm::MessageLoggerQ::simpleCommand(edm::MessageLoggerQ::OpCode, void*) () from /cvmfs/cms-ib.cern.ch/nweek-02679/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-04-2300/lib/slc7_ppc64le_gcc9/libFWCoreMessageLogger.so
#8  0x00001000008b4bc0 in edm::MessageLoggerQ::MLqLOG(edm::ErrorObj*) () from /cvmfs/cms-ib.cern.ch/nweek-02679/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-04-2300/lib/slc7_ppc64le_gcc9/libFWCoreMessageLogger.so
#9  0x00001000008b8508 in edm::MessageSender::ErrorObjDeleter::operator()(edm::ErrorObj*) () from /cvmfs/cms-ib.cern.ch/nweek-02679/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-04-2300/lib/slc7_ppc64le_gcc9/libFWCoreMessageLogger.so
#10 0x00001000008bc650 in std::_Sp_counted_deleter<edm::ErrorObj*, edm::MessageSender::ErrorObjDeleter, std::allocator<void>, (__gnu_cxx::_Lock_policy)2>::_M_dispose() () from /cvmfs/cms-ib.cern.ch/nweek-02679/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-04-2300/lib/slc7_ppc64le_gcc9/libFWCoreMessageLogger.so
#11 0x00001000008b6794 in edm::MessageSender::~MessageSender() () from /cvmfs/cms-ib.cern.ch/nweek-02679/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-04-2300/lib/slc7_ppc64le_gcc9/libFWCoreMessageLogger.so
#12 0x00001002259ffb08 in HGCalDDDConstants::isValidCell8(int, int, int, int, int, int) const () from /cvmfs/cms-ib.cern.ch/nweek-02679/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-04-2300/lib/slc7_ppc64le_gcc9/libGeometryHGCalCommonData.so
#13 0x0000100225a0069c in HGCalDDDConstants::isValidHex8(int, int, int, int, int, bool) const () from /cvmfs/cms-ib.cern.ch/nweek-02679/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-04-2300/lib/slc7_ppc64le_gcc9/libGeometryHGCalCommonData.so
#14 0x0000100225519488 in HGCalTopology::valid(DetId const&) const () from /cvmfs/cms-ib.cern.ch/nweek-02679/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-04-2300/lib/slc7_ppc64le_gcc9/libGeometryCaloTopology.so
#15 0x00001002efcd835c in HGCalTriggerGeometryV9Imp2::validCell(unsigned int) const () from /cvmfs/cms-ib.cern.ch/nweek-02679/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-04-2300/lib/slc7_ppc64le_gcc9/pluginL1TriggerL1THGCalPlugins_geometries.so
#16 0x0000100285e652f8 in HGCalVFEProcessorSums::run(edm::SortedCollection<HGCDataFrame<DetId, HGCSample>, edm::StrictWeakOrdering<HGCDataFrame<DetId, HGCSample> > > const&, BXVector<l1t::HGCalTriggerCell>&, edm::EventSetup const&) () from /cvmfs/cms-ib.cern.ch/nweek-02679/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-04-2300/lib/slc7_ppc64le_gcc9/pluginL1TriggerL1THGCalPlugins_fe_be.so
#17 0x000010022a2f728c in HGCalVFEProducer::produce(edm::Event&, edm::EventSetup const&) () from /cvmfs/cms-ib.cern.ch/nweek-02679/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-04-2300/lib/slc7_ppc64le_gcc9/pluginL1TriggerL1THGCalPlugins.so
#18 0x0000100000427fb8 in edm::stream::EDProducerAdaptorBase::doEvent(edm::EventTransitionInfo const&, edm::ActivityRegistry*, edm::ModuleCallingContext const*) () from /cvmfs/cms-ib.cern.ch/nweek-02679/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-04-2300/lib/slc7_ppc64le_gcc9/libFWCoreFramework.so
#19 0x00001000003ef814 in edm::WorkerT<edm::stream::EDProducerAdaptorBase>::implDo(edm::EventTransitionInfo const&, edm::ModuleCallingContext const*) () from /cvmfs/cms-ib.cern.ch/nweek-02679/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-04-2300/lib/slc7_ppc64le_gcc9/libFWCoreFramework.so
#20 0x0000100000300558 in decltype ({parm#1}()) edm::convertException::wrap<edm::Worker::runModule<edm::OccurrenceTraits<edm::EventPrincipal, (edm::BranchActionType)1> >(edm::OccurrenceTraits<edm::EventPrincipal, (edm::BranchActionType)1>::TransitionInfoType const&, edm::StreamID, edm::ParentContext const&, edm::OccurrenceTraits<edm::EventPrincipal, (edm::BranchActionType)1>::Context const*)::{lambda()#1}>(edm::Worker::runModule<edm::OccurrenceTraits<edm::EventPrincipal, (edm::BranchActionType)1> >(edm::OccurrenceTraits<edm::EventPrincipal, (edm::BranchActionType)1>::TransitionInfoType const&, edm::StreamID, edm::ParentContext const&, edm::OccurrenceTraits<edm::EventPrincipal, (edm::BranchActionType)1>::Context const*)::{lambda()#1}) () from /cvmfs/cms-ib.cern.ch/nweek-02679/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-04-2300/lib/slc7_ppc64le_gcc9/libFWCoreFramework.so
#21 0x00001000003007e0 in bool edm::Worker::runModule<edm::OccurrenceTraits<edm::EventPrincipal, (edm::BranchActionType)1> >(edm::OccurrenceTraits<edm::EventPrincipal, (edm::BranchActionType)1>::TransitionInfoType const&, edm::StreamID, edm::ParentContext const&, edm::OccurrenceTraits<edm::EventPrincipal, (edm::BranchActionType)1>::Context const*) () from /cvmfs/cms-ib.cern.ch/nweek-02679/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-04-2300/lib/slc7_ppc64le_gcc9/libFWCoreFramework.so
#22 0x0000100000300c58 in std::__exception_ptr::exception_ptr edm::Worker::runModuleAfterAsyncPrefetch<edm::OccurrenceTraits<edm::EventPrincipal, (edm::BranchActionType)1> >(std::__exception_ptr::exception_ptr const*, edm::OccurrenceTraits<edm::EventPrincipal, (edm::BranchActionType)1>::TransitionInfoType const&, edm::StreamID, edm::ParentContext const&, edm::OccurrenceTraits<edm::EventPrincipal, (edm::BranchActionType)1>::Context const*) () from /cvmfs/cms-ib.cern.ch/nweek-02679/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-04-2300/lib/slc7_ppc64le_gcc9/libFWCoreFramework.so
#23 0x0000100000303470 in edm::Worker::RunModuleTask<edm::OccurrenceTraits<edm::EventPrincipal, (edm::BranchActionType)1> >::execute() () from /cvmfs/cms-ib.cern.ch/nweek-02679/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-04-2300/lib/slc7_ppc64le_gcc9/libFWCoreFramework.so
#24 0x0000100000a0961c in tbb::detail::d1::function_task<edm::WaitingTaskList::announce()::{lambda()#1}>::execute(tbb::detail::d1::execution_data&) () from /cvmfs/cms-ib.cern.ch/nweek-02679/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-04-2300/lib/slc7_ppc64le_gcc9/libFWCoreConcurrency.so
#25 0x00001000023a6924 in tbb::detail::r1::task_dispatcher::local_wait_for_all<false, tbb::detail::r1::outermost_worker_waiter> (this=0x1000037bfe80, t=0x1000036f5e00, waiter=...) at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/BUILD/slc7_ppc64le_gcc9/external/tbb/v2021.2.0-753991d8cffbab375ded80d936d79020/tbb-v2021.2.0/src/tbb/../../include/oneapi/tbb/detail/_machine.h:356
#26 0x00001000023a12a0 in tbb::detail::r1::task_dispatcher::local_wait_for_all<tbb::detail::r1::outermost_worker_waiter> (waiter=..., t=0x0, this=<optimized out>) at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/BUILD/slc7_ppc64le_gcc9/external/tbb/v2021.2.0-753991d8cffbab375ded80d936d79020/tbb-v2021.2.0/src/tbb/governor.h:147
#27 tbb::detail::r1::arena::process (this=0x1000037bf780, tls=...) at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/BUILD/slc7_ppc64le_gcc9/external/tbb/v2021.2.0-753991d8cffbab375ded80d936d79020/tbb-v2021.2.0/src/tbb/arena.cpp:133
#28 0x00001000023b79e4 in tbb::detail::r1::market::process (this=0x1000037bab00, j=...) at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/BUILD/slc7_ppc64le_gcc9/external/tbb/v2021.2.0-753991d8cffbab375ded80d936d79020/tbb-v2021.2.0/src/tbb/market.cpp:593
#29 0x00001000023bde9c in tbb::detail::r1::rml::private_worker::run (this=0x1000097d0080) at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/BUILD/slc7_ppc64le_gcc9/external/tbb/v2021.2.0-753991d8cffbab375ded80d936d79020/tbb-v2021.2.0/src/tbb/private_server.cpp:266
#30 0x00001000023be248 in tbb::detail::r1::rml::private_worker::thread_routine (arg=<optimized out>) at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/BUILD/slc7_ppc64le_gcc9/external/tbb/v2021.2.0-753991d8cffbab375ded80d936d79020/tbb-v2021.2.0/src/tbb/private_server.cpp:220
#31 0x0000100002a38cd4 in start_thread () from /lib64/libpthread.so.0
#32 0x0000100002b97f14 in clone () from /lib64/libc.so.6

Thread 1 (Thread 0x1000034c0000 (LWP 127594)):
#0  0x0000100002b4eb88 in nanosleep () from /lib64/libc.so.6
#1  0x0000100002b4e8bc in sleep () from /lib64/libc.so.6
#2  0x000010000a3629ac in sig_pause_for_stacktrace () from /cvmfs/cms-ib.cern.ch/nweek-02679/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-04-2300/lib/slc7_ppc64le_gcc9/pluginFWCoreServicesPlugins.so
#3  <signal handler called>
#4  0x0000100001f53650 in rtree_szind_slab_read_fast (r_slab=<synthetic pointer>, r_szind=<synthetic pointer>, key=17605408200576, rtree_ctx=<optimized out>, rtree=<optimized out>, tsdn=<optimized out>) at include/jemalloc/internal/rtree.h:475
#5  free_fastpath (size_hint=false, size=0, ptr=0x1003141a1780) at src/jemalloc.c:2827
#6  free (ptr=0x1003141a1780) at src/jemalloc.c:2870
#7  0x0000100001fc4558 in operator delete (ptr=<optimized out>) at src/jemalloc_cpp.cpp:107
#8  0x000010025d780438 in std::_Rb_tree<GEMDetId, std::pair<GEMDetId const, std::vector<GEMDigi, std::allocator<GEMDigi> > >, std::_Select1st<std::pair<GEMDetId const, std::vector<GEMDigi, std::allocator<GEMDigi> > > >, std::less<GEMDetId>, std::allocator<std::pair<GEMDetId const, std::vector<GEMDigi, std::allocator<GEMDigi> > > > >::_M_erase(std::_Rb_tree_node<std::pair<GEMDetId const, std::vector<GEMDigi, std::allocator<GEMDigi> > > >*) () from /cvmfs/cms-ib.cern.ch/nweek-02679/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-04-2300/lib/slc7_ppc64le_gcc9/libDataFormatsGEMDigi.so
#9  0x000010025d780418 in std::_Rb_tree<GEMDetId, std::pair<GEMDetId const, std::vector<GEMDigi, std::allocator<GEMDigi> > >, std::_Select1st<std::pair<GEMDetId const, std::vector<GEMDigi, std::allocator<GEMDigi> > > >, std::less<GEMDetId>, std::allocator<std::pair<GEMDetId const, std::vector<GEMDigi, std::allocator<GEMDigi> > > > >::_M_erase(std::_Rb_tree_node<std::pair<GEMDetId const, std::vector<GEMDigi, std::allocator<GEMDigi> > > >*) () from /cvmfs/cms-ib.cern.ch/nweek-02679/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-04-2300/lib/slc7_ppc64le_gcc9/libDataFormatsGEMDigi.so
#10 0x000010025d780418 in std::_Rb_tree<GEMDetId, std::pair<GEMDetId const, std::vector<GEMDigi, std::allocator<GEMDigi> > >, std::_Select1st<std::pair<GEMDetId const, std::vector<GEMDigi, std::allocator<GEMDigi> > > >, std::less<GEMDetId>, std::allocator<std::pair<GEMDetId const, std::vector<GEMDigi, std::allocator<GEMDigi> > > > >::_M_erase(std::_Rb_tree_node<std::pair<GEMDetId const, std::vector<GEMDigi, std::allocator<GEMDigi> > > >*) () from /cvmfs/cms-ib.cern.ch/nweek-02679/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-04-2300/lib/slc7_ppc64le_gcc9/libDataFormatsGEMDigi.so
#11 0x000010025d780418 in std::_Rb_tree<GEMDetId, std::pair<GEMDetId const, std::vector<GEMDigi, std::allocator<GEMDigi> > >, std::_Select1st<std::pair<GEMDetId const, std::vector<GEMDigi, std::allocator<GEMDigi> > > >, std::less<GEMDetId>, std::allocator<std::pair<GEMDetId const, std::vector<GEMDigi, std::allocator<GEMDigi> > > > >::_M_erase(std::_Rb_tree_node<std::pair<GEMDetId const, std::vector<GEMDigi, std::allocator<GEMDigi> > > >*) () from /cvmfs/cms-ib.cern.ch/nweek-02679/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-04-2300/lib/slc7_ppc64le_gcc9/libDataFormatsGEMDigi.so
#12 0x000010025d780418 in std::_Rb_tree<GEMDetId, std::pair<GEMDetId const, std::vector<GEMDigi, std::allocator<GEMDigi> > >, std::_Select1st<std::pair<GEMDetId const, std::vector<GEMDigi, std::allocator<GEMDigi> > > >, std::less<GEMDetId>, std::allocator<std::pair<GEMDetId const, std::vector<GEMDigi, std::allocator<GEMDigi> > > > >::_M_erase(std::_Rb_tree_node<std::pair<GEMDetId const, std::vector<GEMDigi, std::allocator<GEMDigi> > > >*) () from /cvmfs/cms-ib.cern.ch/nweek-02679/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-04-2300/lib/slc7_ppc64le_gcc9/libDataFormatsGEMDigi.so
#13 0x000010028514a6c8 in GEMDigiToRawModule::produce(edm::StreamID, edm::Event&, edm::EventSetup const&) const () from /cvmfs/cms-ib.cern.ch/nweek-02679/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-04-2300/lib/slc7_ppc64le_gcc9/pluginEventFilterGEMRawToDigiPlugins.so
#14 0x00001000003feefc in edm::global::EDProducerBase::doEvent(edm::EventTransitionInfo const&, edm::ActivityRegistry*, edm::ModuleCallingContext const*) () from /cvmfs/cms-ib.cern.ch/nweek-02679/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-04-2300/lib/slc7_ppc64le_gcc9/libFWCoreFramework.so
#15 0x00001000003ef054 in edm::WorkerT<edm::global::EDProducerBase>::implDo(edm::EventTransitionInfo const&, edm::ModuleCallingContext const*) () from /cvmfs/cms-ib.cern.ch/nweek-02679/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-04-2300/lib/slc7_ppc64le_gcc9/libFWCoreFramework.so
#16 0x0000100000300558 in decltype ({parm#1}()) edm::convertException::wrap<edm::Worker::runModule<edm::OccurrenceTraits<edm::EventPrincipal, (edm::BranchActionType)1> >(edm::OccurrenceTraits<edm::EventPrincipal, (edm::BranchActionType)1>::TransitionInfoType const&, edm::StreamID, edm::ParentContext const&, edm::OccurrenceTraits<edm::EventPrincipal, (edm::BranchActionType)1>::Context const*)::{lambda()#1}>(edm::Worker::runModule<edm::OccurrenceTraits<edm::EventPrincipal, (edm::BranchActionType)1> >(edm::OccurrenceTraits<edm::EventPrincipal, (edm::BranchActionType)1>::TransitionInfoType const&, edm::StreamID, edm::ParentContext const&, edm::OccurrenceTraits<edm::EventPrincipal, (edm::BranchActionType)1>::Context const*)::{lambda()#1}) () from /cvmfs/cms-ib.cern.ch/nweek-02679/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-04-2300/lib/slc7_ppc64le_gcc9/libFWCoreFramework.so
#17 0x00001000003007e0 in bool edm::Worker::runModule<edm::OccurrenceTraits<edm::EventPrincipal, (edm::BranchActionType)1> >(edm::OccurrenceTraits<edm::EventPrincipal, (edm::BranchActionType)1>::TransitionInfoType const&, edm::StreamID, edm::ParentContext const&, edm::OccurrenceTraits<edm::EventPrincipal, (edm::BranchActionType)1>::Context const*) () from /cvmfs/cms-ib.cern.ch/nweek-02679/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-04-2300/lib/slc7_ppc64le_gcc9/libFWCoreFramework.so
#18 0x0000100000300c58 in std::__exception_ptr::exception_ptr edm::Worker::runModuleAfterAsyncPrefetch<edm::OccurrenceTraits<edm::EventPrincipal, (edm::BranchActionType)1> >(std::__exception_ptr::exception_ptr const*, edm::OccurrenceTraits<edm::EventPrincipal, (edm::BranchActionType)1>::TransitionInfoType const&, edm::StreamID, edm::ParentContext const&, edm::OccurrenceTraits<edm::EventPrincipal, (edm::BranchActionType)1>::Context const*) () from /cvmfs/cms-ib.cern.ch/nweek-02679/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-04-2300/lib/slc7_ppc64le_gcc9/libFWCoreFramework.so
#19 0x0000100000303470 in edm::Worker::RunModuleTask<edm::OccurrenceTraits<edm::EventPrincipal, (edm::BranchActionType)1> >::execute() () from /cvmfs/cms-ib.cern.ch/nweek-02679/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-04-2300/lib/slc7_ppc64le_gcc9/libFWCoreFramework.so
#20 0x0000100000a0961c in tbb::detail::d1::function_task<edm::WaitingTaskList::announce()::{lambda()#1}>::execute(tbb::detail::d1::execution_data&) () from /cvmfs/cms-ib.cern.ch/nweek-02679/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-04-2300/lib/slc7_ppc64le_gcc9/libFWCoreConcurrency.so
#21 0x00001000023c7764 in tbb::detail::r1::task_dispatcher::local_wait_for_all<false, tbb::detail::r1::external_waiter> (this=0x1000037bfd80, t=0x100003618c00, waiter=...) at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/BUILD/slc7_ppc64le_gcc9/external/tbb/v2021.2.0-753991d8cffbab375ded80d936d79020/tbb-v2021.2.0/src/tbb/../../include/oneapi/tbb/detail/_machine.h:356
#22 0x00001000023c36c4 in tbb::detail::r1::task_dispatcher::local_wait_for_all<tbb::detail::r1::external_waiter> (waiter=..., t=<optimized out>, this=<optimized out>) at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/BUILD/slc7_ppc64le_gcc9/external/tbb/v2021.2.0-753991d8cffbab375ded80d936d79020/tbb-v2021.2.0/src/tbb/task_dispatcher.cpp:178
#23 tbb::detail::r1::task_dispatcher::execute_and_wait (t=<optimized out>, wait_ctx=..., w_ctx=...) at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/BUILD/slc7_ppc64le_gcc9/external/tbb/v2021.2.0-753991d8cffbab375ded80d936d79020/tbb-v2021.2.0/src/tbb/task_dispatcher.cpp:168
#24 0x00001000023c3784 in tbb::detail::r1::wait (wait_ctx=..., w_ctx=...) at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/BUILD/slc7_ppc64le_gcc9/external/tbb/v2021.2.0-753991d8cffbab375ded80d936d79020/tbb-v2021.2.0/src/tbb/task_dispatcher.cpp:126
#25 0x00001000002217fc in edm::EventProcessor::processLumis(std::shared_ptr<void> const&) () from /cvmfs/cms-ib.cern.ch/nweek-02679/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-04-2300/lib/slc7_ppc64le_gcc9/libFWCoreFramework.so
#26 0x000010000022e918 in edm::EventProcessor::runToCompletion() () from /cvmfs/cms-ib.cern.ch/nweek-02679/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-04-2300/lib/slc7_ppc64le_gcc9/libFWCoreFramework.so
#27 0x000000001000b928 in tbb::detail::d1::task_arena_function<main::{lambda()#1}::operator()() const::{lambda()#1}, void>::operator()() const ()
#28 0x000010000239ffe0 in tbb::detail::r1::task_arena_impl::execute (ta=..., d=...) at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/BUILD/slc7_ppc64le_gcc9/external/tbb/v2021.2.0-753991d8cffbab375ded80d936d79020/tbb-v2021.2.0/src/tbb/arena.cpp:674
#29 0x00001000023a1108 in tbb::detail::r1::execute (ta=..., d=...) at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/BUILD/slc7_ppc64le_gcc9/external/tbb/v2021.2.0-753991d8cffbab375ded80d936d79020/tbb-v2021.2.0/src/tbb/arena.cpp:403
#30 0x000000001000cb7c in main::{lambda()#1}::operator()() const ()
#31 0x000000001000ade0 in main ()

Current Modules:

Module: HGCalVFEProducer:hgcalVFEProducer (crashed)
Module: GEMDigiToRawModule:gemPacker
Module: HGCalConcentratorProducer:hgcalConcentratorProducer
Module: HGCalConcentratorProducer:hgcalConcentratorProducer

A fatal system signal has occurred: segmentation violation
@cmsbuild
Copy link
Contributor

cmsbuild commented May 5, 2021

A new Issue was created by @dan131riley Dan Riley.

@Dr15Jones, @dpiparo, @silviodonato, @smuzaffar, @makortel, @qliphy can you please review it and eventually sign/assign? Thanks.

cms-bot commands are listed here

@dan131riley
Copy link
Author

Also a weird segfault in edm::SerialTaskQueue::pickNextTask():

A fatal system signal has occurred: segmentation violation
The following is the call stack containing the origin of the signal.

Wed May  5 10:52:13 CEST 2021
Thread 5 (Thread 0x100257ef8390 (LWP 80656)):
#0  0x0000100002b4eb88 in nanosleep () from /lib64/libc.so.6
#1  0x0000100002b4e8bc in sleep () from /lib64/libc.so.6
#2  0x000010000f4329ac in sig_pause_for_stacktrace () from /cvmfs/cms-ib.cern.ch/nweek-02679/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-04-2300/lib/slc7_ppc64le_gcc9/pluginFWCoreServicesPlugins.so
#3  <signal handler called>
#4  0x0000100002b73ae8 in sched_yield () from /lib64/libc.so.6
#5  0x00001000023a6168 in __gthread_yield () at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/slc7_ppc64le_gcc9/external/gcc/9.3.0/include/c++/9.3.0/powerpc64le-unknown-linux-gnu/bits/gthr-default.h:693
#6  std::this_thread::yield () at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/slc7_ppc64le_gcc9/external/gcc/9.3.0/include/c++/9.3.0/thread:356
#7  tbb::detail::r1::stealing_loop_backoff::pause (this=<optimized out>) at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/BUILD/slc7_ppc64le_gcc9/external/tbb/v2021.2.0-753991d8cffbab375ded80d936d79020/tbb-v2021.2.0/src/tbb/scheduler_common.h:261
#8  tbb::detail::r1::waiter_base::pause (this=0x100257ef7860) at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/BUILD/slc7_ppc64le_gcc9/external/tbb/v2021.2.0-753991d8cffbab375ded80d936d79020/tbb-v2021.2.0/src/tbb/waiters.h:35
#9  tbb::detail::r1::outermost_worker_waiter::pause (this=<optimized out>) at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/BUILD/slc7_ppc64le_gcc9/external/tbb/v2021.2.0-753991d8cffbab375ded80d936d79020/tbb-v2021.2.0/src/tbb/waiters.h:69
#10 tbb::detail::r1::task_dispatcher::receive_or_steal_task<false, tbb::detail::r1::outermost_worker_waiter> (this=0x1000037bff00, tls=..., ed=..., waiter=..., isolation=0, fifo_allowed=<optimized out>, critical_allowed=<optimized out>) at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/BUILD/slc7_ppc64le_gcc9/external/tbb/v2021.2.0-753991d8cffbab375ded80d936d79020/tbb-v2021.2.0/src/tbb/task_dispatcher.h:231
#11 0x00001000023a6b34 in tbb::detail::r1::task_dispatcher::local_wait_for_all<false, tbb::detail::r1::outermost_worker_waiter> (this=0x1000037bff00, t=0x0, waiter=...) at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/slc7_ppc64le_gcc9/external/gcc/9.3.0/include/c++/9.3.0/bits/atomic_base.h:734
#12 0x00001000023a12a0 in tbb::detail::r1::task_dispatcher::local_wait_for_all<tbb::detail::r1::outermost_worker_waiter> (waiter=..., t=0x0, this=<optimized out>) at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/BUILD/slc7_ppc64le_gcc9/external/tbb/v2021.2.0-753991d8cffbab375ded80d936d79020/tbb-v2021.2.0/src/tbb/governor.h:147
#13 tbb::detail::r1::arena::process (this=0x1000037bf780, tls=...) at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/BUILD/slc7_ppc64le_gcc9/external/tbb/v2021.2.0-753991d8cffbab375ded80d936d79020/tbb-v2021.2.0/src/tbb/arena.cpp:133
#14 0x00001000023b79e4 in tbb::detail::r1::market::process (this=0x1000037bab00, j=...) at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/BUILD/slc7_ppc64le_gcc9/external/tbb/v2021.2.0-753991d8cffbab375ded80d936d79020/tbb-v2021.2.0/src/tbb/market.cpp:593
#15 0x00001000023bde9c in tbb::detail::r1::rml::private_worker::run (this=0x10000b710000) at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/BUILD/slc7_ppc64le_gcc9/external/tbb/v2021.2.0-753991d8cffbab375ded80d936d79020/tbb-v2021.2.0/src/tbb/private_server.cpp:266
#16 0x00001000023be248 in tbb::detail::r1::rml::private_worker::thread_routine (arg=<optimized out>) at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/BUILD/slc7_ppc64le_gcc9/external/tbb/v2021.2.0-753991d8cffbab375ded80d936d79020/tbb-v2021.2.0/src/tbb/private_server.cpp:220
#17 0x0000100002a38cd4 in start_thread () from /lib64/libpthread.so.0
#18 0x0000100002b97f14 in clone () from /lib64/libc.so.6

Thread 4 (Thread 0x1002574e8390 (LWP 80655)):
#0  0x0000100002b4eb88 in nanosleep () from /lib64/libc.so.6
#1  0x0000100002b4e8bc in sleep () from /lib64/libc.so.6
#2  0x000010000f4329ac in sig_pause_for_stacktrace () from /cvmfs/cms-ib.cern.ch/nweek-02679/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-04-2300/lib/slc7_ppc64le_gcc9/pluginFWCoreServicesPlugins.so
#3  <signal handler called>
#4  0x0000100002b73ae8 in sched_yield () from /lib64/libc.so.6
#5  0x00001000023a6144 in __gthread_yield () at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/slc7_ppc64le_gcc9/external/gcc/9.3.0/include/c++/9.3.0/powerpc64le-unknown-linux-gnu/bits/gthr-default.h:693
#6  std::this_thread::yield () at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/slc7_ppc64le_gcc9/external/gcc/9.3.0/include/c++/9.3.0/thread:356
#7  tbb::detail::d0::machine_pause (delay=80) at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/BUILD/slc7_ppc64le_gcc9/external/tbb/v2021.2.0-753991d8cffbab375ded80d936d79020/tbb-v2021.2.0/src/tbb/../../include/oneapi/tbb/detail/_machine.h:95
#8  tbb::detail::r1::prolonged_pause_impl () at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/BUILD/slc7_ppc64le_gcc9/external/tbb/v2021.2.0-753991d8cffbab375ded80d936d79020/tbb-v2021.2.0/src/tbb/scheduler_common.h:217
#9  tbb::detail::r1::prolonged_pause () at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/BUILD/slc7_ppc64le_gcc9/external/tbb/v2021.2.0-753991d8cffbab375ded80d936d79020/tbb-v2021.2.0/src/tbb/scheduler_common.h:233
#10 tbb::detail::r1::stealing_loop_backoff::pause (this=<optimized out>) at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/BUILD/slc7_ppc64le_gcc9/external/tbb/v2021.2.0-753991d8cffbab375ded80d936d79020/tbb-v2021.2.0/src/tbb/scheduler_common.h:258
#11 tbb::detail::r1::waiter_base::pause (this=0x1002574e7860) at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/BUILD/slc7_ppc64le_gcc9/external/tbb/v2021.2.0-753991d8cffbab375ded80d936d79020/tbb-v2021.2.0/src/tbb/waiters.h:35
#12 tbb::detail::r1::outermost_worker_waiter::pause (this=<optimized out>) at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/BUILD/slc7_ppc64le_gcc9/external/tbb/v2021.2.0-753991d8cffbab375ded80d936d79020/tbb-v2021.2.0/src/tbb/waiters.h:69
#13 tbb::detail::r1::task_dispatcher::receive_or_steal_task<false, tbb::detail::r1::outermost_worker_waiter> (this=0x1000037bfe00, tls=..., ed=..., waiter=..., isolation=0, fifo_allowed=<optimized out>, critical_allowed=<optimized out>) at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/BUILD/slc7_ppc64le_gcc9/external/tbb/v2021.2.0-753991d8cffbab375ded80d936d79020/tbb-v2021.2.0/src/tbb/task_dispatcher.h:231
#14 0x00001000023a6b34 in tbb::detail::r1::task_dispatcher::local_wait_for_all<false, tbb::detail::r1::outermost_worker_waiter> (this=0x1000037bfe00, t=0x0, waiter=...) at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/slc7_ppc64le_gcc9/external/gcc/9.3.0/include/c++/9.3.0/bits/atomic_base.h:734
#15 0x00001000023a12a0 in tbb::detail::r1::task_dispatcher::local_wait_for_all<tbb::detail::r1::outermost_worker_waiter> (waiter=..., t=0x0, this=<optimized out>) at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/BUILD/slc7_ppc64le_gcc9/external/tbb/v2021.2.0-753991d8cffbab375ded80d936d79020/tbb-v2021.2.0/src/tbb/governor.h:147
#16 tbb::detail::r1::arena::process (this=0x1000037bf780, tls=...) at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/BUILD/slc7_ppc64le_gcc9/external/tbb/v2021.2.0-753991d8cffbab375ded80d936d79020/tbb-v2021.2.0/src/tbb/arena.cpp:133
#17 0x00001000023b79e4 in tbb::detail::r1::market::process (this=0x1000037bab00, j=...) at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/BUILD/slc7_ppc64le_gcc9/external/tbb/v2021.2.0-753991d8cffbab375ded80d936d79020/tbb-v2021.2.0/src/tbb/market.cpp:593
#18 0x00001000023bde9c in tbb::detail::r1::rml::private_worker::run (this=0x10000b710100) at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/BUILD/slc7_ppc64le_gcc9/external/tbb/v2021.2.0-753991d8cffbab375ded80d936d79020/tbb-v2021.2.0/src/tbb/private_server.cpp:266
#19 0x00001000023be248 in tbb::detail::r1::rml::private_worker::thread_routine (arg=<optimized out>) at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/BUILD/slc7_ppc64le_gcc9/external/tbb/v2021.2.0-753991d8cffbab375ded80d936d79020/tbb-v2021.2.0/src/tbb/private_server.cpp:220
#20 0x0000100002a38cd4 in start_thread () from /lib64/libpthread.so.0
#21 0x0000100002b97f14 in clone () from /lib64/libc.so.6

Thread 3 (Thread 0x100256ad8390 (LWP 80654)):
#0  0x0000100002b86df8 in poll () from /lib64/libc.so.6
#1  0x000010000f433acc in full_read.constprop () from /cvmfs/cms-ib.cern.ch/nweek-02679/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-04-2300/lib/slc7_ppc64le_gcc9/pluginFWCoreServicesPlugins.so
#2  0x000010000f434918 in edm::service::InitRootHandlers::stacktraceFromThread() () from /cvmfs/cms-ib.cern.ch/nweek-02679/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-04-2300/lib/slc7_ppc64le_gcc9/pluginFWCoreServicesPlugins.so
#3  0x000010000f438a0c in sig_dostack_then_abort () from /cvmfs/cms-ib.cern.ch/nweek-02679/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-04-2300/lib/slc7_ppc64le_gcc9/pluginFWCoreServicesPlugins.so
#4  <signal handler called>
#5  0x0000100000a05f74 in edm::SerialTaskQueue::pickNextTask() () from /cvmfs/cms-ib.cern.ch/nweek-02679/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-04-2300/lib/slc7_ppc64le_gcc9/libFWCoreConcurrency.so
#6  0x0000100000a06fc8 in edm::SerialTaskQueue::pushAndGetNextTask(edm::SerialTaskQueue::TaskBase*) () from /cvmfs/cms-ib.cern.ch/nweek-02679/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-04-2300/lib/slc7_ppc64le_gcc9/libFWCoreConcurrency.so
#7  0x0000100000a07220 in edm::SerialTaskQueue::pushTask(edm::SerialTaskQueue::TaskBase*) () from /cvmfs/cms-ib.cern.ch/nweek-02679/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-04-2300/lib/slc7_ppc64le_gcc9/libFWCoreConcurrency.so
#8  0x000010000021b760 in edm::EventProcessor::handleNextEventForStreamAsync(edm::WaitingTaskHolder, unsigned int) () from /cvmfs/cms-ib.cern.ch/nweek-02679/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-04-2300/lib/slc7_ppc64le_gcc9/libFWCoreFramework.so
#9  0x00001000002255b8 in edm::FunctorWaitingTask<edm::EventProcessor::beginLumiAsync(edm::IOVSyncValue const&, std::shared_ptr<void> const&, edm::WaitingTaskHolder)::{lambda(edm::LimitedTaskQueue::Resumer)#1}::operator()(edm::LimitedTaskQueue::Resumer)::{lambda()#1}::operator()()::{lambda(std::__exception_ptr::exception_ptr const*)#1}::operator()(std::__exception_ptr::exception_ptr)::{lambda()#1}::operator()()::{lambda(std::__exception_ptr::exception_ptr)#1}>::execute() () from /cvmfs/cms-ib.cern.ch/nweek-02679/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-04-2300/lib/slc7_ppc64le_gcc9/libFWCoreFramework.so
#10 0x00001000001e7658 in tbb::detail::d1::function_task<edm::WaitingTaskHolder::doneWaiting(std::__exception_ptr::exception_ptr)::{lambda()#1}>::execute(tbb::detail::d1::execution_data&) () from /cvmfs/cms-ib.cern.ch/nweek-02679/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-04-2300/lib/slc7_ppc64le_gcc9/libFWCoreFramework.so
#11 0x00001000023a6924 in tbb::detail::r1::task_dispatcher::local_wait_for_all<false, tbb::detail::r1::outermost_worker_waiter> (this=0x1000037bfe80, t=0x10038c712300, waiter=...) at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/BUILD/slc7_ppc64le_gcc9/external/tbb/v2021.2.0-753991d8cffbab375ded80d936d79020/tbb-v2021.2.0/src/tbb/../../include/oneapi/tbb/detail/_machine.h:356
#12 0x00001000023a12a0 in tbb::detail::r1::task_dispatcher::local_wait_for_all<tbb::detail::r1::outermost_worker_waiter> (waiter=..., t=0x0, this=<optimized out>) at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/BUILD/slc7_ppc64le_gcc9/external/tbb/v2021.2.0-753991d8cffbab375ded80d936d79020/tbb-v2021.2.0/src/tbb/governor.h:147
#13 tbb::detail::r1::arena::process (this=0x1000037bf780, tls=...) at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/BUILD/slc7_ppc64le_gcc9/external/tbb/v2021.2.0-753991d8cffbab375ded80d936d79020/tbb-v2021.2.0/src/tbb/arena.cpp:133
#14 0x00001000023b79e4 in tbb::detail::r1::market::process (this=0x1000037bab00, j=...) at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/BUILD/slc7_ppc64le_gcc9/external/tbb/v2021.2.0-753991d8cffbab375ded80d936d79020/tbb-v2021.2.0/src/tbb/market.cpp:593
#15 0x00001000023bde9c in tbb::detail::r1::rml::private_worker::run (this=0x10000b710080) at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/BUILD/slc7_ppc64le_gcc9/external/tbb/v2021.2.0-753991d8cffbab375ded80d936d79020/tbb-v2021.2.0/src/tbb/private_server.cpp:266
#16 0x00001000023be248 in tbb::detail::r1::rml::private_worker::thread_routine (arg=<optimized out>) at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/BUILD/slc7_ppc64le_gcc9/external/tbb/v2021.2.0-753991d8cffbab375ded80d936d79020/tbb-v2021.2.0/src/tbb/private_server.cpp:220
#17 0x0000100002a38cd4 in start_thread () from /lib64/libpthread.so.0
#18 0x0000100002b97f14 in clone () from /lib64/libc.so.6

Thread 1 (Thread 0x1000034c0000 (LWP 79460)):
#0  0x0000100002b4eb88 in nanosleep () from /lib64/libc.so.6
#1  0x0000100002b4e8bc in sleep () from /lib64/libc.so.6
#2  0x000010000f4329ac in sig_pause_for_stacktrace () from /cvmfs/cms-ib.cern.ch/nweek-02679/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-04-2300/lib/slc7_ppc64le_gcc9/pluginFWCoreServicesPlugins.so
#3  <signal handler called>
#4  0x0000100002b73ae8 in sched_yield () from /lib64/libc.so.6
#5  0x00001000023c6a84 in __gthread_yield () at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/slc7_ppc64le_gcc9/external/gcc/9.3.0/include/c++/9.3.0/powerpc64le-unknown-linux-gnu/bits/gthr-default.h:693
#6  std::this_thread::yield () at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/slc7_ppc64le_gcc9/external/gcc/9.3.0/include/c++/9.3.0/thread:356
#7  tbb::detail::d0::machine_pause (delay=80) at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/BUILD/slc7_ppc64le_gcc9/external/tbb/v2021.2.0-753991d8cffbab375ded80d936d79020/tbb-v2021.2.0/src/tbb/../../include/oneapi/tbb/detail/_machine.h:95
#8  tbb::detail::r1::prolonged_pause_impl () at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/BUILD/slc7_ppc64le_gcc9/external/tbb/v2021.2.0-753991d8cffbab375ded80d936d79020/tbb-v2021.2.0/src/tbb/scheduler_common.h:217
#9  tbb::detail::r1::prolonged_pause () at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/BUILD/slc7_ppc64le_gcc9/external/tbb/v2021.2.0-753991d8cffbab375ded80d936d79020/tbb-v2021.2.0/src/tbb/scheduler_common.h:233
#10 tbb::detail::r1::stealing_loop_backoff::pause (this=<optimized out>) at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/BUILD/slc7_ppc64le_gcc9/external/tbb/v2021.2.0-753991d8cffbab375ded80d936d79020/tbb-v2021.2.0/src/tbb/scheduler_common.h:258
#11 tbb::detail::r1::waiter_base::pause (this=<optimized out>) at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/BUILD/slc7_ppc64le_gcc9/external/tbb/v2021.2.0-753991d8cffbab375ded80d936d79020/tbb-v2021.2.0/src/tbb/waiters.h:35
#12 tbb::detail::r1::external_waiter::pause (this=<optimized out>) at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/BUILD/slc7_ppc64le_gcc9/external/tbb/v2021.2.0-753991d8cffbab375ded80d936d79020/tbb-v2021.2.0/src/tbb/waiters.h:138
#13 tbb::detail::r1::task_dispatcher::receive_or_steal_task<false, tbb::detail::r1::external_waiter> (this=0x1000037bfd80, tls=..., ed=..., waiter=..., isolation=0, fifo_allowed=<optimized out>, critical_allowed=<optimized out>) at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/BUILD/slc7_ppc64le_gcc9/external/tbb/v2021.2.0-753991d8cffbab375ded80d936d79020/tbb-v2021.2.0/src/tbb/task_dispatcher.h:231
#14 0x00001000023c7934 in tbb::detail::r1::task_dispatcher::local_wait_for_all<false, tbb::detail::r1::external_waiter> (this=0x1000037bfd80, t=0x0, waiter=...) at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/slc7_ppc64le_gcc9/external/gcc/9.3.0/include/c++/9.3.0/bits/atomic_base.h:734
#15 0x00001000023c36c4 in tbb::detail::r1::task_dispatcher::local_wait_for_all<tbb::detail::r1::external_waiter> (waiter=..., t=<optimized out>, this=<optimized out>) at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/BUILD/slc7_ppc64le_gcc9/external/tbb/v2021.2.0-753991d8cffbab375ded80d936d79020/tbb-v2021.2.0/src/tbb/task_dispatcher.cpp:178
#16 tbb::detail::r1::task_dispatcher::execute_and_wait (t=<optimized out>, wait_ctx=..., w_ctx=...) at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/BUILD/slc7_ppc64le_gcc9/external/tbb/v2021.2.0-753991d8cffbab375ded80d936d79020/tbb-v2021.2.0/src/tbb/task_dispatcher.cpp:168
#17 0x00001000023c3784 in tbb::detail::r1::wait (wait_ctx=..., w_ctx=...) at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/BUILD/slc7_ppc64le_gcc9/external/tbb/v2021.2.0-753991d8cffbab375ded80d936d79020/tbb-v2021.2.0/src/tbb/task_dispatcher.cpp:126
#18 0x00001000002217fc in edm::EventProcessor::processLumis(std::shared_ptr<void> const&) () from /cvmfs/cms-ib.cern.ch/nweek-02679/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-04-2300/lib/slc7_ppc64le_gcc9/libFWCoreFramework.so
#19 0x000010000022e918 in edm::EventProcessor::runToCompletion() () from /cvmfs/cms-ib.cern.ch/nweek-02679/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-04-2300/lib/slc7_ppc64le_gcc9/libFWCoreFramework.so
#20 0x000000001000b928 in tbb::detail::d1::task_arena_function<main::{lambda()#1}::operator()() const::{lambda()#1}, void>::operator()() const ()
#21 0x000010000239ffe0 in tbb::detail::r1::task_arena_impl::execute (ta=..., d=...) at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/BUILD/slc7_ppc64le_gcc9/external/tbb/v2021.2.0-753991d8cffbab375ded80d936d79020/tbb-v2021.2.0/src/tbb/arena.cpp:674
#22 0x00001000023a1108 in tbb::detail::r1::execute (ta=..., d=...) at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/BUILD/slc7_ppc64le_gcc9/external/tbb/v2021.2.0-753991d8cffbab375ded80d936d79020/tbb-v2021.2.0/src/tbb/arena.cpp:403
#23 0x000000001000cb7c in main::{lambda()#1}::operator()() const ()
#24 0x000000001000ade0 in main ()

Current Modules:

Module: none (crashed)
Module: none
Module: none
Module: none

A fatal system signal has occurred: segmentation violation

@makortel
Copy link
Contributor

makortel commented May 5, 2021

assign core

@cmsbuild
Copy link
Contributor

cmsbuild commented May 5, 2021

New categories assigned: core

@Dr15Jones,@smuzaffar,@makortel you have been requested to review this Pull request/Issue and eventually sign? Thanks

@dan131riley
Copy link
Author

These crashes are in (at least) wf 31434.0, 34834.999, 35034.0, 35434.0, 36234.0, and 36634.0. All three are Phase2C11* Extended2026* variants with HGCal. They mostly track back to MessageLogger messages in HGCalDDDConstants::isValidCell8():

edm::LogVerbatim("HGCalGeom") << "Input " << lay << ":" << waferU << ":" << waferV << ":" << cellU << ":" << cellV
<< " N " << N << " part " << partn.first << ":" << partn.second << " Result "
<< result;

but also a few from Phase2TrackerMonitorDigi::fillOTDigiHistos(), either:

edm::LogInfo("Phase2TrackerMonitorDigi") << " column " << col << " row " << row << std::dec << std::endl;

or

edm::LogInfo("Phase2TrackerMonitorDigi") << " row " << row << " col " << col << " row_last " << row_last
<< " col_last " << col_last << " width " << digiClusters.back().width;

I don't see any obvious reason these would crash, I guess try valgrind?

@Dr15Jones
Copy link
Contributor

@dan131riley it looks to me like the HGCalDDDConstants was meant to be behind a #ifdef EDM_ML_DEBUG since all the other verbatim statements are done that way.

@Dr15Jones
Copy link
Contributor

Having debug lines would help in tracking down the MessageLogger problem.

@dan131riley
Copy link
Author

Another one in edm::SerialTaskQueue::pickNextTask():

Begin processing the 100th record. Run 320822, Event 26491514, LumiSection 17 on stream 1 at 12-May-2021 04:29:56.317 CEST


A fatal system signal has occurred: segmentation violation
The following is the call stack containing the origin of the signal.

Wed May 12 04:29:57 CEST 2021

Thread 10 (Thread 0x100265a08380 (LWP 84463)):
#0  0x0000100003236df8 in poll () from /lib64/libc.so.6
#1  0x000010000d153acc in full_read.constprop () from /cvmfs/cms-ib.cern.ch/nweek-02680/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-11-2100/lib/slc7_ppc64le_gcc9/pluginFWCoreServicesPlugins.so
#2  0x000010000d154918 in edm::service::InitRootHandlers::stacktraceFromThread() () from /cvmfs/cms-ib.cern.ch/nweek-02680/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-11-2100/lib/slc7_ppc64le_gcc9/pluginFWCoreServicesPlugins.so
#3  0x000010000d158a0c in sig_dostack_then_abort () from /cvmfs/cms-ib.cern.ch/nweek-02680/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-11-2100/lib/slc7_ppc64le_gcc9/pluginFWCoreServicesPlugins.so
#4  <signal handler called>
#5  0x0000100000a05f74 in edm::SerialTaskQueue::pickNextTask() () from /cvmfs/cms-ib.cern.ch/nweek-02680/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-11-2100/lib/slc7_ppc64le_gcc9/libFWCoreConcurrency.so
#6  0x0000100000a06634 in edm::SerialTaskQueue::resume() () from /cvmfs/cms-ib.cern.ch/nweek-02680/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-11-2100/lib/slc7_ppc64le_gcc9/libFWCoreConcurrency.so
#7  0x00001000002a50a8 in edm::FunctorWaitingTask<edm::eventsetup::EventSetupRecordIOVQueue::startNewIOVAsync(edm::WaitingTaskHolder const&, edm::WaitingTaskList&)::{lambda(edm::LimitedTaskQueue::Resumer)#1}::operator()(edm::LimitedTaskQueue::Resumer)::{lambda(std::__exception_ptr::exception_ptr const*)#1}>::execute() () from /cvmfs/cms-ib.cern.ch/nweek-02680/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-11-2100/lib/slc7_ppc64le_gcc9/libFWCoreFramework.so
#8  0x00001000001e7658 in tbb::detail::d1::function_task<edm::WaitingTaskHolder::doneWaiting(std::__exception_ptr::exception_ptr)::{lambda()#1}>::execute(tbb::detail::d1::execution_data&) () from /cvmfs/cms-ib.cern.ch/nweek-02680/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-11-2100/lib/slc7_ppc64le_gcc9/libFWCoreFramework.so
#9  0x0000100002a56924 in tbb::detail::r1::task_dispatcher::local_wait_for_all<false, tbb::detail::r1::outermost_worker_waiter> (this=0x100003e8ff00, t=0x1002d8155300, waiter=...) at /scratch/cmsbuild/jenkins_a/workspace/auto-builds/CMSSW_12_0_0_pre1-slc7_ppc64le_gcc9/build/CMSSW_12_0_0_pre1-build/BUILD/slc7_ppc64le_gcc9/external/tbb/v2021.2.0/tbb-v2021.2.0/src/tbb/../../include/oneapi/tbb/detail/_machine.h:356
#10 0x0000100002a512a0 in tbb::detail::r1::task_dispatcher::local_wait_for_all<tbb::detail::r1::outermost_worker_waiter> (waiter=..., t=0x0, this=<optimized out>) at /scratch/cmsbuild/jenkins_a/workspace/auto-builds/CMSSW_12_0_0_pre1-slc7_ppc64le_gcc9/build/CMSSW_12_0_0_pre1-build/BUILD/slc7_ppc64le_gcc9/external/tbb/v2021.2.0/tbb-v2021.2.0/src/tbb/governor.h:147
#11 tbb::detail::r1::arena::process (this=0x100003e8f780, tls=...) at /scratch/cmsbuild/jenkins_a/workspace/auto-builds/CMSSW_12_0_0_pre1-slc7_ppc64le_gcc9/build/CMSSW_12_0_0_pre1-build/BUILD/slc7_ppc64le_gcc9/external/tbb/v2021.2.0/tbb-v2021.2.0/src/tbb/arena.cpp:133
#12 0x0000100002a679e4 in tbb::detail::r1::market::process (this=0x100003e8ab00, j=...) at /scratch/cmsbuild/jenkins_a/workspace/auto-builds/CMSSW_12_0_0_pre1-slc7_ppc64le_gcc9/build/CMSSW_12_0_0_pre1-build/BUILD/slc7_ppc64le_gcc9/external/tbb/v2021.2.0/tbb-v2021.2.0/src/tbb/market.cpp:593
#13 0x0000100002a6de9c in tbb::detail::r1::rml::private_worker::run (this=0x10000b9f0000) at /scratch/cmsbuild/jenkins_a/workspace/auto-builds/CMSSW_12_0_0_pre1-slc7_ppc64le_gcc9/build/CMSSW_12_0_0_pre1-build/BUILD/slc7_ppc64le_gcc9/external/tbb/v2021.2.0/tbb-v2021.2.0/src/tbb/private_server.cpp:266
#14 0x0000100002a6e248 in tbb::detail::r1::rml::private_worker::thread_routine (arg=<optimized out>) at /scratch/cmsbuild/jenkins_a/workspace/auto-builds/CMSSW_12_0_0_pre1-slc7_ppc64le_gcc9/build/CMSSW_12_0_0_pre1-build/BUILD/slc7_ppc64le_gcc9/external/tbb/v2021.2.0/tbb-v2021.2.0/src/tbb/private_server.cpp:220
#15 0x00001000030e8cd4 in start_thread () from /lib64/libpthread.so.0
#16 0x0000100003247f14 in clone () from /lib64/libc.so.6

Thread 9 (Thread 0x100262818380 (LWP 84462)):
#2  0x000010000d1529ac in sig_pause_for_stacktrace () from /cvmfs/cms-ib.cern.ch/nweek-02680/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-11-2100/lib/slc7_ppc64le_gcc9/pluginFWCoreServicesPlugins.so
#3  <signal handler called>
#4  0x0000100003223ae8 in sched_yield () from /lib64/libc.so.6
#5  0x0000100002a56168 in __gthread_yield () at /scratch/cmsbuild/jenkins_a/workspace/auto-builds/CMSSW_12_0_0_pre1-slc7_ppc64le_gcc9/build/CMSSW_12_0_0_pre1-build/slc7_ppc64le_gcc9/external/gcc/9.3.0/include/c++/9.3.0/powerpc64le-unknown-linux-gnu/bits/gthr-default.h:693
#6  std::this_thread::yield () at /scratch/cmsbuild/jenkins_a/workspace/auto-builds/CMSSW_12_0_0_pre1-slc7_ppc64le_gcc9/build/CMSSW_12_0_0_pre1-build/slc7_ppc64le_gcc9/external/gcc/9.3.0/include/c++/9.3.0/thread:356
#7  tbb::detail::r1::stealing_loop_backoff::pause (this=<optimized out>) at /scratch/cmsbuild/jenkins_a/workspace/auto-builds/CMSSW_12_0_0_pre1-slc7_ppc64le_gcc9/build/CMSSW_12_0_0_pre1-build/BUILD/slc7_ppc64le_gcc9/external/tbb/v2021.2.0/tbb-v2021.2.0/src/tbb/scheduler_common.h:261
#8  tbb::detail::r1::waiter_base::pause (this=0x100262817850) at /scratch/cmsbuild/jenkins_a/workspace/auto-builds/CMSSW_12_0_0_pre1-slc7_ppc64le_gcc9/build/CMSSW_12_0_0_pre1-build/BUILD/slc7_ppc64le_gcc9/external/tbb/v2021.2.0/tbb-v2021.2.0/src/tbb/waiters.h:35
#9  tbb::detail::r1::outermost_worker_waiter::pause (this=<optimized out>) at /scratch/cmsbuild/jenkins_a/workspace/auto-builds/CMSSW_12_0_0_pre1-slc7_ppc64le_gcc9/build/CMSSW_12_0_0_pre1-build/BUILD/slc7_ppc64le_gcc9/external/tbb/v2021.2.0/tbb-v2021.2.0/src/tbb/waiters.h:69
#10 tbb::detail::r1::task_dispatcher::receive_or_steal_task<false, tbb::detail::r1::outermost_worker_waiter> (this=0x100003e8fe00, tls=..., ed=..., waiter=..., isolation=0, fifo_allowed=<optimized out>, critical_allowed=<optimized out>) at /scratch/cmsbuild/jenkins_a/workspace/auto-builds/CMSSW_12_0_0_pre1-slc7_ppc64le_gcc9/build/CMSSW_12_0_0_pre1-build/BUILD/slc7_ppc64le_gcc9/external/tbb/v2021.2.0/tbb-v2021.2.0/src/tbb/task_dispatcher.h:231
#11 0x0000100002a56b34 in tbb::detail::r1::task_dispatcher::local_wait_for_all<false, tbb::detail::r1::outermost_worker_waiter> (this=0x100003e8fe00, t=0x0, waiter=...) at /scratch/cmsbuild/jenkins_a/workspace/auto-builds/CMSSW_12_0_0_pre1-slc7_ppc64le_gcc9/build/CMSSW_12_0_0_pre1-build/slc7_ppc64le_gcc9/external/gcc/9.3.0/include/c++/9.3.0/bits/atomic_base.h:734
#12 0x0000100002a512a0 in tbb::detail::r1::task_dispatcher::local_wait_for_all<tbb::detail::r1::outermost_worker_waiter> (waiter=..., t=0x0, this=<optimized out>) at /scratch/cmsbuild/jenkins_a/workspace/auto-builds/CMSSW_12_0_0_pre1-slc7_ppc64le_gcc9/build/CMSSW_12_0_0_pre1-build/BUILD/slc7_ppc64le_gcc9/external/tbb/v2021.2.0/tbb-v2021.2.0/src/tbb/governor.h:147
#13 tbb::detail::r1::arena::process (this=0x100003e8f780, tls=...) at /scratch/cmsbuild/jenkins_a/workspace/auto-builds/CMSSW_12_0_0_pre1-slc7_ppc64le_gcc9/build/CMSSW_12_0_0_pre1-build/BUILD/slc7_ppc64le_gcc9/external/tbb/v2021.2.0/tbb-v2021.2.0/src/tbb/arena.cpp:133
#14 0x0000100002a679e4 in tbb::detail::r1::market::process (this=0x100003e8ab00, j=...) at /scratch/cmsbuild/jenkins_a/workspace/auto-builds/CMSSW_12_0_0_pre1-slc7_ppc64le_gcc9/build/CMSSW_12_0_0_pre1-build/BUILD/slc7_ppc64le_gcc9/external/tbb/v2021.2.0/tbb-v2021.2.0/src/tbb/market.cpp:593
#15 0x0000100002a6de9c in tbb::detail::r1::rml::private_worker::run (this=0x10000b9f0100) at /scratch/cmsbuild/jenkins_a/workspace/auto-builds/CMSSW_12_0_0_pre1-slc7_ppc64le_gcc9/build/CMSSW_12_0_0_pre1-build/BUILD/slc7_ppc64le_gcc9/external/tbb/v2021.2.0/tbb-v2021.2.0/src/tbb/private_server.cpp:266
#16 0x0000100002a6e248 in tbb::detail::r1::rml::private_worker::thread_routine (arg=<optimized out>) at /scratch/cmsbuild/jenkins_a/workspace/auto-builds/CMSSW_12_0_0_pre1-slc7_ppc64le_gcc9/build/CMSSW_12_0_0_pre1-build/BUILD/slc7_ppc64le_gcc9/external/tbb/v2021.2.0/tbb-v2021.2.0/src/tbb/private_server.cpp:220
#17 0x00001000030e8cd4 in start_thread () from /lib64/libpthread.so.0
#18 0x0000100003247f14 in clone () from /lib64/libc.so.6

Thread 8 (Thread 0x100261e08380 (LWP 84461)):
#2  0x000010000d1529ac in sig_pause_for_stacktrace () from /cvmfs/cms-ib.cern.ch/nweek-02680/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-11-2100/lib/slc7_ppc64le_gcc9/pluginFWCoreServicesPlugins.so
#3  <signal handler called>
#4  0x0000100003223ae8 in sched_yield () from /lib64/libc.so.6
#5  0x0000100002a56144 in __gthread_yield () at /scratch/cmsbuild/jenkins_a/workspace/auto-builds/CMSSW_12_0_0_pre1-slc7_ppc64le_gcc9/build/CMSSW_12_0_0_pre1-build/slc7_ppc64le_gcc9/external/gcc/9.3.0/include/c++/9.3.0/powerpc64le-unknown-linux-gnu/bits/gthr-default.h:693
#6  std::this_thread::yield () at /scratch/cmsbuild/jenkins_a/workspace/auto-builds/CMSSW_12_0_0_pre1-slc7_ppc64le_gcc9/build/CMSSW_12_0_0_pre1-build/slc7_ppc64le_gcc9/external/gcc/9.3.0/include/c++/9.3.0/thread:356
#7  tbb::detail::d0::machine_pause (delay=80) at /scratch/cmsbuild/jenkins_a/workspace/auto-builds/CMSSW_12_0_0_pre1-slc7_ppc64le_gcc9/build/CMSSW_12_0_0_pre1-build/BUILD/slc7_ppc64le_gcc9/external/tbb/v2021.2.0/tbb-v2021.2.0/src/tbb/../../include/oneapi/tbb/detail/_machine.h:95
#8  tbb::detail::r1::prolonged_pause_impl () at /scratch/cmsbuild/jenkins_a/workspace/auto-builds/CMSSW_12_0_0_pre1-slc7_ppc64le_gcc9/build/CMSSW_12_0_0_pre1-build/BUILD/slc7_ppc64le_gcc9/external/tbb/v2021.2.0/tbb-v2021.2.0/src/tbb/scheduler_common.h:217
#9  tbb::detail::r1::prolonged_pause () at /scratch/cmsbuild/jenkins_a/workspace/auto-builds/CMSSW_12_0_0_pre1-slc7_ppc64le_gcc9/build/CMSSW_12_0_0_pre1-build/BUILD/slc7_ppc64le_gcc9/external/tbb/v2021.2.0/tbb-v2021.2.0/src/tbb/scheduler_common.h:233
#10 tbb::detail::r1::stealing_loop_backoff::pause (this=<optimized out>) at /scratch/cmsbuild/jenkins_a/workspace/auto-builds/CMSSW_12_0_0_pre1-slc7_ppc64le_gcc9/build/CMSSW_12_0_0_pre1-build/BUILD/slc7_ppc64le_gcc9/external/tbb/v2021.2.0/tbb-v2021.2.0/src/tbb/scheduler_common.h:258
#11 tbb::detail::r1::waiter_base::pause (this=0x100261e07850) at /scratch/cmsbuild/jenkins_a/workspace/auto-builds/CMSSW_12_0_0_pre1-slc7_ppc64le_gcc9/build/CMSSW_12_0_0_pre1-build/BUILD/slc7_ppc64le_gcc9/external/tbb/v2021.2.0/tbb-v2021.2.0/src/tbb/waiters.h:35
#12 tbb::detail::r1::outermost_worker_waiter::pause (this=<optimized out>) at /scratch/cmsbuild/jenkins_a/workspace/auto-builds/CMSSW_12_0_0_pre1-slc7_ppc64le_gcc9/build/CMSSW_12_0_0_pre1-build/BUILD/slc7_ppc64le_gcc9/external/tbb/v2021.2.0/tbb-v2021.2.0/src/tbb/waiters.h:69
#13 tbb::detail::r1::task_dispatcher::receive_or_steal_task<false, tbb::detail::r1::outermost_worker_waiter> (this=0x100003e8fe80, tls=..., ed=..., waiter=..., isolation=0, fifo_allowed=<optimized out>, critical_allowed=<optimized out>) at /scratch/cmsbuild/jenkins_a/workspace/auto-builds/CMSSW_12_0_0_pre1-slc7_ppc64le_gcc9/build/CMSSW_12_0_0_pre1-build/BUILD/slc7_ppc64le_gcc9/external/tbb/v2021.2.0/tbb-v2021.2.0/src/tbb/task_dispatcher.h:231
#14 0x0000100002a56b34 in tbb::detail::r1::task_dispatcher::local_wait_for_all<false, tbb::detail::r1::outermost_worker_waiter> (this=0x100003e8fe80, t=0x0, waiter=...) at /scratch/cmsbuild/jenkins_a/workspace/auto-builds/CMSSW_12_0_0_pre1-slc7_ppc64le_gcc9/build/CMSSW_12_0_0_pre1-build/slc7_ppc64le_gcc9/external/gcc/9.3.0/include/c++/9.3.0/bits/atomic_base.h:734
#15 0x0000100002a512a0 in tbb::detail::r1::task_dispatcher::local_wait_for_all<tbb::detail::r1::outermost_worker_waiter> (waiter=..., t=0x0, this=<optimized out>) at /scratch/cmsbuild/jenkins_a/workspace/auto-builds/CMSSW_12_0_0_pre1-slc7_ppc64le_gcc9/build/CMSSW_12_0_0_pre1-build/BUILD/slc7_ppc64le_gcc9/external/tbb/v2021.2.0/tbb-v2021.2.0/src/tbb/governor.h:147
#16 tbb::detail::r1::arena::process (this=0x100003e8f780, tls=...) at /scratch/cmsbuild/jenkins_a/workspace/auto-builds/CMSSW_12_0_0_pre1-slc7_ppc64le_gcc9/build/CMSSW_12_0_0_pre1-build/BUILD/slc7_ppc64le_gcc9/external/tbb/v2021.2.0/tbb-v2021.2.0/src/tbb/arena.cpp:133
#17 0x0000100002a679e4 in tbb::detail::r1::market::process (this=0x100003e8ab00, j=...) at /scratch/cmsbuild/jenkins_a/workspace/auto-builds/CMSSW_12_0_0_pre1-slc7_ppc64le_gcc9/build/CMSSW_12_0_0_pre1-build/BUILD/slc7_ppc64le_gcc9/external/tbb/v2021.2.0/tbb-v2021.2.0/src/tbb/market.cpp:593
#18 0x0000100002a6de9c in tbb::detail::r1::rml::private_worker::run (this=0x10000b9f0080) at /scratch/cmsbuild/jenkins_a/workspace/auto-builds/CMSSW_12_0_0_pre1-slc7_ppc64le_gcc9/build/CMSSW_12_0_0_pre1-build/BUILD/slc7_ppc64le_gcc9/external/tbb/v2021.2.0/tbb-v2021.2.0/src/tbb/private_server.cpp:266
#19 0x0000100002a6e248 in tbb::detail::r1::rml::private_worker::thread_routine (arg=<optimized out>) at /scratch/cmsbuild/jenkins_a/workspace/auto-builds/CMSSW_12_0_0_pre1-slc7_ppc64le_gcc9/build/CMSSW_12_0_0_pre1-build/BUILD/slc7_ppc64le_gcc9/external/tbb/v2021.2.0/tbb-v2021.2.0/src/tbb/private_server.cpp:220
#20 0x00001000030e8cd4 in start_thread () from /lib64/libpthread.so.0
#21 0x0000100003247f14 in clone () from /lib64/libc.so.6

Thread 1 (Thread 0x100003b90000 (LWP 82424)):
#2  0x000010000d1529ac in sig_pause_for_stacktrace () from /cvmfs/cms-ib.cern.ch/nweek-02680/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-11-2100/lib/slc7_ppc64le_gcc9/pluginFWCoreServicesPlugins.so
#3  <signal handler called>
#4  0x000010000020583c in __gnu_cxx::__normal_iterator<edm::eventsetup::EventSetupRecordKey const*, std::vector<edm::eventsetup::EventSetupRecordKey, std::allocator<edm::eventsetup::EventSetupRecordKey> > > std::__lower_bound<__gnu_cxx::__normal_iterator<edm::eventsetup::EventSetupRecordKey const*, std::vector<edm::eventsetup::EventSetupRecordKey, std::allocator<edm::eventsetup::EventSetupRecordKey> > >, edm::eventsetup::EventSetupRecordKey, __gnu_cxx::__ops::_Iter_less_val>(__gnu_cxx::__normal_iterator<edm::eventsetup::EventSetupRecordKey const*, std::vector<edm::eventsetup::EventSetupRecordKey, std::allocator<edm::eventsetup::EventSetupRecordKey> > >, __gnu_cxx::__normal_iterator<edm::eventsetup::EventSetupRecordKey const*, std::vector<edm::eventsetup::EventSetupRecordKey, std::allocator<edm::eventsetup::EventSetupRecordKey> > >, edm::eventsetup::EventSetupRecordKey const&, __gnu_cxx::__ops::_Iter_less_val) () from /cvmfs/cms-ib.cern.ch/nweek-02680/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-11-2100/lib/slc7_ppc64le_gcc9/libFWCoreFramework.so
#5  0x0000100000285344 in edm::EventSetupImpl::insertRecordImpl(edm::eventsetup::EventSetupRecordKey const&, edm::eventsetup::EventSetupRecordImpl const*) () from /cvmfs/cms-ib.cern.ch/nweek-02680/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-11-2100/lib/slc7_ppc64le_gcc9/libFWCoreFramework.so
#6  0x00001000002855a0 in edm::EventSetupImpl::addRecordImpl(edm::eventsetup::EventSetupRecordImpl const&) () from /cvmfs/cms-ib.cern.ch/nweek-02680/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-11-2100/lib/slc7_ppc64le_gcc9/libFWCoreFramework.so
#7  0x00001000002acdf4 in edm::eventsetup::EventSetupRecordProvider::continueIOV(bool) () from /cvmfs/cms-ib.cern.ch/nweek-02680/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-11-2100/lib/slc7_ppc64le_gcc9/libFWCoreFramework.so
#8  0x00001000002a794c in edm::eventsetup::EventSetupRecordIOVQueue::checkForNewIOVs(edm::WaitingTaskHolder const&, edm::WaitingTaskList&, bool) () from /cvmfs/cms-ib.cern.ch/nweek-02680/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-11-2100/lib/slc7_ppc64le_gcc9/libFWCoreFramework.so
#9  0x00001000002b6ff4 in edm::eventsetup::EventSetupsController::eventSetupForInstanceAsync(edm::IOVSyncValue const&, edm::WaitingTaskHolder const&, edm::WaitingTaskList&, std::vector<std::shared_ptr<edm::EventSetupImpl const>, std::allocator<std::shared_ptr<edm::EventSetupImpl const> > >&) () from /cvmfs/cms-ib.cern.ch/nweek-02680/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-11-2100/lib/slc7_ppc64le_gcc9/libFWCoreFramework.so
#10 0x00001000002b7558 in edm::eventsetup::synchronousEventSetupForInstance(edm::IOVSyncValue const&, tbb::detail::d1::task_group&, edm::eventsetup::EventSetupsController&) () from /cvmfs/cms-ib.cern.ch/nweek-02680/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-11-2100/lib/slc7_ppc64le_gcc9/libFWCoreFramework.so
#11 0x00001000002271fc in edm::EventProcessor::endRun(edm::Hash<2> const&, unsigned int, bool, bool) () from /cvmfs/cms-ib.cern.ch/nweek-02680/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-11-2100/lib/slc7_ppc64le_gcc9/libFWCoreFramework.so
#12 0x0000100000227aa0 in edm::EventProcessor::endUnfinishedRun(edm::Hash<2> const&, unsigned int, bool, bool, bool) () from /cvmfs/cms-ib.cern.ch/nweek-02680/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-11-2100/lib/slc7_ppc64le_gcc9/libFWCoreFramework.so
#13 0x0000100000227d9c in std::_Sp_counted_ptr_inplace<edm::(anonymous namespace)::RunResources, std::allocator<edm::(anonymous namespace)::RunResources>, (__gnu_cxx::_Lock_policy)2>::_M_dispose() () from /cvmfs/cms-ib.cern.ch/nweek-02680/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-11-2100/lib/slc7_ppc64le_gcc9/libFWCoreFramework.so
#14 0x000010000022f408 in edm::EventProcessor::runToCompletion() () from /cvmfs/cms-ib.cern.ch/nweek-02680/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-11-2100/lib/slc7_ppc64le_gcc9/libFWCoreFramework.so
#15 0x000000001000b928 in tbb::detail::d1::task_arena_function<main::{lambda()#1}::operator()() const::{lambda()#1}, void>::operator()() const ()
#16 0x0000100002a4ffe0 in tbb::detail::r1::task_arena_impl::execute (ta=..., d=...) at /scratch/cmsbuild/jenkins_a/workspace/auto-builds/CMSSW_12_0_0_pre1-slc7_ppc64le_gcc9/build/CMSSW_12_0_0_pre1-build/BUILD/slc7_ppc64le_gcc9/external/tbb/v2021.2.0/tbb-v2021.2.0/src/tbb/arena.cpp:674
#17 0x0000100002a51108 in tbb::detail::r1::execute (ta=..., d=...) at /scratch/cmsbuild/jenkins_a/workspace/auto-builds/CMSSW_12_0_0_pre1-slc7_ppc64le_gcc9/build/CMSSW_12_0_0_pre1-build/BUILD/slc7_ppc64le_gcc9/external/tbb/v2021.2.0/tbb-v2021.2.0/src/tbb/arena.cpp:403
#18 0x000000001000cb7c in main::{lambda()#1}::operator()() const ()
#19 0x000000001000ade0 in main ()

Current Modules:

Module: none (crashed)
Module: none
Module: none
Module: none

A fatal system signal has occurred: segmentation violation
timeout: the monitored command dumped core

@dan131riley
Copy link
Author

Another in edm::service::ThreadSafeLogMessageLoggerScribe::log(). This one is notable because it's wf
136.812, which is Run2_2017, not upgrade:

Begin processing the 25th record. Run 302663, Event 149688, LumiSection 1 on stream 1 at 11-May-2021 19:37:56.395 CEST


A fatal system signal has occurred: segmentation violation
The following is the call stack containing the origin of the signal.

Tue May 11 19:38:11 CEST 2021
Thread 5 (Thread 0x10025bb18460 (LWP 90600)):
#2  0x000010000f1f294c in sig_pause_for_stacktrace () from /cvmfs/cms-ib.cern.ch/nweek-02680/cc8_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-10-2300/lib/cc8_ppc64le_gcc9/pluginFWCoreServicesPlugins.so
#3  <signal handler called>
#4  0x000010022f7846ec in JacobianLocalToCurvilinear::compute(TkRotation<float> const&, Vector3DBase<float, LocalTag> const&, Vector3DBase<float, GlobalTag> const&, Vector3DBase<float, GlobalTag> const&) () from /cvmfs/cms-ib.cern.ch/nweek-02680/cc8_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-10-2300/lib/cc8_ppc64le_gcc9/libTrackingToolsAnalyticalJacobians.so
#5  0x000010022f784a70 in JacobianLocalToCurvilinear::JacobianLocalToCurvilinear(Surface const&, LocalTrajectoryParameters const&, GlobalTrajectoryParameters const&, MagneticField const&) () from /cvmfs/cms-ib.cern.ch/nweek-02680/cc8_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-10-2300/lib/cc8_ppc64le_gcc9/libTrackingToolsAnalyticalJacobians.so
#6  0x000010022f276548 in BasicTrajectoryState::checkCurvilinError() const () from /cvmfs/cms-ib.cern.ch/nweek-02680/cc8_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-10-2300/lib/cc8_ppc64le_gcc9/libTrackingToolsTrajectoryState.so
#7  0x000010022f21fb34 in Propagator::propagateWithPath(TrajectoryStateOnSurface const&, Plane const&) const () from /cvmfs/cms-ib.cern.ch/nweek-02680/cc8_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-10-2300/lib/cc8_ppc64le_gcc9/libTrackingToolsGeomPropagators.so
#8  0x00001002363f2448 in MultiStatePropagation<Plane>::propagateWithPath(TrajectoryStateOnSurface const&, Plane const&) const () from /cvmfs/cms-ib.cern.ch/nweek-02680/cc8_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-10-2300/lib/cc8_ppc64le_gcc9/libTrackingToolsGsfTools.so
#9  0x00001002363f18e0 in GsfPropagatorAdapter::propagateWithPath(TrajectoryStateOnSurface const&, Plane const&) const () from /cvmfs/cms-ib.cern.ch/nweek-02680/cc8_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-10-2300/lib/cc8_ppc64le_gcc9/libTrackingToolsGsfTools.so
#10 0x000010022f1c7d98 in TransverseImpactPointExtrapolator::doExtrapolation(TrajectoryStateOnSurface, Point3DBase<float, GlobalTag> const&, Propagator const&) const () from /cvmfs/cms-ib.cern.ch/nweek-02680/cc8_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-10-2300/lib/cc8_ppc64le_gcc9/libTrackingToolsPatternTools.so
#11 0x000010022f1c8504 in TransverseImpactPointExtrapolator::extrapolate(TrajectoryStateOnSurface, Point3DBase<float, GlobalTag> const&) const () from /cvmfs/cms-ib.cern.ch/nweek-02680/cc8_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-10-2300/lib/cc8_ppc64le_gcc9/libTrackingToolsPatternTools.so
#12 0x000010027c50c1b4 in GsfTrackProducerBase::fillMode(reco::GsfTrack&, TrajectoryStateOnSurface, Propagator const&, TransverseImpactPointExtrapolator const&, TrajectoryStateClosestToBeamLineBuilder&, reco::BeamSpot const&) const () from /cvmfs/cms-ib.cern.ch/nweek-02680/cc8_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-10-2300/lib/cc8_ppc64le_gcc9/libRecoTrackerTrackProducer.so
#13 0x000010027c50e93c in GsfTrackProducerBase::putInEvt(edm::Event&, Propagator const*, MeasurementTracker const*, std::unique_ptr<edm::OwnVector<TrackingRecHit, edm::ClonePolicy<TrackingRecHit> >, std::default_delete<edm::OwnVector<TrackingRecHit, edm::ClonePolicy<TrackingRecHit> > > >&, std::unique_ptr<std::vector<reco::GsfTrack, std::allocator<reco::GsfTrack> >, std::default_delete<std::vector<reco::GsfTrack, std::allocator<reco::GsfTrack> > > >&, std::unique_ptr<std::vector<reco::TrackExtra, std::allocator<reco::TrackExtra> >, std::default_delete<std::vector<reco::TrackExtra, std::allocator<reco::TrackExtra> > > >&, std::unique_ptr<std::vector<reco::GsfTrackExtra, std::allocator<reco::GsfTrackExtra> >, std::default_delete<std::vector<reco::GsfTrackExtra, std::allocator<reco::GsfTrackExtra> > > >&, std::unique_ptr<std::vector<Trajectory, std::allocator<Trajectory> >, std::default_delete<std::vector<Trajectory, std::allocator<Trajectory> > > >&, std::vector<AlgoProductTraits<reco::GsfTrack>::AlgoProduct, std::allocator<AlgoProductTraits<reco::GsfTrack>::AlgoProduct> >&, TransientTrackingRecHitBuilder const*, reco::BeamSpot const&, TrackerTopology const*) () from /cvmfs/cms-ib.cern.ch/nweek-02680/cc8_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-10-2300/lib/cc8_ppc64le_gcc9/libRecoTrackerTrackProducer.so
#14 0x000010027c34014c in GsfTrackProducer::produce(edm::Event&, edm::EventSetup const&) () from /cvmfs/cms-ib.cern.ch/nweek-02680/cc8_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-10-2300/lib/cc8_ppc64le_gcc9/pluginRecoTrackerTrackProducerPlugins.so

Thread 4 (Thread 0x10025b108460 (LWP 90599)):
#2  0x000010000f1f294c in sig_pause_for_stacktrace () from /cvmfs/cms-ib.cern.ch/nweek-02680/cc8_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-10-2300/lib/cc8_ppc64le_gcc9/pluginFWCoreServicesPlugins.so
#3  <signal handler called>
#4  0x0000100002b5a568 in __strchrnul_power8 () from /lib64/libc.so.6
#5  0x0000100002b20050 in __parse_one_specmb () from /lib64/libc.so.6
#6  0x0000100002b02078 in printf_positional () from /lib64/libc.so.6
#7  0x0000100002b049e0 in vfprintf@@GLIBC_2.17 () from /lib64/libc.so.6
#8  0x0000100002b2f460 in vsnprintf@@GLIBC_2.17 () from /lib64/libc.so.6
#9  0x0000100002c2a598 in __nldbl_snprintf () from /lib64/libc.so.6
#10 0x0000100263ba37d8 in SiStripHistoId::getSubdetid[abi:cxx11](unsigned int, TrackerTopology const*, bool) () from /cvmfs/cms-ib.cern.ch/nweek-02680/cc8_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-10-2300/lib/cc8_ppc64le_gcc9/libDQMSiStripCommon.so
#11 0x000010027c25b294 in SiStripMonitorTrack::findMEs(TrackerTopology const*, unsigned int) () from /cvmfs/cms-ib.cern.ch/nweek-02680/cc8_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-10-2300/lib/cc8_ppc64le_gcc9/pluginDQMSiStripMonitorTrack.so
#12 0x000010027c2770e0 in void SiStripMonitorTrack::RecHitInfo<SiStripRecHit2D>(SiStripRecHit2D const*, Vector3DBase<float, LocalTag>, edm::DetSetVector<SiStripDigi> const&, edm::Event const&, bool) () from /cvmfs/cms-ib.cern.ch/nweek-02680/cc8_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-10-2300/lib/cc8_ppc64le_gcc9/pluginDQMSiStripMonitorTrack.so
#13 0x000010027c25de50 in SiStripMonitorTrack::trajectoryStudy(reco::Track const&, edm::DetSetVector<SiStripDigi> const&, edm::Event const&, bool) () from /cvmfs/cms-ib.cern.ch/nweek-02680/cc8_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-10-2300/lib/cc8_ppc64le_gcc9/pluginDQMSiStripMonitorTrack.so
#14 0x000010027c25e318 in SiStripMonitorTrack::trackStudyFromTrajectory(edm::Handle<std::vector<reco::Track, std::allocator<reco::Track> > >, edm::DetSetVector<SiStripDigi> const&, edm::Event const&) () from /cvmfs/cms-ib.cern.ch/nweek-02680/cc8_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-10-2300/lib/cc8_ppc64le_gcc9/pluginDQMSiStripMonitorTrack.so
#15 0x000010027c25e9b8 in SiStripMonitorTrack::trackStudy(edm::Event const&) () from /cvmfs/cms-ib.cern.ch/nweek-02680/cc8_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-10-2300/lib/cc8_ppc64le_gcc9/pluginDQMSiStripMonitorTrack.so
#16 0x000010027c260b68 in SiStripMonitorTrack::analyze(edm::Event const&, edm::EventSetup const&) () from /cvmfs/cms-ib.cern.ch/nweek-02680/cc8_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-10-2300/lib/cc8_ppc64le_gcc9/pluginDQMSiStripMonitorTrack.so

Thread 3 (Thread 0x10025a6f8460 (LWP 90598)):
#2  0x000010000f1f294c in sig_pause_for_stacktrace () from /cvmfs/cms-ib.cern.ch/nweek-02680/cc8_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-10-2300/lib/cc8_ppc64le_gcc9/pluginFWCoreServicesPlugins.so
#3  <signal handler called>
#4  0x0000100002a5deb0 in __pthread_mutex_unlock_usercnt () from /lib64/libpthread.so.0
#5  0x0000100001f5d728 in malloc_mutex_unlock (tsdn=0x10025a6fe050, mutex=0x10025c007c18) at include/jemalloc/internal/mutex.h:234
#6  je_arena_tcache_fill_small (tsdn=0x10025a6fe050, arena=0x10025c000c80, tcache=<optimized out>, tbin=0x10025a6fe2b8, binind=<optimized out>, prof_accumbytes=<optimized out>) at src/arena.c:1440
#7  0x0000100001fbed5c in je_tcache_alloc_small_hard (tsdn=<optimized out>, arena=<optimized out>, tcache=<optimized out>, tbin=<optimized out>, binind=<optimized out>, tcache_success=0x10025a6f6e00) at src/tcache.c:94
#8  0x0000100001f504cc in tcache_alloc_small (slow_path=false, zero=false, binind=<optimized out>, size=<optimized out>, tcache=0x10025a6fe260, arena=0x10025c000c80, tsd=0x10025a6fe050) at include/jemalloc/internal/tsd.h:228
#9  arena_malloc (slow_path=false, tcache=0x10025a6fe260, zero=false, ind=<optimized out>, size=<optimized out>, arena=0x0, tsdn=0x10025a6fe050) at include/jemalloc/internal/arena_inlines_b.h:165
#10 iallocztm (slow_path=false, arena=0x0, is_internal=false, tcache=0x10025a6fe260, zero=false, ind=<optimized out>, size=<optimized out>, tsdn=0x10025a6fe050) at include/jemalloc/internal/jemalloc_internal_inlines_c.h:53
#11 imalloc_no_sample (ind=<optimized out>, usize=48, size=<optimized out>, tsd=0x10025a6fe050, dopts=<synthetic pointer>, sopts=<synthetic pointer>) at src/jemalloc.c:1949
#12 imalloc_body (tsd=0x10025a6fe050, dopts=<synthetic pointer>, sopts=<synthetic pointer>) at src/jemalloc.c:2149
#13 imalloc (dopts=<synthetic pointer>, sopts=<synthetic pointer>) at src/jemalloc.c:2260
#14 je_malloc_default (size=<optimized out>) at src/jemalloc.c:2291
#15 0x0000100001fc4700 in newImpl<false> (size=<optimized out>) at src/jemalloc_cpp.cpp:77
#16 operator new (size=<optimized out>) at src/jemalloc_cpp.cpp:87
#17 0x000010000089f7a0 in edm::ErrorObj::emitToken(std::basic_string_view<char, std::char_traits<char> >) () from /cvmfs/cms-ib.cern.ch/nweek-02680/cc8_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-10-2300/lib/cc8_ppc64le_gcc9/libFWCoreMessageLogger.so
#18 0x000010026bed6f34 in edm::ErrorObj& edm::ErrorObj::opltlt<float>(float const&) () from /cvmfs/cms-ib.cern.ch/nweek-02680/cc8_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-10-2300/lib/cc8_ppc64le_gcc9/pluginDQMTrackingMonitor.so
#19 0x000010026bf85ee4 in TrackingRecoMaterialAnalyser::analyze(edm::Event const&, edm::EventSetup const&) () from /cvmfs/cms-ib.cern.ch/nweek-02680/cc8_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-10-2300/lib/cc8_ppc64le_gcc9/pluginDQMTrackingMonitor.so

Thread 1 (Thread 0x1000025bb1c0 (LWP 63535)):
#3  0x000010000f1f89ac in sig_dostack_then_abort () from /cvmfs/cms-ib.cern.ch/nweek-02680/cc8_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-10-2300/lib/cc8_ppc64le_gcc9/pluginFWCoreServicesPlugins.so
#4  <signal handler called>
#5  0x000010000367bea8 in edm::service::ThreadSafeLogMessageLoggerScribe::log(edm::ErrorObj*) () from /cvmfs/cms-ib.cern.ch/nweek-02680/cc8_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-10-2300/lib/cc8_ppc64le_gcc9/libFWCoreMessageService.so
#6  0x0000100003686e3c in edm::service::ThreadSafeLogMessageLoggerScribe::runCommand(edm::MessageLoggerQ::OpCode, void*) () from /cvmfs/cms-ib.cern.ch/nweek-02680/cc8_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-10-2300/lib/cc8_ppc64le_gcc9/libFWCoreMessageService.so
#7  0x00001000008b48ec in edm::MessageLoggerQ::simpleCommand(edm::MessageLoggerQ::OpCode, void*) () from /cvmfs/cms-ib.cern.ch/nweek-02680/cc8_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-10-2300/lib/cc8_ppc64le_gcc9/libFWCoreMessageLogger.so
#8  0x00001000008b4b40 in edm::MessageLoggerQ::MLqLOG(edm::ErrorObj*) () from /cvmfs/cms-ib.cern.ch/nweek-02680/cc8_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-10-2300/lib/cc8_ppc64le_gcc9/libFWCoreMessageLogger.so
#9  0x00001000008b8488 in edm::MessageSender::ErrorObjDeleter::operator()(edm::ErrorObj*) () from /cvmfs/cms-ib.cern.ch/nweek-02680/cc8_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-10-2300/lib/cc8_ppc64le_gcc9/libFWCoreMessageLogger.so
#10 0x00001000008bc5d0 in std::_Sp_counted_deleter<edm::ErrorObj*, edm::MessageSender::ErrorObjDeleter, std::allocator<void>, (__gnu_cxx::_Lock_policy)2>::_M_dispose() () from /cvmfs/cms-ib.cern.ch/nweek-02680/cc8_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-10-2300/lib/cc8_ppc64le_gcc9/libFWCoreMessageLogger.so
#11 0x00001000008b6714 in edm::MessageSender::~MessageSender() () from /cvmfs/cms-ib.cern.ch/nweek-02680/cc8_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-10-2300/lib/cc8_ppc64le_gcc9/libFWCoreMessageLogger.so
#12 0x000010026bf85f8c in TrackingRecoMaterialAnalyser::analyze(edm::Event const&, edm::EventSetup const&) () from /cvmfs/cms-ib.cern.ch/nweek-02680/cc8_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-10-2300/lib/cc8_ppc64le_gcc9/pluginDQMTrackingMonitor.so

Current Modules:

Module: TrackingRecoMaterialAnalyser:materialDumperAnalyzer (crashed)
Module: GsfTrackProducer:electronGsfTracks
Module: SiStripMonitorTrack:SiStripMonitorTrackCommon
Module: TrackingRecoMaterialAnalyser:materialDumperAnalyzer

A fatal system signal has occurred: segmentation violation

@Dr15Jones
Copy link
Contributor

@smuzaffar @mrodozov would it be possible to build the PPC IB with debug symbols? Getting line numbers for the crash would be super useful.

@mrodozov
Copy link
Contributor

mrodozov commented May 12, 2021

I'll check if I can bring the _DBG_ IB for PPC

@mrodozov
Copy link
Contributor

I'm adding a hint for myself here to not forget the new relvals failures, apart from the Geom ones may be failing because of this
@mrodozov

@Dr15Jones
Copy link
Contributor

I'll check if I can bring the DBG IB for PPC

In general, for non-production architectures, would it make sense to always build debug symbols but not necessarily with optimization disabled, i.e. just -g flag? Since we are using these IBs primarily for for testing of the software having the tracebacks include line numbers would greatly aid fixing problems.

@mrodozov
Copy link
Contributor

There might be a problem when building in debug all "experimental" IBs because we use shared resources for that and it might take longer for testing. But we can try few times and check if it actually does, if it doesn't maybe yes. For PPC this may fly under the radar but I doubt it on Arm machines which are slow even now

@Dr15Jones
Copy link
Contributor

There might be a problem when building in debug all "experimental" IBs because we use shared resources for that and it might take longer for testing.

My suggestion is to try just generating debug symbol, -g, but leave optimization on. That should allow the code to run nearly as fast as before but be able to give us more debugging info in the case of a crash.

@mrodozov
Copy link
Contributor

We can try that yes. The DEBUG PPC IB will appear with -1700 hour some time soon

@Dr15Jones
Copy link
Contributor

@mrodozov so we had another seg fault in the 2300 build on ppc but there were no additional debug information. Was the debug only applied to one specific build?

@mrodozov
Copy link
Contributor

to the one specific build only in 1700, yes. the one in 2300 was with the existing compiler options (no debug info yet)

@dan131riley
Copy link
Author

@mrodozov The 1700 build failed, it was compiled with "-g -O0" and we apparently still have some modules that won't build correctly with optimization off. We should try to fix those, but in the meantime is it possible to do what @Dr15Jones suggested, add "-g" but leave the normal optimization flags?

@mrodozov
Copy link
Contributor

I'm searching where are the options for the _DBG_ IBs so I can turn this optimization to 3 from 0

@dan131riley
Copy link
Author

I'm searching where are the options for the DBG IBs so I can turn this optimization to 3 from 0

Looks like https://github.com/cms-sw/cmsdist/blob/4f5e67a29b506c0312331b3c6a1ce4496e616f6e/cmssw-queue-override.file#L16-L18

@mrodozov
Copy link
Contributor

mrodozov commented May 13, 2021

done. github is not very useful for searching basic things 😃 (I did search for _DBG_ rather then DBG only tho)

@mrodozov
Copy link
Contributor

started an IB for -2100 (and the right date this time)

@dan131riley
Copy link
Author

Bad/good/bad news story.

Bad: even with optimization on, there's still a bunch of stuff that doesn't compile with EDM_ML_DEBUG defined, so there's lots in the latest build that still didn't compile. We need a campaign to make DBG builds work.

Good: enough worked that we did get a bunch of stack traces, partly because EDM_ML_DEBUG turned on a bunch more logging. Representative stack trace below, no conclusions yet.

Bad: none of the more elusive edm::SerialTaskQueue::pickNextTask() segfaults.

Begin processing the 500th record. Run 1, Event 500, LumiSection 1 on stream 3 at 14-May-2021 05:17:55.511 CEST


A fatal system signal has occurred: segmentation violation
The following is the call stack containing the origin of the signal.

Fri May 14 05:17:55 CEST 2021
Thread 6 (Thread 0x10023eb68390 (LWP 35711)):
#3  0x00001000087fd5c8 in (anonymous namespace)::sig_dostack_then_abort (sig=<optimized out>) at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/tmp/BUILDROOT/d10bc25d291e839fd3f4b74a19d4477e/opt/cmssw/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_DBG_X_2021-05-13-2100/src/FWCore/Services/plugins/InitRootHandlers.cc:539
#4  <signal handler called>
#5  std::__atomic_base<unsigned long>::load (__m=<optimized out>, this=<optimized out>) at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/slc7_ppc64le_gcc9/external/tbb/v2021.2.0/include/oneapi/tbb/cache_aligned_allocator.h:50
#6  tbb::detail::d1::micro_queue<edm::ErrorObj*, tbb::detail::d1::cache_aligned_allocator<edm::ErrorObj*> >::pop (base=..., k=37702144, dst=<synthetic pointer>, this=0x100003887df0) at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/slc7_ppc64le_gcc9/external/tbb/v2021.2.0/include/oneapi/tbb/detail/_concurrent_queue_base.h:189
#7  tbb::detail::d1::concurrent_queue<edm::ErrorObj*, tbb::detail::d1::cache_aligned_allocator<edm::ErrorObj*> >::internal_try_pop (dst=<optimized out>, this=<optimized out>) at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/slc7_ppc64le_gcc9/external/tbb/v2021.2.0/include/oneapi/tbb/concurrent_queue.h:196
#8  tbb::detail::d1::concurrent_queue<edm::ErrorObj*, tbb::detail::d1::cache_aligned_allocator<edm::ErrorObj*> >::try_pop (result=<optimized out>, this=<optimized out>) at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/slc7_ppc64le_gcc9/external/tbb/v2021.2.0/include/oneapi/tbb/concurrent_queue.h:135
#9  edm::service::ThreadSafeLogMessageLoggerScribe::log (this=0x100005310210, errorobj_p=<optimized out>) at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/tmp/BUILDROOT/d10bc25d291e839fd3f4b74a19d4477e/opt/cmssw/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_DBG_X_2021-05-13-2100/src/FWCore/MessageService/src/ThreadSafeLogMessageLoggerScribe.cc:173
#10 0x000010000394abac in edm::service::ThreadSafeLogMessageLoggerScribe::runCommand (this=<optimized out>, opcode=<optimized out>, operand=<optimized out>) at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/tmp/BUILDROOT/d10bc25d291e839fd3f4b74a19d4477e/opt/cmssw/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_DBG_X_2021-05-13-2100/src/FWCore/MessageService/src/ThreadSafeLogMessageLoggerScribe.cc:86
#11 0x0000100000917b8c in edm::MessageLoggerQ::simpleCommand (opcode=<optimized out>, operand=<optimized out>) at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/slc7_ppc64le_gcc9/external/gcc/9.3.0/include/c++/9.3.0/bits/shared_ptr_base.h:1020
#12 0x0000100000917de0 in edm::MessageLoggerQ::MLqLOG (p=<optimized out>) at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/tmp/BUILDROOT/d10bc25d291e839fd3f4b74a19d4477e/opt/cmssw/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_DBG_X_2021-05-13-2100/src/FWCore/MessageLogger/src/MessageLoggerQ.cc:157
#13 0x000010000091a294 in edm::MessageSender::ErrorObjDeleter::operator() (this=<optimized out>, errorObjPtr=0x1002401bc980) at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/tmp/BUILDROOT/d10bc25d291e839fd3f4b74a19d4477e/opt/cmssw/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_DBG_X_2021-05-13-2100/src/FWCore/MessageLogger/src/MessageSender.cc:130
#14 0x0000100000920100 in std::_Sp_counted_deleter<edm::ErrorObj*, edm::MessageSender::ErrorObjDeleter, std::allocator<void>, (__gnu_cxx::_Lock_policy)2>::_M_dispose (this=<error reading variable: value has been optimized out>) at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/slc7_ppc64le_gcc9/external/gcc/9.3.0/include/c++/9.3.0/bits/shared_ptr_base.h:470
#15 0x000010000091b1d4 in std::_Sp_counted_base<(__gnu_cxx::_Lock_policy)2>::_M_release (this=0x10026a70c100) at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/slc7_ppc64le_gcc9/external/gcc/9.3.0/include/c++/9.3.0/bits/shared_ptr_base.h:148
#16 std::_Sp_counted_base<(__gnu_cxx::_Lock_policy)2>::_M_release (this=0x10026a70c100) at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/slc7_ppc64le_gcc9/external/gcc/9.3.0/include/c++/9.3.0/bits/shared_ptr_base.h:148
#17 std::__shared_count<(__gnu_cxx::_Lock_policy)2>::~__shared_count (this=<optimized out>, __in_chrg=<optimized out>) at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/slc7_ppc64le_gcc9/external/gcc/9.3.0/include/c++/9.3.0/bits/shared_ptr_base.h:730
#18 std::__shared_ptr<edm::ErrorObj, (__gnu_cxx::_Lock_policy)2>::~__shared_ptr (this=<optimized out>, __in_chrg=<optimized out>) at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/slc7_ppc64le_gcc9/external/gcc/9.3.0/include/c++/9.3.0/bits/shared_ptr_base.h:1169
#19 std::shared_ptr<edm::ErrorObj>::~shared_ptr (this=<optimized out>, __in_chrg=<optimized out>) at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/slc7_ppc64le_gcc9/external/gcc/9.3.0/include/c++/9.3.0/bits/shared_ptr.h:103
#20 edm::MessageSender::~MessageSender (this=<optimized out>, __in_chrg=<optimized out>) at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/tmp/BUILDROOT/d10bc25d291e839fd3f4b74a19d4477e/opt/cmssw/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_DBG_X_2021-05-13-2100/src/FWCore/MessageLogger/src/MessageSender.cc:140
#21 0x0000100248b626f0 in edm::Log<edm::level::Info, true>::~Log (this=0x10023eb66800, __in_chrg=<optimized out>) at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/tmp/BUILDROOT/d10bc25d291e839fd3f4b74a19d4477e/opt/cmssw/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_DBG_X_2021-05-13-2100/src/FWCore/MessageLogger/interface/MessageLogger.h:78
#22 CaloSD::update (this=0x100247a90000, trk=<optimized out>) at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/tmp/BUILDROOT/d10bc25d291e839fd3f4b74a19d4477e/opt/cmssw/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_DBG_X_2021-05-13-2100/src/SimG4CMS/Calo/src/CaloSD.cc:972
#23 0x0000100235da2540 in Observer<BeginOfTrack const*>::slotForUpdate (iT=0x10023eb668a0, this=<optimized out>) at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/tmp/BUILDROOT/d10bc25d291e839fd3f4b74a19d4477e/opt/cmssw/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_DBG_X_2021-05-13-2100/src/SimG4Core/Notification/interface/Observer.h:33
#24 sim_act::Signaler<BeginOfTrack>::operator() (iSignal=0x10023eb668a0, this=0x1002473c0080) at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/tmp/BUILDROOT/d10bc25d291e839fd3f4b74a19d4477e/opt/cmssw/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_DBG_X_2021-05-13-2100/src/SimG4Core/Notification/interface/Signaler.h:46
#25 sim_act::Signaler<BeginOfTrack>::update (iData=0x10023eb668a0, this=0x1002473c0080) at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/tmp/BUILDROOT/d10bc25d291e839fd3f4b74a19d4477e/opt/cmssw/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_DBG_X_2021-05-13-2100/src/SimG4Core/Notification/interface/Signaler.h:64
#26 Observer<BeginOfTrack const*>::slotForUpdate (iT=0x10023eb668a0, this=0x1002473c0080) at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/tmp/BUILDROOT/d10bc25d291e839fd3f4b74a19d4477e/opt/cmssw/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_DBG_X_2021-05-13-2100/src/SimG4Core/Notification/interface/Observer.h:33
#27 sim_act::Signaler<BeginOfTrack>::operator() (iSignal=0x10023eb668a0, this=0x100247373df0) at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/tmp/BUILDROOT/d10bc25d291e839fd3f4b74a19d4477e/opt/cmssw/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_DBG_X_2021-05-13-2100/src/SimG4Core/Notification/interface/Signaler.h:46
#28 TrackingAction::PreUserTrackingAction (this=0x100247373de0, aTrack=0x10026a3c7a10) at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/tmp/BUILDROOT/d10bc25d291e839fd3f4b74a19d4477e/opt/cmssw/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_DBG_X_2021-05-13-2100/src/SimG4Core/Application/src/TrackingAction.cc:36
#29 0x00001002391cd48c in G4TrackingManager::ProcessOneTrack(G4Track*) () from /cvmfs/cms-ib.cern.ch/nweek-02680/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_DBG_X_2021-05-13-2100/external/slc7_ppc64le_gcc9/lib/libG4tracking.so
#30 0x0000100236375678 in G4EventManager::DoProcessing(G4Event*) () from /cvmfs/cms-ib.cern.ch/nweek-02680/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_DBG_X_2021-05-13-2100/external/slc7_ppc64le_gcc9/lib/libG4event.so
#31 0x0000100236375ca0 in G4EventManager::ProcessOneEvent(G4Event*) () from /cvmfs/cms-ib.cern.ch/nweek-02680/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_DBG_X_2021-05-13-2100/external/slc7_ppc64le_gcc9/lib/libG4event.so
#32 0x0000100235d8e53c in RunManagerMTWorker::produce (this=0x100006756400, inpevt=..., es=..., runManagerMaster=...) at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/slc7_ppc64le_gcc9/external/gcc/9.3.0/include/c++/9.3.0/bits/unique_ptr.h:360
#33 0x0000100235cfe96c in OscarMTProducer::produce (this=<optimized out>, e=..., es=...) at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/slc7_ppc64le_gcc9/external/gcc/9.3.0/include/c++/9.3.0/bits/shared_ptr_base.h:1020

Thread 5 (Thread 0x10023e158390 (LWP 35710)):
#0  0x0000100002c0eb88 in nanosleep () from /lib64/libc.so.6
#1  0x0000100002c0e8bc in sleep () from /lib64/libc.so.6
#2  0x00001000087f6e8c in (anonymous namespace)::sig_pause_for_stacktrace (sig=<optimized out>) at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/tmp/BUILDROOT/d10bc25d291e839fd3f4b74a19d4477e/opt/cmssw/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_DBG_X_2021-05-13-2100/src/FWCore/Services/plugins/InitRootHandlers.cc:450
#3  <signal handler called>
#4  0x0000100002c4fa14 in syscall () from /lib64/libc.so.6
#5  0x000010000247e1f0 in tbb::detail::r1::futex_wait (comparand=2, futex=<optimized out>) at /scratch/cmsbuild/jenkins_a/workspace/auto-builds/CMSSW_12_0_0_pre1-slc7_ppc64le_gcc9/build/CMSSW_12_0_0_pre1-build/BUILD/slc7_ppc64le_gcc9/external/tbb/v2021.2.0/tbb-v2021.2.0/src/tbb/semaphore.h:289
#6  tbb::detail::r1::binary_semaphore::P (this=<optimized out>) at /scratch/cmsbuild/jenkins_a/workspace/auto-builds/CMSSW_12_0_0_pre1-slc7_ppc64le_gcc9/build/CMSSW_12_0_0_pre1-build/BUILD/slc7_ppc64le_gcc9/external/tbb/v2021.2.0/tbb-v2021.2.0/src/tbb/semaphore.h:289
#7  tbb::detail::r1::rml::internal::thread_monitor::commit_wait (c=..., this=0x100007c90120) at /scratch/cmsbuild/jenkins_a/workspace/auto-builds/CMSSW_12_0_0_pre1-slc7_ppc64le_gcc9/build/CMSSW_12_0_0_pre1-build/BUILD/slc7_ppc64le_gcc9/external/tbb/v2021.2.0/tbb-v2021.2.0/src/tbb/rml_thread_monitor.h:242
#8  tbb::detail::r1::rml::private_worker::run (this=0x100007c90100) at /scratch/cmsbuild/jenkins_a/workspace/auto-builds/CMSSW_12_0_0_pre1-slc7_ppc64le_gcc9/build/CMSSW_12_0_0_pre1-build/BUILD/slc7_ppc64le_gcc9/external/tbb/v2021.2.0/tbb-v2021.2.0/src/tbb/private_server.cpp:273
#9  0x000010000247e248 in tbb::detail::r1::rml::private_worker::thread_routine (arg=<optimized out>) at /scratch/cmsbuild/jenkins_a/workspace/auto-builds/CMSSW_12_0_0_pre1-slc7_ppc64le_gcc9/build/CMSSW_12_0_0_pre1-build/BUILD/slc7_ppc64le_gcc9/external/tbb/v2021.2.0/tbb-v2021.2.0/src/tbb/private_server.cpp:220
#10 0x0000100002af8cd4 in start_thread () from /lib64/libpthread.so.0
#11 0x0000100002c57f14 in clone () from /lib64/libc.so.6

Thread 4 (Thread 0x10023d748390 (LWP 35709)):
#0  0x0000100002c0eb88 in nanosleep () from /lib64/libc.so.6
#1  0x0000100002c0e8bc in sleep () from /lib64/libc.so.6
#2  0x00001000087f6e8c in (anonymous namespace)::sig_pause_for_stacktrace (sig=<optimized out>) at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/tmp/BUILDROOT/d10bc25d291e839fd3f4b74a19d4477e/opt/cmssw/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_DBG_X_2021-05-13-2100/src/FWCore/Services/plugins/InitRootHandlers.cc:450
#3  <signal handler called>
#4  rtree_szind_slab_read_fast (r_slab=<synthetic pointer>, r_szind=<synthetic pointer>, key=17601859387168, rtree_ctx=<optimized out>, rtree=<optimized out>, tsdn=<optimized out>) at include/jemalloc/internal/rtree.h:475
#5  free_fastpath (size_hint=false, size=0, ptr=0x100240937f20) at src/jemalloc.c:2827
#6  free (ptr=0x100240937f20) at src/jemalloc.c:2870
#7  0x0000100002084558 in operator delete (ptr=<optimized out>) at src/jemalloc_cpp.cpp:107
#8  0x000010000091a248 in __gnu_cxx::new_allocator<char>::deallocate (this=0x10023d746340, __p=<optimized out>) at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/slc7_ppc64le_gcc9/external/gcc/9.3.0/include/c++/9.3.0/ext/new_allocator.h:119
#9  std::allocator_traits<std::allocator<char> >::deallocate (__a=..., __n=<error reading variable: value has been optimized out>, __p=<optimized out>) at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/slc7_ppc64le_gcc9/external/gcc/9.3.0/include/c++/9.3.0/bits/alloc_traits.h:470
#10 std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> >::_M_destroy (__size=<error reading variable: value has been optimized out>, this=0x10023d746340) at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/slc7_ppc64le_gcc9/external/gcc/9.3.0/include/c++/9.3.0/bits/basic_string.h:237
#11 std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> >::_M_dispose (this=0x10023d746340) at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/slc7_ppc64le_gcc9/external/gcc/9.3.0/include/c++/9.3.0/bits/basic_string.h:232
#12 std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> >::~basic_string (this=0x10023d746340, __in_chrg=<optimized out>) at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/slc7_ppc64le_gcc9/external/gcc/9.3.0/include/c++/9.3.0/bits/basic_string.h:658
#13 edm::MessageSender::ErrorObjDeleter::operator() (this=<optimized out>, errorObjPtr=0x1002401c1980) at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/tmp/BUILDROOT/d10bc25d291e839fd3f4b74a19d4477e/opt/cmssw/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_DBG_X_2021-05-13-2100/src/FWCore/MessageLogger/src/MessageSender.cc:108
#14 0x0000100000920100 in std::_Sp_counted_deleter<edm::ErrorObj*, edm::MessageSender::ErrorObjDeleter, std::allocator<void>, (__gnu_cxx::_Lock_policy)2>::_M_dispose (this=<error reading variable: value has been optimized out>) at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/slc7_ppc64le_gcc9/external/gcc/9.3.0/include/c++/9.3.0/bits/shared_ptr_base.h:470
#15 0x000010000091b1d4 in std::_Sp_counted_base<(__gnu_cxx::_Lock_policy)2>::_M_release (this=0x10026ca77160) at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/slc7_ppc64le_gcc9/external/gcc/9.3.0/include/c++/9.3.0/bits/shared_ptr_base.h:148
#16 std::_Sp_counted_base<(__gnu_cxx::_Lock_policy)2>::_M_release (this=0x10026ca77160) at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/slc7_ppc64le_gcc9/external/gcc/9.3.0/include/c++/9.3.0/bits/shared_ptr_base.h:148
#17 std::__shared_count<(__gnu_cxx::_Lock_policy)2>::~__shared_count (this=<optimized out>, __in_chrg=<optimized out>) at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/slc7_ppc64le_gcc9/external/gcc/9.3.0/include/c++/9.3.0/bits/shared_ptr_base.h:730
#18 std::__shared_ptr<edm::ErrorObj, (__gnu_cxx::_Lock_policy)2>::~__shared_ptr (this=<optimized out>, __in_chrg=<optimized out>) at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/slc7_ppc64le_gcc9/external/gcc/9.3.0/include/c++/9.3.0/bits/shared_ptr_base.h:1169
#19 std::shared_ptr<edm::ErrorObj>::~shared_ptr (this=<optimized out>, __in_chrg=<optimized out>) at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/slc7_ppc64le_gcc9/external/gcc/9.3.0/include/c++/9.3.0/bits/shared_ptr.h:103
#20 edm::MessageSender::~MessageSender (this=<optimized out>, __in_chrg=<optimized out>) at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/tmp/BUILDROOT/d10bc25d291e839fd3f4b74a19d4477e/opt/cmssw/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_DBG_X_2021-05-13-2100/src/FWCore/MessageLogger/src/MessageSender.cc:140
#21 0x0000100248b7df58 in edm::Log<edm::level::Info, true>::~Log (this=0x10023d7466c8, __in_chrg=<optimized out>) at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/tmp/BUILDROOT/d10bc25d291e839fd3f4b74a19d4477e/opt/cmssw/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_DBG_X_2021-05-13-2100/src/FWCore/MessageLogger/interface/MessageLogger.h:78
#22 CaloTrkProcessing::update (this=0x1002402449c0, aStep=0x1002400706c0) at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/tmp/BUILDROOT/d10bc25d291e839fd3f4b74a19d4477e/opt/cmssw/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_DBG_X_2021-05-13-2100/src/SimG4CMS/Calo/src/CaloTrkProcessing.cc:263
#23 0x0000100235d9e140 in Observer<G4Step const*>::slotForUpdate (iT=<optimized out>, this=<optimized out>) at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/tmp/BUILDROOT/d10bc25d291e839fd3f4b74a19d4477e/opt/cmssw/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_DBG_X_2021-05-13-2100/src/SimG4Core/Notification/interface/Observer.h:33
#24 sim_act::Signaler<G4Step>::operator() (iSignal=<optimized out>, this=0x1002402400a0) at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/tmp/BUILDROOT/d10bc25d291e839fd3f4b74a19d4477e/opt/cmssw/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_DBG_X_2021-05-13-2100/src/SimG4Core/Notification/interface/Signaler.h:46
#25 sim_act::Signaler<G4Step>::update (iData=<optimized out>, this=0x1002402400a0) at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/tmp/BUILDROOT/d10bc25d291e839fd3f4b74a19d4477e/opt/cmssw/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_DBG_X_2021-05-13-2100/src/SimG4Core/Notification/interface/Signaler.h:64
#26 Observer<G4Step const*>::slotForUpdate (iT=<optimized out>, this=0x1002402400a0) at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/tmp/BUILDROOT/d10bc25d291e839fd3f4b74a19d4477e/opt/cmssw/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_DBG_X_2021-05-13-2100/src/SimG4Core/Notification/interface/Observer.h:33
#27 sim_act::Signaler<G4Step>::operator() (iSignal=0x1002400706c0, this=0x100240393e90) at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/tmp/BUILDROOT/d10bc25d291e839fd3f4b74a19d4477e/opt/cmssw/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_DBG_X_2021-05-13-2100/src/SimG4Core/Notification/interface/Signaler.h:46
#28 SteppingAction::UserSteppingAction (this=0x100240393e80, aStep=0x1002400706c0) at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/tmp/BUILDROOT/d10bc25d291e839fd3f4b74a19d4477e/opt/cmssw/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_DBG_X_2021-05-13-2100/src/SimG4Core/Application/src/SteppingAction.cc:95
#29 0x00001002391bfaac in G4SteppingManager::Stepping() () from /cvmfs/cms-ib.cern.ch/nweek-02680/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_DBG_X_2021-05-13-2100/external/slc7_ppc64le_gcc9/lib/libG4tracking.so
#30 0x00001002391cd3d4 in G4TrackingManager::ProcessOneTrack(G4Track*) () from /cvmfs/cms-ib.cern.ch/nweek-02680/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_DBG_X_2021-05-13-2100/external/slc7_ppc64le_gcc9/lib/libG4tracking.so
#31 0x0000100236375678 in G4EventManager::DoProcessing(G4Event*) () from /cvmfs/cms-ib.cern.ch/nweek-02680/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_DBG_X_2021-05-13-2100/external/slc7_ppc64le_gcc9/lib/libG4event.so
#32 0x0000100236375ca0 in G4EventManager::ProcessOneEvent(G4Event*) () from /cvmfs/cms-ib.cern.ch/nweek-02680/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_DBG_X_2021-05-13-2100/external/slc7_ppc64le_gcc9/lib/libG4event.so
#33 0x0000100235d8e53c in RunManagerMTWorker::produce (this=0x100006758700, inpevt=..., es=..., runManagerMaster=...) at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/slc7_ppc64le_gcc9/external/gcc/9.3.0/include/c++/9.3.0/bits/unique_ptr.h:360
#34 0x0000100235cfe96c in OscarMTProducer::produce (this=<optimized out>, e=..., es=...) at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/slc7_ppc64le_gcc9/external/gcc/9.3.0/include/c++/9.3.0/bits/shared_ptr_base.h:1020

Thread 1 (Thread 0x100003580000 (LWP 35575)):
#0  0x0000100002c0eb88 in nanosleep () from /lib64/libc.so.6
#1  0x0000100002c0e8bc in sleep () from /lib64/libc.so.6
#2  0x00001000087f6e8c in (anonymous namespace)::sig_pause_for_stacktrace (sig=<optimized out>) at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/tmp/BUILDROOT/d10bc25d291e839fd3f4b74a19d4477e/opt/cmssw/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_DBG_X_2021-05-13-2100/src/FWCore/Services/plugins/InitRootHandlers.cc:450
#3  <signal handler called>
#4  0x0000100000901a04 in std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> >::replace (__n2=21, __s=0x1002401b3880 " primary ancestor ID \020", __n1=0, __pos=0, this=0x3ffff2d2b7b0) at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/slc7_ppc64le_gcc9/external/gcc/9.3.0/include/c++/9.3.0/bits/basic_string.h:1936
#5  std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> >::replace (__k2=0x1002401b3895 "\020", __k1=0x1002401b3880 " primary ancestor ID \020", __i2=..., __i1=..., this=0x3ffff2d2b7b0) at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/slc7_ppc64le_gcc9/external/gcc/9.3.0/include/c++/9.3.0/bits/basic_string.h:2130
#6  std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> >::assign<char*, void> (__last=0x1002401b3895 "\020", __first=0x1002401b3880 " primary ancestor ID \020", this=0x3ffff2d2b7b0) at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/slc7_ppc64le_gcc9/external/gcc/9.3.0/include/c++/9.3.0/bits/basic_string.h:1471
#7  std::__cxx11::basic_stringbuf<char, std::char_traits<char>, std::allocator<char> >::str (this=<optimized out>) at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/slc7_ppc64le_gcc9/external/gcc/9.3.0/include/c++/9.3.0/sstream:185
#8  std::__cxx11::basic_ostringstream<char, std::char_traits<char>, std::allocator<char> >::str (this=<optimized out>) at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/slc7_ppc64le_gcc9/external/gcc/9.3.0/include/c++/9.3.0/sstream:678
#9  edm::ErrorObj::opltlt (this=0x1002401b5400, s=<optimized out>) at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/tmp/BUILDROOT/d10bc25d291e839fd3f4b74a19d4477e/opt/cmssw/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_DBG_X_2021-05-13-2100/src/FWCore/MessageLogger/src/ErrorObj.cc:252
#10 0x0000100000901b58 in edm::operator<< (e=..., s=<optimized out>) at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/tmp/BUILDROOT/d10bc25d291e839fd3f4b74a19d4477e/opt/cmssw/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_DBG_X_2021-05-13-2100/src/FWCore/MessageLogger/src/ErrorObj.cc:258
#11 0x0000100248b626cc in edm::MessageSender::operator<< <char [22]> (t=..., this=0x3ffff2d2b840) at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/tmp/BUILDROOT/d10bc25d291e839fd3f4b74a19d4477e/opt/cmssw/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_DBG_X_2021-05-13-2100/src/FWCore/MessageLogger/interface/MessageSender.h:47
#12 edm::Log<edm::level::Info, true>::operator<< <char [22]> (t=..., this=0x3ffff2d2b840) at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/tmp/BUILDROOT/d10bc25d291e839fd3f4b74a19d4477e/opt/cmssw/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_DBG_X_2021-05-13-2100/src/FWCore/MessageLogger/interface/MessageLogger.h:83
#13 CaloSD::update (this=0x10025c9ff500, trk=<optimized out>) at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/tmp/BUILDROOT/d10bc25d291e839fd3f4b74a19d4477e/opt/cmssw/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_DBG_X_2021-05-13-2100/src/SimG4CMS/Calo/src/CaloSD.cc:973
#14 0x0000100235da2540 in Observer<BeginOfTrack const*>::slotForUpdate (iT=0x3ffff2d2b8e0, this=<optimized out>) at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/tmp/BUILDROOT/d10bc25d291e839fd3f4b74a19d4477e/opt/cmssw/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_DBG_X_2021-05-13-2100/src/SimG4Core/Notification/interface/Observer.h:33
#15 sim_act::Signaler<BeginOfTrack>::operator() (iSignal=0x3ffff2d2b8e0, this=0x10022f9e92c0) at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/tmp/BUILDROOT/d10bc25d291e839fd3f4b74a19d4477e/opt/cmssw/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_DBG_X_2021-05-13-2100/src/SimG4Core/Notification/interface/Signaler.h:46
#16 sim_act::Signaler<BeginOfTrack>::update (iData=0x3ffff2d2b8e0, this=0x10022f9e92c0) at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/tmp/BUILDROOT/d10bc25d291e839fd3f4b74a19d4477e/opt/cmssw/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_DBG_X_2021-05-13-2100/src/SimG4Core/Notification/interface/Signaler.h:64
#17 Observer<BeginOfTrack const*>::slotForUpdate (iT=0x3ffff2d2b8e0, this=0x10022f9e92c0) at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/tmp/BUILDROOT/d10bc25d291e839fd3f4b74a19d4477e/opt/cmssw/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_DBG_X_2021-05-13-2100/src/SimG4Core/Notification/interface/Observer.h:33
#18 sim_act::Signaler<BeginOfTrack>::operator() (iSignal=0x3ffff2d2b8e0, this=0x10024155ff70) at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/tmp/BUILDROOT/d10bc25d291e839fd3f4b74a19d4477e/opt/cmssw/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_DBG_X_2021-05-13-2100/src/SimG4Core/Notification/interface/Signaler.h:46
#19 TrackingAction::PreUserTrackingAction (this=0x10024155ff60, aTrack=0x100242bc5508) at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/tmp/BUILDROOT/d10bc25d291e839fd3f4b74a19d4477e/opt/cmssw/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_DBG_X_2021-05-13-2100/src/SimG4Core/Application/src/TrackingAction.cc:36
#20 0x00001002391cd48c in G4TrackingManager::ProcessOneTrack(G4Track*) () from /cvmfs/cms-ib.cern.ch/nweek-02680/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_DBG_X_2021-05-13-2100/external/slc7_ppc64le_gcc9/lib/libG4tracking.so
#21 0x0000100236375678 in G4EventManager::DoProcessing(G4Event*) () from /cvmfs/cms-ib.cern.ch/nweek-02680/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_DBG_X_2021-05-13-2100/external/slc7_ppc64le_gcc9/lib/libG4event.so
#22 0x0000100236375ca0 in G4EventManager::ProcessOneEvent(G4Event*) () from /cvmfs/cms-ib.cern.ch/nweek-02680/slc7_ppc64le_gcc9/cms/cmssw/CMSSW_12_0_DBG_X_2021-05-13-2100/external/slc7_ppc64le_gcc9/lib/libG4event.so
#23 0x0000100235d8e53c in RunManagerMTWorker::produce (this=0x10000676d700, inpevt=..., es=..., runManagerMaster=...) at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/slc7_ppc64le_gcc9/external/gcc/9.3.0/include/c++/9.3.0/bits/unique_ptr.h:360
#24 0x0000100235cfe96c in OscarMTProducer::produce (this=<optimized out>, e=..., es=...) at /scratch/cmsbuild/jenkins_a/workspace/build-any-ib/w/slc7_ppc64le_gcc9/external/gcc/9.3.0/include/c++/9.3.0/bits/shared_ptr_base.h:1020

Current Modules:

Module: OscarMTProducer:g4SimHits (crashed)
Module: OscarMTProducer:g4SimHits
Module: OscarMTProducer:g4SimHits
Module: none

A fatal system signal has occurred: segmentation violation

@dan131riley
Copy link
Author

@dan131riley it looks to me like the HGCalDDDConstants was meant to be behind a #ifdef EDM_ML_DEBUG since all the other verbatim statements are done that way.

Yep, so it's probably a likely source of the crash because it gets executed a lot more than intended, similar to how turning on EDM_ML_DEBUG produces a much higher likelihood of crashes (and compilation failures).

@mrodozov
Copy link
Contributor

I started fixing the obvious stuff although I'm not sure with what rate I fix things and new failures are put for example in L1Trigger 😃
The one in FWCore/Framework is my bad because of an external (libunwind still fail to install in lib (vs lib64) although I "fixed" it) -> cms-sw/cmsdist#6886
The others:
#33719
#33718
so consider them first before fixing the rest (if you are attempting to, I mean)

@Dr15Jones
Copy link
Contributor

See #33734 to track issues with DBG build

@mrodozov
Copy link
Contributor

the framework missing lib should be fixed with cms-sw/cmsdist#6906 (tested on ppc to make sure this time)

@Dr15Jones
Copy link
Contributor

The place where the MessageLogger and where edm::SerialTaskQueue::pickNextTask() are failing both are using a tbb::concurrent_queue.

@Dr15Jones
Copy link
Contributor

So looking at the failure in the _DBG build, the method in TBB that is failing is the following (from: /cvmfs/cms-ib.cern.ch/nweek-02680/slc7_amd64_gcc900/external/tbb/v2021.2.0/include/oneapi/tbb/detail/_concurrent_queue_base.h )

    175     bool pop( void* dst, ticket_type k, queue_rep_type& base ) {
    176         k &= -queue_rep_type::n_queue;
    177         if (head_counter.load(std::memory_order_relaxed) != k) spin_wait_until_eq(head_counter, k);
    178         call_itt_notify(acquired, &head_counter);
    179         if (tail_counter.load(std::memory_order_relaxed) == k) spin_wait_while_eq(tail_counter, k);
    180         call_itt_notify(acquired, &tail_counter);
    181         padded_page *p = head_page.load(std::memory_order_acquire);
    182         __TBB_ASSERT( p, nullptr );
    183         size_type index = modulo_power_of_two( k/queue_rep_type::n_queue, items_per_page );
    184         bool success = false;
    185         {
    186             page_allocator_type page_allocator(base.get_allocator());
    187             micro_queue_pop_finalizer<self_type, value_type, page_allocator_type> finalizer(*this, page_allocator,
    188                 k + queue_rep_type::n_queue, index == items_per_page - 1 ? p : nullptr );
    189             if (p->mask.load(std::memory_order_relaxed) & (std::uintptr_t(1) << index)) {
    190                 success = true;
    191                 assign_and_destroy_item( dst, *p, index );
    192             } else {
    193                 --base.n_invalid_entries;
    194             }
    195         }
    196         return success;
    197     }

It fails at line 189 with a segmentation fault. The most likely reason is p == nullptr. Presumably line 182 is meant to test that (if TBB is built with its debug diagnostics).

@dan131riley
Copy link
Author

We got a segfault in ThreadSafeLogMessageLoggerScribe::log() on slc7_aarch64_gcc9, which shifts my prior from a PPC-specific bug to a more generic logic error in tbb::concurrent_queue.

Begin processing the 26th record. Run 315489, Event 20121265, LumiSection 33 on stream 0 at 16-May-2021 11:12:30.336 CEST

A fatal system signal has occurred: segmentation violation
The following is the call stack containing the origin of the signal.

Sun May 16 11:12:32 CEST 2021
Thread 5 (Thread 0xffff499183c0 (LWP 22151)):
#0  0x0000ffffbbb54e24 in poll () from /lib64/libc.so.6
#1  0x0000ffffb6f56050 in full_read.constprop () from /cvmfs/cms-ib.cern.ch/week1/slc7_aarch64_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-16-0000/lib/slc7_aarch64_gcc9/pluginFWCoreServicesPlugins.so
#2  0x0000ffffb6f569d4 in edm::service::InitRootHandlers::stacktraceFromThread() () from /cvmfs/cms-ib.cern.ch/week1/slc7_aarch64_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-16-0000/lib/slc7_aarch64_gcc9/pluginFWCoreServicesPlugins.so
#3  0x0000ffffb6f59c18 in sig_dostack_then_abort () from /cvmfs/cms-ib.cern.ch/week1/slc7_aarch64_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-16-0000/lib/slc7_aarch64_gcc9/pluginFWCoreServicesPlugins.so
#4  <signal handler called>
#5  0x0000ffffbb053c98 in edm::service::ThreadSafeLogMessageLoggerScribe::log(edm::ErrorObj*) () from /cvmfs/cms-ib.cern.ch/week1/slc7_aarch64_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-16-0000/lib/slc7_aarch64_gcc9/libFWCoreMessageService.so
#6  0x0000ffffbb05c230 in edm::service::ThreadSafeLogMessageLoggerScribe::runCommand(edm::MessageLoggerQ::OpCode, void*) () from /cvmfs/cms-ib.cern.ch/week1/slc7_aarch64_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-16-0000/lib/slc7_aarch64_gcc9/libFWCoreMessageService.so
#7  0x0000ffffbd63ea98 in edm::MessageSender::ErrorObjDeleter::operator()(edm::ErrorObj*) () from /cvmfs/cms-ib.cern.ch/week1/slc7_aarch64_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-16-0000/lib/slc7_aarch64_gcc9/libFWCoreMessageLogger.so
#8  0x0000ffffbd6415c8 in std::_Sp_counted_deleter<edm::ErrorObj*, edm::MessageSender::ErrorObjDeleter, std::allocator<void>, (__gnu_cxx::_Lock_policy)2>::_M_dispose() () from /cvmfs/cms-ib.cern.ch/week1/slc7_aarch64_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-16-0000/lib/slc7_aarch64_gcc9/libFWCoreMessageLogger.so
#9  0x0000ffffbd63d5e4 in edm::MessageSender::~MessageSender() () from /cvmfs/cms-ib.cern.ch/week1/slc7_aarch64_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-16-0000/lib/slc7_aarch64_gcc9/libFWCoreMessageLogger.so
#10 0x0000ffff46b857a4 in TrackingRecoMaterialAnalyser::analyze(edm::Event const&, edm::EventSetup const&) () from /cvmfs/cms-ib.cern.ch/week1/slc7_aarch64_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-16-0000/lib/slc7_aarch64_gcc9/pluginDQMTrackingMonitor.so

Thread 4 (Thread 0xffff4a3283c0 (LWP 22150)):
#0  0x0000ffffbbb2a9c4 in nanosleep () from /lib64/libc.so.6
#1  0x0000ffffbbb2a678 in sleep () from /lib64/libc.so.6
#2  0x0000ffffb6f559e0 in sig_pause_for_stacktrace () from /cvmfs/cms-ib.cern.ch/week1/slc7_aarch64_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-16-0000/lib/slc7_aarch64_gcc9/pluginFWCoreServicesPlugins.so
#3  <signal handler called>
#4  0x0000ffffbc0b33c8 in ?? () from /cvmfs/cms-ib.cern.ch/week1/slc7_aarch64_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-16-0000/external/slc7_aarch64_gcc9/lib/libz.so.1
#5  0x0000ffffbc0b4c24 in ?? () from /cvmfs/cms-ib.cern.ch/week1/slc7_aarch64_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-16-0000/external/slc7_aarch64_gcc9/lib/libz.so.1
#6  0x0000ffffbc0b5b7c in deflate () from /cvmfs/cms-ib.cern.ch/week1/slc7_aarch64_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-16-0000/external/slc7_aarch64_gcc9/lib/libz.so.1
#7  0x0000ffffbc80e194 in R__zipMultipleAlgorithm () from /cvmfs/cms-ib.cern.ch/week1/slc7_aarch64_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-16-0000/external/slc7_aarch64_gcc9/lib/libCore.so
#8  0x0000ffffbd333328 in TBasket::WriteBuffer() () from /cvmfs/cms-ib.cern.ch/week1/slc7_aarch64_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-16-0000/external/slc7_aarch64_gcc9/lib/libTree.so
#9  0x0000ffffbd3409ac in std::_Function_handler<void (), ROOT::Internal::TBranchIMTHelper::Run<TBranch::WriteBasketImpl(TBasket*, int, ROOT::Internal::TBranchIMTHelper*)::{lambda()#1}>(TBranch::WriteBasketImpl(TBasket*, int, ROOT::Internal::TBranchIMTHelper*)::{lambda()#1} const&)::{lambda()#1}>::_M_invoke(std::_Any_data const&) () from /cvmfs/cms-ib.cern.ch/week1/slc7_aarch64_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-16-0000/external/slc7_aarch64_gcc9/lib/libTree.so
#10 0x0000ffffbba680fc in tbb::detail::d1::function_task<std::function<void ()> >::execute(tbb::detail::d1::execution_data&) () from /cvmfs/cms-ib.cern.ch/week1/slc7_aarch64_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-16-0000/external/slc7_aarch64_gcc9/lib/libImt.so
#11 0x0000ffffbc13a778 in tbb::detail::r1::task_dispatcher::local_wait_for_all<false, tbb::detail::r1::external_waiter> (this=this@entry=0xffffbb28fe00, t=0xfffe0eec9400, t@entry=0x0, waiter=...) at /home/cmsbuild/jenkins_b/workspace/auto-builds/CMSSW_12_0_0_pre1-slc7_aarch64_gcc9/build/CMSSW_12_0_0_pre1-build/BUILD/slc7_aarch64_gcc9/external/tbb/v2021.2.0/tbb-v2021.2.0/src/tbb/../../include/oneapi/tbb/detail/_machine.h:356
#12 0x0000ffffbc138a74 in tbb::detail::r1::task_dispatcher::local_wait_for_all<tbb::detail::r1::external_waiter> (waiter=..., t=<optimized out>, this=0xffffbb28fe00) at /home/cmsbuild/jenkins_b/workspace/auto-builds/CMSSW_12_0_0_pre1-slc7_aarch64_gcc9/build/CMSSW_12_0_0_pre1-build/BUILD/slc7_aarch64_gcc9/external/tbb/v2021.2.0/tbb-v2021.2.0/src/tbb/governor.h:149
#13 tbb::detail::r1::task_dispatcher::execute_and_wait (t=0x0, wait_ctx=..., w_ctx=...) at /home/cmsbuild/jenkins_b/workspace/auto-builds/CMSSW_12_0_0_pre1-slc7_aarch64_gcc9/build/CMSSW_12_0_0_pre1-build/BUILD/slc7_aarch64_gcc9/external/tbb/v2021.2.0/tbb-v2021.2.0/src/tbb/task_dispatcher.cpp:168
#14 0x0000ffffbba67f50 in ROOT::Experimental::TTaskGroup::Wait() () from /cvmfs/cms-ib.cern.ch/week1/slc7_aarch64_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-16-0000/external/slc7_aarch64_gcc9/lib/libImt.so
#15 0x0000ffffbd3b5798 in TTree::Fill() () from /cvmfs/cms-ib.cern.ch/week1/slc7_aarch64_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-16-0000/external/slc7_aarch64_gcc9/lib/libTree.so
#16 0x0000ffff408a7158 in tbb::detail::d1::task_arena_function<edm::RootOutputTree::fillTree()::{lambda()#1}, void>::operator()() const () from /cvmfs/cms-ib.cern.ch/week1/slc7_aarch64_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-16-0000/lib/slc7_aarch64_gcc9/libIOPoolOutput.so
#17 0x0000ffffbc1286a0 in tbb::detail::r1::<lambda()>::operator() (__closure=<optimized out>) at /home/cmsbuild/jenkins_b/workspace/auto-builds/CMSSW_12_0_0_pre1-slc7_aarch64_gcc9/build/CMSSW_12_0_0_pre1-build/BUILD/slc7_aarch64_gcc9/external/tbb/v2021.2.0/tbb-v2021.2.0/src/tbb/arena.cpp:747
#18 tbb::detail::d0::try_call_proxy<tbb::detail::r1::isolate_within_arena(tbb::detail::d1::delegate_base&, intptr_t)::<lambda()> >::on_completion<tbb::detail::r1::isolate_within_arena(tbb::detail::d1::delegate_base&, intptr_t)::<lambda()> > (on_completion_body=..., this=<optimized out>) at /home/cmsbuild/jenkins_b/workspace/auto-builds/CMSSW_12_0_0_pre1-slc7_aarch64_gcc9/build/CMSSW_12_0_0_pre1-build/BUILD/slc7_aarch64_gcc9/external/tbb/v2021.2.0/tbb-v2021.2.0/src/tbb/../../include/oneapi/tbb/detail/_template_helpers.h:220
#19 tbb::detail::r1::isolate_within_arena (d=..., isolation=<optimized out>) at /home/cmsbuild/jenkins_b/workspace/auto-builds/CMSSW_12_0_0_pre1-slc7_aarch64_gcc9/build/CMSSW_12_0_0_pre1-build/BUILD/slc7_aarch64_gcc9/external/tbb/v2021.2.0/tbb-v2021.2.0/src/tbb/arena.cpp:748
#20 0x0000ffff408a7a2c in edm::RootOutputTree::fillTree() () from /cvmfs/cms-ib.cern.ch/week1/slc7_aarch64_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-16-0000/lib/slc7_aarch64_gcc9/libIOPoolOutput.so
#21 0x0000ffff4089f4ec in edm::RootOutputFile::fillBranches(edm::BranchType const&, edm::OccurrenceForOutput const&, std::vector<edm::StoredProductProvenance, std::allocator<edm::StoredProductProvenance> >*, edm::ProductProvenanceRetriever const*) () from /cvmfs/cms-ib.cern.ch/week1/slc7_aarch64_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-16-0000/lib/slc7_aarch64_gcc9/libIOPoolOutput.so
#22 0x0000ffff408a1ef8 in edm::RootOutputFile::writeOne(edm::EventForOutput const&) () from /cvmfs/cms-ib.cern.ch/week1/slc7_aarch64_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-16-0000/lib/slc7_aarch64_gcc9/libIOPoolOutput.so
#23 0x0000ffff40882f94 in edm::PoolOutputModule::write(edm::EventForOutput const&) () from /cvmfs/cms-ib.cern.ch/week1/slc7_aarch64_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-16-0000/lib/slc7_aarch64_gcc9/libIOPoolOutput.so
#24 0x0000ffffbdbd939c in edm::one::OutputModuleBase::doEvent(edm::EventTransitionInfo const&, edm::ActivityRegistry*, edm::ModuleCallingContext const*) () from /cvmfs/cms-ib.cern.ch/week1/slc7_aarch64_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-16-0000/lib/slc7_aarch64_gcc9/libFWCoreFramework.so

Thread 3 (Thread 0xffff4ad383c0 (LWP 22149)):
#0  0x0000ffffbbb2a9c4 in nanosleep () from /lib64/libc.so.6
#1  0x0000ffffbbb2a678 in sleep () from /lib64/libc.so.6
#2  0x0000ffffb6f559e0 in sig_pause_for_stacktrace () from /cvmfs/cms-ib.cern.ch/week1/slc7_aarch64_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-16-0000/lib/slc7_aarch64_gcc9/pluginFWCoreServicesPlugins.so
#3  <signal handler called>
#4  0x0000ffffbbb08dcc in _wordcopy_fwd_aligned () from /lib64/libc.so.6
#5  0x0000ffffbbb08d94 in memcpy () from /lib64/libc.so.6
#6  0x000000000040ff30 in void std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> >::_M_construct<char*>(char*, char*, std::forward_iterator_tag) ()
#7  0x0000ffffbd63b504 in edm::messagedrop::StringProducerWithPhase::theContext[abi:cxx11]() const () from /cvmfs/cms-ib.cern.ch/week1/slc7_aarch64_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-16-0000/lib/slc7_aarch64_gcc9/libFWCoreMessageLogger.so
#8  0x0000ffffbd63add0 in edm::MessageDrop::moduleContext[abi:cxx11]() () from /cvmfs/cms-ib.cern.ch/week1/slc7_aarch64_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-16-0000/lib/slc7_aarch64_gcc9/libFWCoreMessageLogger.so
#9  0x0000ffffbd63ea0c in edm::MessageSender::ErrorObjDeleter::operator()(edm::ErrorObj*) () from /cvmfs/cms-ib.cern.ch/week1/slc7_aarch64_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-16-0000/lib/slc7_aarch64_gcc9/libFWCoreMessageLogger.so
#10 0x0000ffffbd6415c8 in std::_Sp_counted_deleter<edm::ErrorObj*, edm::MessageSender::ErrorObjDeleter, std::allocator<void>, (__gnu_cxx::_Lock_policy)2>::_M_dispose() () from /cvmfs/cms-ib.cern.ch/week1/slc7_aarch64_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-16-0000/lib/slc7_aarch64_gcc9/libFWCoreMessageLogger.so
#11 0x0000ffffbd63d5e4 in edm::MessageSender::~MessageSender() () from /cvmfs/cms-ib.cern.ch/week1/slc7_aarch64_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-16-0000/lib/slc7_aarch64_gcc9/libFWCoreMessageLogger.so
#12 0x0000ffff46b857a4 in TrackingRecoMaterialAnalyser::analyze(edm::Event const&, edm::EventSetup const&) () from /cvmfs/cms-ib.cern.ch/week1/slc7_aarch64_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-16-0000/lib/slc7_aarch64_gcc9/pluginDQMTrackingMonitor.so

Thread 1 (Thread 0xffffbb390000 (LWP 19438)):
#0  0x0000ffffbbb2a9c4 in nanosleep () from /lib64/libc.so.6
#1  0x0000ffffbbb2a678 in sleep () from /lib64/libc.so.6
#2  0x0000ffffb6f559e0 in sig_pause_for_stacktrace () from /cvmfs/cms-ib.cern.ch/week1/slc7_aarch64_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-16-0000/lib/slc7_aarch64_gcc9/pluginFWCoreServicesPlugins.so
#3  <signal handler called>
#4  0x0000ffff5d1d6164 in MultipleScatteringUpdator::compute(TrajectoryStateOnSurface const&, PropagationDirection, materialEffect::Effect&) const () from /cvmfs/cms-ib.cern.ch/week1/slc7_aarch64_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-16-0000/lib/slc7_aarch64_gcc9/libTrackingToolsMaterialEffects.so
#5  0x0000ffff5d1d4978 in CombinedMaterialEffectsUpdator::compute(TrajectoryStateOnSurface const&, PropagationDirection, materialEffect::Effect&) const () from /cvmfs/cms-ib.cern.ch/week1/slc7_aarch64_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-16-0000/lib/slc7_aarch64_gcc9/libTrackingToolsMaterialEffects.so
#6  0x0000ffff5d1d5548 in MaterialEffectsUpdator::updateStateInPlace(TrajectoryStateOnSurface&, PropagationDirection) const () from /cvmfs/cms-ib.cern.ch/week1/slc7_aarch64_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-16-0000/lib/slc7_aarch64_gcc9/libTrackingToolsMaterialEffects.so
#7  0x0000ffff5d1d70fc in PropagatorWithMaterial::propagateWithPath(TrajectoryStateOnSurface const&, Plane const&) const () from /cvmfs/cms-ib.cern.ch/week1/slc7_aarch64_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-16-0000/lib/slc7_aarch64_gcc9/libTrackingToolsMaterialEffects.so
#8  0x0000ffffb5793e40 in Propagator::propagateWithPath(TrajectoryStateOnSurface const&, Surface const&) const () from /cvmfs/cms-ib.cern.ch/week1/slc7_aarch64_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-16-0000/lib/slc7_aarch64_gcc9/libTrackingToolsGeomPropagators.so
#9  0x0000ffff5b63992c in KFTrajectoryFitter::fitOne(TrajectorySeed const&, std::vector<std::shared_ptr<TrackingRecHit const>, std::allocator<std::shared_ptr<TrackingRecHit const> > > const&, TrajectoryStateOnSurface const&, TrajectoryFitter::fitType) const () from /cvmfs/cms-ib.cern.ch/week1/slc7_aarch64_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-16-0000/lib/slc7_aarch64_gcc9/libTrackingToolsTrackFitters.so
#10 0x0000ffff5b69cb58 in (anonymous namespace)::KFFittingSmoother::fitOne(TrajectorySeed const&, std::vector<std::shared_ptr<TrackingRecHit const>, std::allocator<std::shared_ptr<TrackingRecHit const> > > const&, TrajectoryStateOnSurface const&, TrajectoryFitter::fitType) const () from /cvmfs/cms-ib.cern.ch/week1/slc7_aarch64_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-16-0000/lib/slc7_aarch64_gcc9/pluginTrackingToolsTrackFittersPlugins.so
#11 0x0000ffff5b68b6c0 in (anonymous namespace)::FlexibleKFFittingSmoother::fitOne(TrajectorySeed const&, std::vector<std::shared_ptr<TrackingRecHit const>, std::allocator<std::shared_ptr<TrackingRecHit const> > > const&, TrajectoryStateOnSurface const&, TrajectoryFitter::fitType) const () from /cvmfs/cms-ib.cern.ch/week1/slc7_aarch64_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-16-0000/lib/slc7_aarch64_gcc9/pluginTrackingToolsTrackFittersPlugins.so
#12 0x0000ffff46452e34 in TrackProducerAlgorithm<reco::Track>::buildTrack(TrajectoryFitter const*, Propagator const*, std::vector<AlgoProductTraits<reco::Track>::AlgoProduct, std::allocator<AlgoProductTraits<reco::Track>::AlgoProduct> >&, std::vector<std::shared_ptr<TrackingRecHit const>, std::allocator<std::shared_ptr<TrackingRecHit const> > >&, TrajectoryStateOnSurface&, TrajectorySeed const&, float, reco::BeamSpot const&, edm::RefToBase<TrajectorySeed>, int, signed char) () from /cvmfs/cms-ib.cern.ch/week1/slc7_aarch64_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-16-0000/lib/slc7_aarch64_gcc9/libRecoTrackerTrackProducer.so
#13 0x0000ffff465b2a88 in TrackProducerAlgorithm<reco::Track>::runWithCandidate(TrackingGeometry const*, MagneticField const*, std::vector<TrackCandidate, std::allocator<TrackCandidate> > const&, TrajectoryFitter const*, Propagator const*, TransientTrackingRecHitBuilder const*, reco::BeamSpot const&, std::vector<AlgoProductTraits<reco::Track>::AlgoProduct, std::allocator<AlgoProductTraits<reco::Track>::AlgoProduct> >&) () from /cvmfs/cms-ib.cern.ch/week1/slc7_aarch64_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-16-0000/lib/slc7_aarch64_gcc9/pluginRecoTrackerTrackProducerPlugins.so
#14 0x0000ffff465affe0 in TrackProducer::produce(edm::Event&, edm::EventSetup const&) () from /cvmfs/cms-ib.cern.ch/week1/slc7_aarch64_gcc9/cms/cmssw/CMSSW_12_0_X_2021-05-16-0000/lib/slc7_aarch64_gcc9/pluginRecoTrackerTrackProducerPlugins.so

Current Modules:

Module: TrackingRecoMaterialAnalyser:materialDumperAnalyzer (crashed)
Module: TrackProducer:initialStepTracksPreSplitting
Module: PoolOutputModule:RECOoutput
Module: TrackingRecoMaterialAnalyser:materialDumperAnalyzer

A fatal system signal has occurred: segmentation violation

@dan131riley
Copy link
Author

Can we try cherry picking oneapi-src/oneTBB#435? Use after scope on the allocator is a plausible explanation for our crashes.

@alexey-katranov
Copy link

Can we try cherry picking oneapi-src/oneTBB#435? Use after scope on the allocator is a plausible explanation for our crashes.

I'd better suggest trying the current master (82ff8707 in particular).

@mrodozov
Copy link
Contributor

sure I'll get it.

@mrodozov
Copy link
Contributor

mrodozov commented Jun 17, 2021

Cherry-picking the commits from the oneTBB PR 435 didn't compile on top of our 2021.2.0 (breaking changes and change of namespace name which I didn't go to try and understand, and eventually stitch).
I got master+PR435 commits+ppc macro fix
here:
https://github.com/mrodozov/oneTBB/commits/master%2Bfix
this builds on PPC (tried it)

mrodozov added a commit to cms-sw/cmsdist that referenced this issue Jun 17, 2021
See cms-sw/cmssw#33636 (comment)
we don't have tbb external and we don't need it for this temp test on PPC
I've used my fork
@mrodozov
Copy link
Contributor

We built an IB CMSSW_12_0/2021-06-19-1100 for slc7_ppc64le_gcc9 that doesn't have "the usual" MessageLogger failures
looks like this changes are fixing the issue

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

6 participants