Comparison failed for tests of wf `39434.911` #37315

francescobrivio · 2022-03-23T09:49:24Z

In recent PRs test results there is a message saying:

comparisons for the following workflows were not done due to missing matrix map:
 - /data/cmsbld/jenkins/workspace/compare-root-files-short-matrix/data/PR-cadc83/39434.911_TTbar_14TeV+2026D88_DD4hep+TTbar_14TeV_TuneCP5_GenSimHLBeamSpot14+DigiTrigger+RecoGlobal+HARVESTGlobal

But nonetheless the default comparison reports more than 50k differences for this wf.
Some examples:

The text was updated successfully, but these errors were encountered:

cmsbuild · 2022-03-23T09:49:48Z

A new Issue was created by @francescobrivio .

@Dr15Jones, @perrotta, @dpiparo, @makortel, @smuzaffar, @qliphy can you please review it and eventually sign/assign? Thanks.

cms-bot commands are listed here

francescobrivio · 2022-03-23T09:50:12Z

assign reconstruction

cmsbuild · 2022-03-23T09:50:36Z

New categories assigned: reconstruction

@jpata,@slava77,@clacaputo you have been requested to review this Pull request/Issue and eventually sign? Thanks

jpata · 2022-03-23T12:28:58Z

.911 is dd4hep, we often had differences in it not related to reco. did something change, or is it still a geometry-related issue?

makortel · 2022-03-23T12:59:54Z

I believe the message

comparisons for the following workflows were not done due to missing matrix map:
 - /data/cmsbld/jenkins/workspace/compare-root-files-short-matrix/data/PR-cadc83/39434.911_TTbar_14TeV+2026D88_DD4hep+TTbar_14TeV_TuneCP5_GenSimHLBeamSpot14+DigiTrigger+RecoGlobal+HARVESTGlobal

refers only to the "validateJR" / "reco comparisons", and not to RelMon-based comparisons or DQM bin by bin.

makortel · 2022-03-23T13:00:23Z

assign geometry

cmsbuild · 2022-03-23T13:00:44Z

New categories assigned: geometry

@cvuosalo,@mdhildreth,@ianna,@Dr15Jones,@makortel,@civanch you have been requested to review this Pull request/Issue and eventually sign? Thanks

francescobrivio · 2022-03-23T13:04:15Z

.911 is dd4hep, we often had differences in it not related to reco. did something change, or is it still a geometry-related issue?

@jpata from the alca point of view there was no change in the MC dd4hep geometry. Thanks Matti for assigning geometry.

perrotta · 2022-03-23T13:13:50Z

I believe the message
comparisons for the following workflows were not done due to missing matrix map:
 - /data/cmsbld/jenkins/workspace/compare-root-files-short-matrix/data/PR-cadc83/39434.911_TTbar_14TeV+2026D88_DD4hep+TTbar_14TeV_TuneCP5_GenSimHLBeamSpot14+DigiTrigger+RecoGlobal+HARVESTGlobal
refers only to the "validateJR" / "reco comparisons", and not to RelMon-based comparisons or DQM bin by bin.

There are however also quite a lot of differences in the bin by bin DQM comparisons for that workflow in the test outputs, as you can verify by opening any of the links to the PR tests listed in the issue description above.

makortel · 2022-03-23T13:22:17Z

I believe the message
comparisons for the following workflows were not done due to missing matrix map:
 - /data/cmsbld/jenkins/workspace/compare-root-files-short-matrix/data/PR-cadc83/39434.911_TTbar_14TeV+2026D88_DD4hep+TTbar_14TeV_TuneCP5_GenSimHLBeamSpot14+DigiTrigger+RecoGlobal+HARVESTGlobal
refers only to the "validateJR" / "reco comparisons", and not to RelMon-based comparisons or DQM bin by bin.
There are however also quite a lot of differences in the bin by bin DQM comparisons for that workflow in the test outputs, as you can verify by opening any of the links to the PR tests listed in the issue description above.

Right, but the existence of the differences (whose cause should be identified) is a separate issue from one piece of comparisons infrastructure not recognizing this workflow.

cvuosalo · 2022-03-23T15:42:30Z

39434.911 is Phase 2 D88 DD4hep. It is quite new and may be still under development.
@srimanob Could you please comment about the stability of this workflow?

This workflow runs DD4hep from XML files. That makes it very sensitive to any perturbations in the source files or test process.

cvuosalo · 2022-03-23T15:42:38Z

assign upgrade

srimanob · 2022-03-23T16:05:20Z

How can we test the stability of the workflow offline? I mean how do we know if some unexpected random behavior will happen somewhere without this kind of PR test.

The test was done with ttbar 9k events, but that is to compare between DDD and DD4hep. I never try to compare DD4hep with DD4hep from 2 runs to see its stability.

francescobrivio · 2022-03-23T16:20:27Z

@cvuosalo @srimanob what I find strange is that in all the tests there are ~50k failed differences over ~100k total comparisons (see this log for example). So, if basically half of the comparisons fail, maybe it's not just a "stability" problem?

cmsbuild · 2022-03-23T16:29:16Z

New categories assigned: upgrade

@AdrianoDee,@srimanob you have been requested to review this Pull request/Issue and eventually sign? Thanks

srimanob · 2022-03-23T16:35:40Z

@francescobrivio
Yes, I saw it. I can comment it out for now. Clearly, this is not expected. However, as I said, I still have no idea why and how I can test it offline. SInce now we allow to create baseline in PR test (#37289), this may help when I try to enable it.

makortel · 2022-03-23T19:06:57Z

@srimanob Could you elaborate what you mean with "test it offline"? Is it e.g. about comparing the DQM root files of different invocations of the workflow?

srimanob · 2022-03-23T22:02:44Z

Hi @makortel
I mean I don't expect the change in .911 DD4hep phase-2 wf. With the test results on several PRs that show failure on comparison, should I test it locally somehow. For example, run twices with and without PR and compare locallly?

By the way, last fresh test of the following PR does not show perculiar on the workflow comparison,

Add HLT75e33 step to Phase-2 workflows #37324

makortel · 2022-03-23T23:40:49Z

For example, run twices with and without PR and compare locallly?

Right, run twice or many times. The comparison failure appears to be random, so it is hard to say beforehand how many times to run. In the past we've seen occurrence rates between O(1 %) and O(100 %) or so with the (Run 3) DD4Hep workflow.

clacaputo · 2022-03-24T09:46:05Z

comparisons for the following workflows were not done due to missing matrix map:
 - /data/cmsbld/jenkins/workspace/compare-root-files-short-matrix/data/PR-cadc83/39434.911_TTbar_14TeV+2026D88_DD4hep+TTbar_14TeV_TuneCP5_GenSimHLBeamSpot14+DigiTrigger+RecoGlobal+HARVESTGlobal

I think this problem should be addressed by this cms-sw/cms-bot@31e5f20

srimanob · 2022-03-24T16:26:03Z

I've prepared the pull request to disable it in case we still face random issue of failure comparison.
#37337

jpata · 2022-05-16T13:01:45Z

+reconstruction

looks like the 39434.911 was disabled

francescobrivio · 2022-05-16T13:08:19Z

Wf disabled from short matrix in #37337

cmsbuild added the pending-assignment label Mar 23, 2022

cmsbuild added pending-signatures reconstruction-pending and removed pending-assignment labels Mar 23, 2022

This was referenced Mar 23, 2022

Introduce generalized abstraction for Strip Payload Inspector #37265

Merged

Ecal phisym run3 workflow #36988

Merged

cmsbuild added the geometry-pending label Mar 23, 2022

cmsbuild added the upgrade-pending label Mar 23, 2022

srimanob mentioned this issue Mar 24, 2022

Disable DD4hep Phase2 D88 in short matrix #37337

Merged

malbouis mentioned this issue Mar 25, 2022

Update L1T menu tag #37335

Merged

francescobrivio mentioned this issue Mar 26, 2022

further miscellaneous clean-up / improvements to SiStrip payload inspector #37333

Merged

cmsbuild added reconstruction-approved and removed reconstruction-pending labels May 16, 2022

francescobrivio closed this as completed May 16, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Comparison failed for tests of wf `39434.911` #37315

Comparison failed for tests of wf `39434.911` #37315

francescobrivio commented Mar 23, 2022

cmsbuild commented Mar 23, 2022

francescobrivio commented Mar 23, 2022

cmsbuild commented Mar 23, 2022

jpata commented Mar 23, 2022

makortel commented Mar 23, 2022

makortel commented Mar 23, 2022

cmsbuild commented Mar 23, 2022

francescobrivio commented Mar 23, 2022

perrotta commented Mar 23, 2022

makortel commented Mar 23, 2022

cvuosalo commented Mar 23, 2022

cvuosalo commented Mar 23, 2022

srimanob commented Mar 23, 2022 •

edited

Loading

francescobrivio commented Mar 23, 2022

cmsbuild commented Mar 23, 2022

srimanob commented Mar 23, 2022

makortel commented Mar 23, 2022

srimanob commented Mar 23, 2022

makortel commented Mar 23, 2022

clacaputo commented Mar 24, 2022

srimanob commented Mar 24, 2022

jpata commented May 16, 2022

francescobrivio commented May 16, 2022

Comparison failed for tests of wf 39434.911 #37315

Comparison failed for tests of wf 39434.911 #37315

Comments

francescobrivio commented Mar 23, 2022

cmsbuild commented Mar 23, 2022

francescobrivio commented Mar 23, 2022

cmsbuild commented Mar 23, 2022

jpata commented Mar 23, 2022

makortel commented Mar 23, 2022

makortel commented Mar 23, 2022

cmsbuild commented Mar 23, 2022

francescobrivio commented Mar 23, 2022

perrotta commented Mar 23, 2022

makortel commented Mar 23, 2022

cvuosalo commented Mar 23, 2022

cvuosalo commented Mar 23, 2022

srimanob commented Mar 23, 2022 • edited Loading

francescobrivio commented Mar 23, 2022

cmsbuild commented Mar 23, 2022

srimanob commented Mar 23, 2022

makortel commented Mar 23, 2022

srimanob commented Mar 23, 2022

makortel commented Mar 23, 2022

clacaputo commented Mar 24, 2022

srimanob commented Mar 24, 2022

jpata commented May 16, 2022

francescobrivio commented May 16, 2022

Comparison failed for tests of wf `39434.911` #37315

Comparison failed for tests of wf `39434.911` #37315

srimanob commented Mar 23, 2022 •

edited

Loading