-
Notifications
You must be signed in to change notification settings - Fork 4.3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Empty Trajectory exception in HLT step in many workflows #35488
Comments
assign hlt, reconstruction |
New categories assigned: hlt,reconstruction @slava77,@Martin-Grunewald,@jpata,@missirol you have been requested to review this Pull request/Issue and eventually sign? Thanks |
A new Issue was created by @makortel Matti Kortelainen. @Dr15Jones, @perrotta, @dpiparo, @makortel, @smuzaffar, @qliphy can you please review it and eventually sign/assign? Thanks. cms-bot commands are listed here |
#35309 looks like a likely cause |
FYI @swagata87 (in case she has a guess on what the right fix is) |
I am running I think I should have put a check like this is what I suspect.. checking.. |
I could reproduce the issue in I'll check with other workflows reported here also ( The WFs are taking a long time to run, but I think I will manage to submit the PR by tonight.. Sorry about the problems caused by my earlier PR |
@vmariani @mmusich IIUC, @swagata87 is available to check the fit. Thank you. |
@swagata87, in the interest of time, since the fix seems well understood, you could open the PR already, have it reviewed, and have the tests run automatically (incl. the additional wfs), which will be done anyway later. (But unless any of the experts agrees, please proceed with your current plan.) |
I could agree with that. |
How this could ever be happened? |
Is there a test we can add in the short matrix to catch this early? As I understand, some of the short matrix wfs run HLT, so it's not clear to me why the tests didn't fail. |
Noting here that the culprit PR was signed by HLT. I'm happy to hear the suggestions of experts how we can improve test coverage to catch this early. |
PR tests did not catch this bug, plain and simple. Some bugs are only uncovered by IB tests as they involve many more workflows/statistics. |
The list of failing workflows over IBs also suggests that there is some randomness there, possibly caused by running the DIGI step multithreaded (the DIGI step random number sequence for a given SIM event can change according to the EDM stream the SIM event gets processed by in the DIGI step, and that "stream assignment" can be affected e.g. my machine load). |
Anyway, the fix has been merged, so closing the issue. |
(At least) 5 workflows fail in the HLT step with exception like below
HLT_DoubleTrkMu_16_6_NoFiltersNoVtx_v1
moduleHLT_DoubleTrkMu_16_6_NoFiltersNoVtx_v1
(log)HLT_DoubleTrkMu_16_6_NoFiltersNoVtx_v1
modulehltIterL3OITrackCandidatesNoVtx
(log)HLT_DoubleTrkMu_16_6_NoFiltersNoVtx_v1
modulehltIterL3OITrackCandidatesNoVtx
(log)HLT_TrkMu6NoFiltersNoVtx_v1
modulehltIterL3OITrackCandidatesNoVtx
(log)HLT_OldMu100_v3
modulehltL3TrackCandidateFromL2OIState
(log)The text was updated successfully, but these errors were encountered: