-
Notifications
You must be signed in to change notification settings - Fork 4.4k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
DeepTau v2p5 in nanoAOD [12_4_X] #38751
DeepTau v2p5 in nanoAOD [12_4_X] #38751
Conversation
A new Pull Request was created by @mbluj for CMSSW_12_4_X. It involves the following packages:
@gouskos, @clacaputo, @cmsbuild, @fgolf, @jpata, @mariadalfonso can you please review it and eventually sign? Thanks. cms-bot commands are listed here |
@cmsbuild please test |
+1 Summary: https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-220b83/26270/summary.html Comparison SummarySummary:
|
from_raw=True, wp_thrs=WORKING_POINTS_v2p5["mu"]), | ||
idDeepTau2018v2p5VSjet = _tauIdWPMask("byDeepTau2018v2p5VSjetraw", | ||
choices=("VVVLoose","VVLoose","VLoose","Loose","Medium","Tight","VTight","VVTight"), | ||
doc="byDeepTau2018v2p5VSjet ID working points (deepTau2018v2p5)", |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
"VVTight": 0.9733927, | ||
}, | ||
} | ||
workingPoints_ = WORKING_POINTS_v2p1 |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
similarly in master this is different
@mariadalfonso Yes, it is expected because the way IDs are stored is changed from bitmask to WP numbering, as described in the PR description. |
You mean both the tau and boosted tau v2.1 quantities are expected to change ?
technically are doing different things, one re-run the ID and one read from MINI, |
v2p1 (and other old ids) and their WPs are the same, but the way WPs are stored has changed. Just for illustration, lets suppose we have 3 WPs for some tau ID discriminator: Loose, Medium and Tight. In the old code the value of id branch would be: |
yes I can see, but means that for run3: if someone do analysis from mini or nano we get two different thing Can we push the change in bit-map for the Run3 mini as well in a separate PRs ? |
@mariadalfonso sorry I don't understand your point about Run 3 mini vs. nano confusion... In miniAOD, WP results are accessible as |
ok, good to know that there is a 3rd way to get the ID. |
yes, it can be done. I'll prepare a PR with bitmap->numbering modification shortly. |
@mariadalfonso, @kandrosov, my two cents on differences in tauIDs other than newly added deepTau v2p5: It is indeed true that the main change is caused by a modified way of storing of WPs (bitset->numbers), but one should also expect some change caused by the different selection of taus stored in nanoAOD. The selection bases on the "big OR" of loosest WPs of all discriminants against jet->tau fakes and now it contains also deepTau v2p5 VVVL: https://github.com/cms-sw/cmssw/pull/38751/files#diff-d54676262d2e5326ee3455e57747fd476b12be16993a8eb0f4794e8771f6526fR18 |
Pull request #38751 was updated. @gouskos, @clacaputo, @cmsbuild, @fgolf, @jpata, @mariadalfonso can you please check and sign again. |
@mariadalfonso, ece4723 adds a customization function to add items missing in Run-2 UL samples needed by nano v10 (now only deepTau v2p5); it is a counterpart of dd40361 described in #38726 (comment). |
-1 Failed Tests: RelVals-INPUT RelVals-INPUT
Comparison SummarySummary:
|
Seems that the mini+nano in one step fails 136.72412 Our test with persistent MINI are ok @mbluj can you have a look ? |
Yes, I am on it. The problem is indeed due to the common (re)mini+nano workflow as at both steps deepTauID is added to taus. At mini it is default with 125X/124X, while at nano it is triggered by an era modifier. There are two problems with this setup: one is technical as modules with same names are crated, but different inputs are expected (modules are in different place in "production chain"), while other is "philosophical" as deepTauID is tried to be run twice. The first one can be solved with some effort by adding a suffix to module names, while the other is more difficult to avoid without an prior knowledge on combination of workflows, i.e. production levels (here mini & nano) and eras used. P.S. The same issue affects PR to master. |
…e names and different inputs
Pull request #38751 was updated. @gouskos, @swertz, @vlimant, @clacaputo, @cmsbuild, @jpata, @mariadalfonso can you please check and sign again. |
please test |
+1 Summary: https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-220b83/26600/summary.html Comparison SummarySummary:
|
+xpog
|
+reconstruction |
This pull request is fully signed and it will be integrated in one of the next CMSSW_12_4_X IBs (tests are also fine) and once validation in the development release cycle CMSSW_12_5_X is complete. This pull request will now be reviewed by the release team before it's merged. @perrotta, @dpiparo, @qliphy, @rappoccio (and backports should be raised in the release meeting by the corresponding L2) |
+1 |
PR description:
This PR adds working point definitions for a newly integrated DeepTau v2p5.
A corresponding study with details about WP threshold derivation and tau efficiency/mis-ID rate plots for both Run2 UL and Run 3 samples can be found here.
This PR is a backport of #38726 to 12_4_X.
In particular, the changes in this PR are (as in #38726):
_deepTauVars2018v2p5
totaus_cff.py
,_tauIdWPMask()
;from_raw
argument in_tauIdWPMask()
) given the threshold values, instead of reading them directly from MINIAOD;Differences wrt #38726:
PR validation:
Original PR successfully tested with the "limited" set of matrix tests and a custom nanoAOD production. Matrix tests of this PR ongoing - we do not expect failures and will update this description when tests are finished.
If this PR is a backport please specify the original PR and why you need to backport that PR.
This PR is a backport of #38726 to 12_4_X, introduces deepTauID v2p5 to nanoAOD v10.