-
Notifications
You must be signed in to change notification settings - Fork 8
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Final modifications previous commit #106
Final modifications previous commit #106
Conversation
MRD2F
commented
Nov 13, 2018
- Versioning of DPF & consistency check
- Make quantized versions as default ones
- General code review and cleaning.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thank you for the update. Please check my comment here: cms-tau-pog/RecoTauTag-TrainingFiles#5 (comment) and adjust to it if you agree. See also a few cosmetics requests - they are not mandatory but can be implemented together with other changes.
file_name = 'RecoTauTag/TrainingFiles/data/DPFTauId/DPFIsolation_2017v0.pb' | ||
self.process.dpfTau2016v0 = self.cms.EDProducer("DPFIsolation", | ||
pfcands = self.cms.InputTag('packedPFCandidates'), | ||
taus = self.cms.InputTag('slimmedTaus'), |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Cosmetics: please adjust =
together with other changes.
file_name = 'RecoTauTag/TrainingFiles/data/DPFTauId/DPFIsolation_2017v1.pb' | ||
self.process.dpfTau2016v1 = self.cms.EDProducer("DPFIsolation", | ||
pfcands = self.cms.InputTag('packedPFCandidates'), | ||
taus = self.cms.InputTag('slimmedTaus'), |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Cosmetics: please adjust =
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Please find comments inline - cosmetics.
@@ -249,7 +249,7 @@ class DeepTauId : public deep_tau::DeepTauBase { | |||
desc.add<edm::InputTag>("electrons", edm::InputTag("slimmedElectrons")); | |||
desc.add<edm::InputTag>("muons", edm::InputTag("slimmedMuons")); | |||
desc.add<edm::InputTag>("taus", edm::InputTag("slimmedTaus")); | |||
desc.add<std::string>("graph_file", "RecoTauTag/TrainingFiles/data/DeepTauId/deepTau_2017v1_20L1024N_quantized.pb"); | |||
desc.add<std::string>("graph_file", "RecoTauTag/TrainingFiles/data/DeepTauId/deepTau_2017v1_20L1024N.pb"); |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
quantized
should be default, do you agree?
@@ -47,7 +47,7 @@ class DPFIsolation : public deep_tau::DeepTauBase { | |||
desc.add<edm::InputTag>("pfcands", edm::InputTag("packedPFCandidates")); | |||
desc.add<edm::InputTag>("taus", edm::InputTag("slimmedTaus")); | |||
desc.add<edm::InputTag>("vertices", edm::InputTag("offlineSlimmedPrimaryVertices")); | |||
desc.add<std::string>("graph_file", "RecoTauTag/TrainingFiles/data/DPFTauId/DPFIsolation_2017v0_quantized.pb"); | |||
desc.add<std::string>("graph_file", "RecoTauTag/TrainingFiles/data/DPFTauId/DPFIsolation_2017v0.pb"); |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
quantized
should be default, do you agree?
|
||
|
||
def getDpfTauVersion(self, file_name): | ||
version_search = re.search('201[125678]v([0-9]+)[\._]', file_name) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
It is quite clear, but please add a comment about expected pattern in a file name for those who might not understand regex syntax; something like this: "File name should contain a version label with data takig year (2011-2, 2015-8) and version number (vX), e.g. 2017v0."
def getDpfTauVersion(self, file_name): | ||
version_search = re.search('201[125678]v([0-9]+)[\._]', file_name) | ||
if not version_search: | ||
raise RuntimeError('File "{}" has an invalid name pattern. Unable to extract version number.'.format(file_name)) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Please add to description of the error information on an expected version label as in a comment above.
Tested, results as in version before final cleaning provided via #105 |
* First implementation of deep tau id. * Building dpf isolation module * Adding in v1 * Adding in runTauIDMVA for other users * making things fully reproducible * Reorganisation of configuration files: cff split to cfi and cff * Some code cleaning * adapt to cfi/cff reorganization * Review of DPF and DeepTauId code. - Defined base class for deep tau discriminators. - Removed weight files from home cms repository. Now using weights from cms-data. - Defined WP for both discriminators. Now all discriminators return the corresponding WP results. - Removed cfi files. Using fillDescriptions instead. - General code review and cleaning. * Added example of a python configuration file to produce pat::Tau collection with the new Tau-Ids * requested changes on runDeepTauIDsOnMiniAOD.py * Clean runTauIdMVA.py tool and test config to run tauIDs * Made DeepTauId and DPFIsolation thread-safe * Finish implement thread-safe requirements on DPFIsolation * Disable DPFTau_2016_v1 and issue some warnings * Remove assigning value of variable to itself * - Implemented on runTauIdMVA the option to work with new training files quantized - Added a new parameter 'version' on runTauIdMVA, used on DPFIsolation - Changes on DeepTauId to reduce memory consumption * - Implementation of global cache to avoid reloading graph for each thread and reduce the memory consuption - Creation of class DeepTauCache in DeepTauBase, in which now is created graph and session - Implementation of two new static methods inside the class DeepTauBase: initializeGlobalCache and globalEndJob. The graph and DeepTauCache object are created now inside initializeGlobalCache * Applied changes on DeepTauBase to allow load new training files using memory mapping * Implemented TauWPThreshold class. TauWPThreshold class parses WP cut string (or value) provided in the python configuration. It is needed because the use of the standard StringObjectFunction class to parse complex expression results in an extensive memory usage (> 100 MB per expression). * Remove the qm.pb input files and leaving just the quantized and the original files * -Overall, changes to improve memory usage, among these are: - Implementation of global cache to avoid reloading graph for each thread - Creation of two new static methods inside the class DeepTauBase: initializeGlobalCache and globalEndJob. The graph and DeepTauCache object are created now inside initializeGlobalCache. The memory consumption of initializeGlobalCache for the original, quantized and files that are load using memory mapping method are in the memory_usage.pdf file - Implemented configuration to use new training files quantized, and set them as default - Implementation of configuration for load files using memory mapping. In our case there wasn't any improvement, respect at the memory consumption of this method, respect the quantized files, so this is not used, but set for future training files - General code review and cleaning. * Applied style comments * Applied style comments * Applied comments * Change to be by default the original training file for deepTau, instead of the quantized * Changes regarding forward-porting DNN-related developments from the PRs #105 and #106 from 94X to 104X * Applied commets of previus PR * cleaning code * Modification in the config to work with new label in files * Applied comment about the expected format of name of training file * Fix in last commit * Applied last comments * Changes regarding forward-porting DNN-related developments from the PRs #105 and #106 from 94X to 104X * Applied @perrotta comments on 104X * Fix error * Applied comments * Applied comments * Fix merge problem * Applied a few commets * Applied more changes * Applied a few small followups * Fixed error on DPFIsolation * Update DPFIsolation.cc * - RecoTauTag/RecoTau/plugins/DeepTauId.cc: Remove ' clusterVariables 'as a class member - RecoTauTag/RecoTau/test/runDeepTauIDsOnMiniAOD.py: Update globaltag and sample * Added changes in RecoTauTag/RecoTau/python/tools/runTauIdMVA.py made in the commit 194a1d5 from the PR cms-sw#25016 * Fix error on runDeepTauIDsOnMiniAOD * Change the GT in RecoTauTag/RecoTau/test/runDeepTauIDsOnMiniAOD.py