Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

HLTrigger/Timer: add function FastTimerService::fix_for_dqm which replaces check for illegal dqm characters for path so it can be used on paths and labels. #38963

Conversation

gartung
Copy link
Member

@gartung gartung commented Aug 3, 2022

HLTrigger/Timer: add function FastTimerService::fix_for_dqm which replaces check for illegal dqm characters for path so it can be used on paths and labels.

Fixes this error seen when running FastTimerService on a standalone process

03-Aug-2022 15:52:42 CEST  Initiating request to open file /data/cmsbld/jenkins/workspace/ib-run-profiling/file:step2.root
03-Aug-2022 15:52:48 CEST  Successfully opened file /data/cmsbld/jenkins/workspace/ib-run-profiling/file:step2.root
2022-08-03 15:52:59.070874: I tensorflow/core/platform/cpu_feature_guard.cc:142] This TensorFlow binary is optimized with oneAPI Deep Neural Network Library (oneDNN) to use the following CPU instructions in performance-critical operations:  SSE4.1 SSE4.2 AVX AVX2 AVX512F FMA
To enable them in other operations, rebuild TensorFlow with the appropriate compiler flags.
----- Begin Fatal Exception 03-Aug-2022 15:53:07 CEST-----------------------
An exception of category 'BadMonitorElementPathName' occurred while
   [0] Calling EventProcessor::runToCompletion (which does almost everything after beginJob and before endJob)
Exception Message:
 Monitor element path name: 'merService/process RECO modules/ecalDigis@cpu time_thread' uses unacceptable characters.
 Acceptable characters are: /ABCDEFGHIJKLMNOPQRSTUVWXYZabcdefghijklmnopqrstuvwxyz0123456789-+=_()# 
   Additional Info:
      [a] Another exception was caught while trying to clean up files after the primary fatal exception.
----- End Fatal Exception -------------------------------------------------
03-Aug-2022 15:53:07 CEST  Closed file /data/cmsbld/jenkins/workspace/ib-run-profiling/file:step2.root

The error was caused by the ecalDigis@cpu label. The @ character is used in SwitchProducer labels.

Validated locally using profiling scripts.

@cmsbuild
Copy link
Contributor

cmsbuild commented Aug 3, 2022

-code-checks

Logs: https://cmssdt.cern.ch/SDT/code-checks/cms-sw-PR-38963/31411

  • This PR adds an extra 28KB to repository

Code check has found code style and quality issues which could be resolved by applying following patch(s)

…laces check for illegal dqm characters for path so it can be used on paths and labels.
@gartung gartung force-pushed the gartung-HLTrigger-Timer-FastTimerService-fix-module-label-for-dqm branch from 1b29262 to 954a95d Compare August 3, 2022 20:40
@cmsbuild
Copy link
Contributor

cmsbuild commented Aug 3, 2022

+code-checks

Logs: https://cmssdt.cern.ch/SDT/code-checks/cms-sw-PR-38963/31412

  • This PR adds an extra 28KB to repository

@cmsbuild
Copy link
Contributor

cmsbuild commented Aug 3, 2022

A new Pull Request was created by @gartung (Patrick Gartung) for master.

It involves the following packages:

  • HLTrigger/Timer (hlt)

@cmsbuild, @missirol, @Martin-Grunewald can you please review it and eventually sign? Thanks.
@Martin-Grunewald, @missirol, @silviodonato this is something you requested to watch as well.
@perrotta, @dpiparo, @qliphy, @rappoccio you are the release manager for this.

cms-bot commands are listed here

@gartung
Copy link
Member Author

gartung commented Aug 3, 2022

enable profiling

@gartung
Copy link
Member Author

gartung commented Aug 3, 2022

please test

@cmsbuild
Copy link
Contributor

cmsbuild commented Aug 4, 2022

+1

Summary: https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-d69cf0/26629/summary.html
COMMIT: 954a95d
CMSSW: CMSSW_12_5_X_2022-08-03-1100/el8_amd64_gcc10
Additional Tests: PROFILING
User test area: For local testing, you can use /cvmfs/cms-ci.cern.ch/week0/cms-sw/cmssw/38963/26629/install.sh to create a dev area with all the needed externals and cmssw changes.

Comparison Summary

Summary:

  • No significant changes to the logs found
  • Reco comparison results: 2 differences found in the comparisons
  • DQMHistoTests: Total files compared: 51
  • DQMHistoTests: Total histograms compared: 3691510
  • DQMHistoTests: Total failures: 8
  • DQMHistoTests: Total nulls: 0
  • DQMHistoTests: Total successes: 3691480
  • DQMHistoTests: Total skipped: 22
  • DQMHistoTests: Total Missing objects: 0
  • DQMHistoSizes: Histogram memory added: 0.0 KiB( 50 files compared)
  • Checked 212 log files, 49 edm output root files, 51 DQM output files
  • TriggerResults: no differences found

@missirol
Copy link
Contributor

missirol commented Aug 4, 2022

+hlt

@cmsbuild
Copy link
Contributor

cmsbuild commented Aug 4, 2022

This pull request is fully signed and it will be integrated in one of the next master IBs (tests are also fine). This pull request will now be reviewed by the release team before it's merged. @perrotta, @dpiparo, @qliphy, @rappoccio (and backports should be raised in the release meeting by the corresponding L2)

@qliphy
Copy link
Contributor

qliphy commented Aug 4, 2022

+1

@cmsbuild cmsbuild merged commit b4411e1 into cms-sw:master Aug 4, 2022
@@ -731,7 +731,7 @@ void FastTimerService::PlotsPerJob::book(dqm::reco::DQMStore::IBooker& booker,
if (bymodule) {
booker.setCurrentFolder(basedir + "/process " + process.name_ + " modules");
for (unsigned int id : process.modules_) {
auto const& module_name = job.module(id).moduleLabel();
auto const& module_name = fix_for_dqm(job.module(id).moduleLabel());
Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@fwyzard The change is here.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

thanks, I guess I missed it !

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I force pushed the formatting update. Sometimes github does not register force pushed changes.

@fwyzard
Copy link
Contributor

fwyzard commented Aug 4, 2022

however the signature and merge was a bit too eager :-(

@fwyzard
Copy link
Contributor

fwyzard commented Aug 4, 2022

please see #38966

@missirol
Copy link
Contributor

missirol commented Aug 4, 2022

however the signature and merge was a bit too eager :-(

Sorry. Thanks for the PR, Andrea.

@gartung gartung deleted the gartung-HLTrigger-Timer-FastTimerService-fix-module-label-for-dqm branch August 22, 2022 20:25
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

5 participants