-
Notifications
You must be signed in to change notification settings - Fork 5
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Further clean up the CA ntuplet generator #610
Further clean up the CA ntuplet generator #610
Conversation
This PR tries to address the last few comments from https://github.com/cms-sw/cmssw/pull/31722/files#diff-eadc36519665df48f9724340a6037730e07f8d896606bdf89707acb65a368736 . |
// in principle we can use "nhits" to heuristically dimension the workspace... | ||
// overkill to use template here (std::make_unique would suffice) | ||
// device_isOuterHitOfCell_ = Traits:: template make_unique<GPUCACell::OuterHitOfCell[]>(cs, std::max(1U,nhits), stream); | ||
device_isOuterHitOfCell_.reset( | ||
(GPUCACell::OuterHitOfCell *)malloc(std::max(1U, nhits) * sizeof(GPUCACell::OuterHitOfCell))); | ||
// use "nhits" to heuristically dimension the workspace | ||
|
||
// no need to use the Traits allocations, since we know this is being compiled for the CPU | ||
//device_isOuterHitOfCell_ = Traits::template make_unique<GPUCACell::OuterHitOfCell[]>(std::max(1U, nhits), stream); | ||
device_isOuterHitOfCell_ = std::make_unique<GPUCACell::OuterHitOfCell[]>(std::max(1U, nhits)); |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I think I've fixed the commented out allocations as well (at least, they compile now), and I've changed the used ones from malloc
to make_unique
(to be on the safe side, in case new
were to do something different than plain malloc
).
@VinInn do you prefer to keep the commented out version ?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
fine for the time being. Once we will have an agreed heterogeneous allocator it will be changed together all other instances....
Validation summaryReference release CMSSW_11_3_0_pre5 at 23fad57 Validation plots/RelValTTbar_14TeV/CMSSW_11_2_0-PU_112X_mcRun3_2021_realistic_v14-v1/GEN-SIM-DIGI-RAW
/RelValZMM_14/CMSSW_11_2_0-112X_mcRun3_2021_realistic_v14-v1/GEN-SIM-DIGI-RAW
/RelValZEE_14/CMSSW_11_2_0-112X_mcRun3_2021_realistic_v13-v1/GEN-SIM-DIGI-RAW
Validation plots (CPU vs GPU)/RelValTTbar_14TeV/CMSSW_11_2_0-PU_112X_mcRun3_2021_realistic_v14-v1/GEN-SIM-DIGI-RAW
/RelValZMM_14/CMSSW_11_2_0-112X_mcRun3_2021_realistic_v14-v1/GEN-SIM-DIGI-RAW
/RelValZEE_14/CMSSW_11_2_0-112X_mcRun3_2021_realistic_v13-v1/GEN-SIM-DIGI-RAW
Throughput plots/EphemeralHLTPhysics1/Run2018D-v1/RAW run=323775 lumi=53logs and
|
No description provided.