Cleanup for the framework PR #426

makortel · 2019-12-02T22:35:29Z

PR description:

While preparing a PR for CMSSW master I noticed that the unit tests (within the "framework" part) were not fully working on a non-GPU machine. As a subsequent cleanup, this PR proposes to:

remove an unnecessary import of the gpu Modifier in testCUDASwitch_cfg.py, and fix an input configuration parameter relevant when running without GPU
remove testCUDA_cfg.py as not really useful
rename exitSansCUDADevices.cc to requireCUDADevices.cc for consistency

Then I realized that, at least with the catch2 unit test framework, just exiting the program in requireCUDADevices() is too harsh, especially when a subset of the tests (in a file) would be useful to run without GPUs. Therefore I added hasCUDADevices(), which returns a bool and allows me to return from catch2 tests instead of calling exit().
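To illustrate the difference between the two helpers, here is a minimal sketch. It is not the code from this PR: the `DeviceCounter` parameter is a hypothetical stand-in for the CUDA runtime query, added so the example runs on a machine without CUDA, and the exit status is an assumption.

```cpp
#include <cassert>
#include <cstdlib>
#include <functional>
#include <iostream>

// Hypothetical stand-in for the CUDA runtime device query; injected here so
// that the sketch compiles and runs on a machine without CUDA installed.
using DeviceCounter = std::function<int()>;

// Reports whether any device is visible and leaves the decision to the caller.
bool hasCUDADevices(const DeviceCounter& countDevices) { return countDevices() > 0; }

// Quits the whole program when no device is visible.
// The exit status used here is an assumption, not taken from the PR.
void requireCUDADevices(const DeviceCounter& countDevices) {
  if (not hasCUDADevices(countDevices)) {
    std::cerr << "No CUDA devices available, exiting" << std::endl;
    std::exit(EXIT_SUCCESS);
  }
}

// A test body can skip its GPU-only part and still return normally.
// Returns true if the GPU-only part was executed.
bool gpuOnlyPart(const DeviceCounter& countDevices) {
  if (not hasCUDADevices(countDevices)) {
    return false;  // skipped; control goes back to the test runner
  }
  return true;  // GPU-dependent checks would go here
}
```

With the bool-returning helper, the choice between skipping and aborting is made by each test rather than by the helper.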

PR validation:

Code compiles, unit tests run on both non-GPU and GPU machines.

fwyzard · 2019-12-02T22:45:50Z

> Then I realized that, at least with the catch2 unit test framework, just exiting the program in requireCUDADevices() is too harsh, especially when a subset of the tests (in a file) would be useful to run without GPUs.

Do we have examples of this (running a subset of the tests without GPUs) ?

makortel · 2019-12-02T22:54:53Z

> > Then I realized that, at least with the catch2 unit test framework, just exiting the program in requireCUDADevices() is too harsh, especially when a subset of the tests (in a file) would be useful to run without GPUs.
>
> Do we have examples of this (running a subset of the tests without GPUs) ?

cmssw/CUDADataFormats/Common/test/test_CUDAProduct.cc

Lines 25 to 33 in 832e57f

    
    TEST_CASE("Use of CUDAProduct template", "[CUDACore]") {
      SECTION("Default constructed") {
        auto foo = CUDAProduct<int>();
        REQUIRE(!foo.isValid());
        auto bar = std::move(foo);
      }
      if (not hasCUDADevices()) {

and

cmssw/HeterogeneousCore/CUDATest/test/test_TestCUDAProducerGPUFirst.cc

Lines 15 to 54 in 832e57f

    
    TEST_CASE("Standard checks of TestCUDAProducerGPUFirst", s_tag) {
      const std::string baseConfig{
          R"_(from FWCore.TestProcessor.TestProcess import *
    process = TestProcess()
    process.load("HeterogeneousCore.CUDAServices.CUDAService_cfi")
    process.toTest = cms.EDProducer("TestCUDAProducerGPUFirst")
    process.moduleToTest(process.toTest)
    )_"};

      edm::test::TestProcessor::Config config{baseConfig};

      SECTION("base configuration is OK") { REQUIRE_NOTHROW(edm::test::TestProcessor(config)); }

      SECTION("No event data") {
        // Calls produce(), so don't call without a GPU
        if (not hasCUDADevices()) {
          return;
        }
        edm::test::TestProcessor tester(config);
        REQUIRE_NOTHROW(tester.test());
      }

      SECTION("beginJob and endJob only") {
        edm::test::TestProcessor tester(config);
        REQUIRE_NOTHROW(tester.testBeginAndEndJobOnly());
      }

      SECTION("Run with no LuminosityBlocks") {
        edm::test::TestProcessor tester(config);
        REQUIRE_NOTHROW(tester.testRunWithNoLuminosityBlocks());
      }

      SECTION("LuminosityBlock with no Events") {
        edm::test::TestProcessor tester(config);
        REQUIRE_NOTHROW(tester.testLuminosityBlockWithNoEvents());
      }
    }

makortel · 2019-12-02T23:03:34Z

Ok, neither of them is really testing essential functionality (on a non-GPU machine).

On the other hand, demonstrating that a test CUDA producer can be constructed and run through lumi and run transitions without a GPU has some value (since that is what actually happens today with SwitchProducer).

Alternatively we could have separate unit test executables for GPU and non-GPU cases.

fwyzard · 2019-12-03T08:04:02Z

@makortel I don't mind adding the extra possibility - however I'm confused about the tests themselves...

In test_CUDAProduct.cc there is only one test, so what is the difference between return and exit() ?

In test_TestCUDAProducerGPUFirst.cc there are multiple SECTIONs, so I guess there it makes sense.

But more importantly, what do you mean by

> a test CUDA producer can be constructed and run through lumi and run transitions without a GPU has some value (since that is what actually happens today with SwitchProducer).

?
I thought the SwitchProducer would only call one of the two modules (the CUDA one if we have a GPU, the non-CUDA otherwise).

makortel · 2019-12-03T14:43:19Z

> In test_CUDAProduct.cc there is only one test, so what is the difference between return and exit() ?

With return the main() of catch2 can finish and report successful tests in stdout. With exit() the program just quits (with proper exit code though).
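The point about return versus exit() can be sketched with a toy test runner. This is not catch2 code: `TestCase`, `runAll` and `makeTests` are hypothetical names invented for the illustration, and the GPU check is reduced to a plain bool.

```cpp
#include <cassert>
#include <functional>
#include <iostream>
#include <string>
#include <vector>

// Toy model of a test runner; this is not catch2 code.
struct TestCase {
  std::string name;
  std::function<void()> body;
};

// Runs every test body and prints a summary at the end.
// The summary is reached only if each body returns to the runner.
int runAll(const std::vector<TestCase>& tests) {
  int completed = 0;
  for (const auto& test : tests) {
    test.body();
    ++completed;
  }
  std::cout << "Completed " << completed << " test case(s)" << std::endl;
  return completed;
}

// Builds two tests; the second one needs a GPU and returns early without one.
std::vector<TestCase> makeTests(bool gpuAvailable, int& gpuChecksRun) {
  return {
      {"default constructed", [] { /* GPU-independent checks */ }},
      {"needs a device",
       [gpuAvailable, &gpuChecksRun] {
         if (not gpuAvailable) {
           return;  // std::exit() here would skip the summary in runAll()
         }
         ++gpuChecksRun;  // GPU-dependent checks would go here
       }},
  };
}
```

In this model an early return still counts the test as completed and lets the runner print its summary, whereas a call to std::exit() inside a test body would end the process before the summary line.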

> > a test CUDA producer can be constructed and run through lumi and run transitions without a GPU has some value (since that is what actually happens today with SwitchProducer).
>
> ?
> I thought the SwitchProducer would only call one of the two modules (the CUDA one if we have a GPU, the non-CUDA otherwise).

Right, but that only applies to event transitions (i.e. produce() etc). Construction+destruction and run+lumi transitions take place for all modules that are in Tasks/Sequences via Paths and EndPaths that are specified in Schedule (or all paths if there is no Schedule). I fully agree this is confusing, see issue cms-sw#26438.
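The behaviour described above can be modelled with a small toy example. The types `ToyModule` and `ToySwitch` are hypothetical and are not CMSSW classes; the sketch only mirrors the statement that both branches are constructed and see run and lumi transitions, while event transitions reach only the chosen branch.

```cpp
#include <cassert>
#include <string>
#include <utility>

// Toy model only: these types are hypothetical and are not CMSSW classes.
struct ToyModule {
  std::string label;
  int runTransitions = 0;
  int lumiTransitions = 0;
  int events = 0;
  explicit ToyModule(std::string l) : label(std::move(l)) {}
};

struct ToySwitch {
  // Both branches are constructed regardless of GPU availability.
  ToyModule cuda{"cuda"};
  ToyModule cpu{"cpu"};
  bool gpuAvailable;

  explicit ToySwitch(bool gpu) : gpuAvailable(gpu) {}

  // Run and lumi transitions are delivered to every branch.
  void beginRun() {
    ++cuda.runTransitions;
    ++cpu.runTransitions;
  }
  void beginLumi() {
    ++cuda.lumiTransitions;
    ++cpu.lumiTransitions;
  }

  // Event transitions are delivered only to the chosen branch.
  void event() { ++(gpuAvailable ? cuda : cpu).events; }
};
```

On a machine without a GPU, the `cuda` branch in this model still records run and lumi transitions but never an event, which is why a CUDA test producer needs to survive those transitions without a device.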

fwyzard · 2019-12-03T16:12:28Z

> I fully agree this is confusing, see issue cms-sw#26438.

Oh. Yes, yes, it is.

makortel added 5 commits December 2, 2019 23:17

Remove unnecessary import of gpu Modifier and fix input in testCUDASwitch_cfg.py (f18f404)
testCUDA_cfg.py is not really useful anymore (5e21382)
Rename exitSansCUDADevices.cc to requireCUDADevices.cc (987f29b)
Add hasCUDADevices() for tests to be able to let e.g. catch2 to return gracefully (4349497)
Use hasCUDADevices() in catch2 tests (832e57f)

fwyzard merged commit 41b1c42 into cms-patatrack:CMSSW_11_0_X_Patatrack Dec 3, 2019
