DAOS-16016 test: Coverity 2555785 fix and run test_rebuild_35 #14619

kccain · 2024-06-20T15:49:52Z

Check return code of daos_cont_destroy() call in the test
rebuild_cont_destroy_and_reintegrate(). Also make sure that
test is actually executed at least in daily_regression CI.

Skip-unit-tests: true
Skip-fault-injection-test: true
Skip-func-hw-test-medium-md-on-ssd: false
Test-tag: test_rebuild_35
Test-repeat: 10

Before requesting gatekeeper:

Two review approvals and any prior change requests have been resolved.
Testing is complete and all tests passed or there is a reason documented in the PR why it should be force landed and forced-landing tag is set.
Features: (or Test-tag*) commit pragma was used or there is a reason documented that there are no appropriate tags for this PR.
Commit messages follows the guidelines outlined here.
Any tests skipped by the ticket being addressed have been run and passed in the PR.

Gatekeeper:

github-actions · 2024-06-20T15:50:15Z

Errors are Unable to load ticket data
https://daosio.atlassian.net/browse/DAOS-16016

kccain · 2024-06-20T16:00:20Z

src/tests/ftest/daos_test/rebuild.py

+        Use cases:
+            Core tests for daos_test rebuild
+
+        :avocado: tags=all,pr,daily_regression


@phender and @daltonbohning I'm working on this separate patch due to DAOS-16016. The issue of extra time expenditure for new tests like this has come up in another PR #14584

This one I estimate will run in about 2 minutes of execution time (based only on running with tmpfs on a local developer cluster, not functional hw testing in CI). Is it worth adding that to per-pr testing? I can see an argument for covering the new rebuild tests (particularly any that are very lengthy) to be tested only as part of daily_regression (not pr). It's still better than what we have now, which is no regular coverage for a few of these recently-developed tests. What do you think for this one (pr,daily_regression or just daily_regression)? We can take a look at how the hw functional testing shakes out too, to confirm my time estimate.

It's not just the test time of a single test that's the issue. It's when we add 2 minutes here, 2 minutes there, ... and suddenly we have several hours of PR testing. For rebuild specifically, this is a huge issue. Last I checked, there is something like 6-8 hours of rebuild pr testing. Which means even if a PR doesn't affect rebuild in any way, it still spends 6-8 hours running rebuild testing. (this is terrible for CI time and one of the contributors to slow turnaround times)

I can't tell you whether this test should be pr or daily or weekly because I don't know how likely it is to be affected by code changes. So instead, ask yourself

When pushing a PR, can you generally tell based on the code changes made that THIS test needs to be ran? And would using Features: rebuild be sensible?

How critical is this test? If this test fails, does it mean rebuild is effectively unusable? Or is it some edge case?

Realistically, we can't run everything in PR, so the idea is to mostly tag critical, quick tests with pr. Then, PRs should use Features to pull in additional test areas as needed. But at the same time, we don't have a perfect code <-> test mapping, so we don't always know whether we need to run a test.

Yeah I think this one is a bit specific to reintegration and can be run in daily_regression, and in Features: rebuild in certain PRs, so we catch any inadvertent regressions in a fairly timely manner

daltonbohning · 2024-06-20T16:45:50Z

src/tests/ftest/daos_test/rebuild.py

+
+        :avocado: tags=all,daily_regression
+        :avocado: tags=hw,medium
+        :avocado: tags=unittest


Recommend rebuild tag here, so it's ran with Features: rebuild

Suggested change

:avocado: tags=unittest

:avocado: tags=unittest,rebuild

Check return code of daos_cont_destroy() call in the test rebuild_cont_destroy_and_reintegrate(). Also make sure that test is actually executed at least in daily_regression CI. Skip-unit-tests: true Skip-fault-injection-test: true Skip-func-hw-test-medium-md-on-ssd: false Test-tag: test_rebuild_35 Test-repeat: 10 Signed-off-by: Kenneth Cain <kenneth.c.cain@intel.com>

kccain · 2024-06-23T20:35:23Z

20 loops passed in CI functional hw testing (10 loops in functional HW medium, 10 loops in functional HW medium MD on SSD configuration). https://build.hpdd.intel.com/job/daos-stack/job/daos/job/PR-14619/3/

Ready for review

Check return code of daos_cont_destroy() call in the test rebuild_cont_destroy_and_reintegrate(). Also make sure that test is actually executed at least in daily_regression CI. Skip-unit-tests: true Skip-fault-injection-test: true Skip-func-hw-test-medium-md-on-ssd: false Test-tag: test_rebuild_35 Test-repeat: 10 Signed-off-by: Kenneth Cain <kenneth.c.cain@intel.com>

#14642) Check return code of daos_cont_destroy() call in the test rebuild_cont_destroy_and_reintegrate(). Also make sure that test is actually executed at least in daily_regression CI. Signed-off-by: Kenneth Cain <kenneth.c.cain@intel.com>

…tack#14619) Check return code of daos_cont_destroy() call in the test rebuild_cont_destroy_and_reintegrate(). Also make sure that test is actually executed at least in daily_regression CI. Signed-off-by: Kenneth Cain <kenneth.c.cain@intel.com>

kccain commented Jun 20, 2024

View reviewed changes

kccain force-pushed the kccain/daos_16016_master branch from 35aff04 to ccdbc01 Compare June 20, 2024 16:38

daltonbohning reviewed Jun 20, 2024

View reviewed changes

kccain force-pushed the kccain/daos_16016_master branch from ccdbc01 to 56ebdba Compare June 20, 2024 17:04

kccain requested review from liuxuezhao and daltonbohning June 23, 2024 20:33

kccain marked this pull request as ready for review June 23, 2024 20:34

kccain requested review from a team as code owners June 23, 2024 20:34

daltonbohning approved these changes Jun 24, 2024

View reviewed changes

kccain added the forced-landing The PR has known failures or has intentionally reduced testing, but should still be landed. label Jun 24, 2024

kccain requested a review from janekmi June 24, 2024 17:59

mchaarawi approved these changes Jun 25, 2024

View reviewed changes

knard38 approved these changes Jun 25, 2024

View reviewed changes

kccain requested a review from a team June 25, 2024 12:28

mchaarawi merged commit e5d26ea into master Jun 25, 2024
45 checks passed

mchaarawi deleted the kccain/daos_16016_master branch June 25, 2024 12:29

kccain added clean-cherry-pick Cherry-pick from another branch that did not require additional edits and removed clean-cherry-pick Cherry-pick from another branch that did not require additional edits labels Jun 25, 2024

mjmac mentioned this pull request Nov 13, 2024

mjmac/DAOS 16787 google 2.6 #15498

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

DAOS-16016 test: Coverity 2555785 fix and run test_rebuild_35 #14619

DAOS-16016 test: Coverity 2555785 fix and run test_rebuild_35 #14619

kccain commented Jun 20, 2024 •

edited

Loading

github-actions bot commented Jun 20, 2024

kccain Jun 20, 2024

daltonbohning Jun 20, 2024

kccain Jun 20, 2024

daltonbohning Jun 20, 2024

kccain commented Jun 23, 2024

DAOS-16016 test: Coverity 2555785 fix and run test_rebuild_35 #14619

DAOS-16016 test: Coverity 2555785 fix and run test_rebuild_35 #14619

Conversation

kccain commented Jun 20, 2024 • edited Loading

Before requesting gatekeeper:

Gatekeeper:

github-actions bot commented Jun 20, 2024

kccain Jun 20, 2024

Choose a reason for hiding this comment

daltonbohning Jun 20, 2024

Choose a reason for hiding this comment

kccain Jun 20, 2024

Choose a reason for hiding this comment

daltonbohning Jun 20, 2024

Choose a reason for hiding this comment

kccain commented Jun 23, 2024

kccain commented Jun 20, 2024 •

edited

Loading