Add performance and longevity testing validation to the release template #1752

Merged: 6 commits merged into opensearch-project:main from the release_template branch on Mar 17, 2022

Conversation

@travisbenedict (Contributor) commented Mar 14, 2022

Signed-off-by: Travis Benedict <benedtra@amazon.com>

Description

Add steps to the release template for validating performance and longevity test results. These steps are not necessarily exhaustive but can serve as a starting point for our release process. Ultimately, these validations should be automated as our performance testing automation is built out more.

More details on the testing setup can be found in this issue outlining the results of performance experiments that I conducted - opensearch-project/OpenSearch#2461

Issues Resolved

Check List

  • Commits are signed per the DCO using --signoff

By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.
For more information on following Developer Certificate of Origin and signing off your commits, please check here.

Signed-off-by: Travis Benedict <benedtra@amazon.com>
@travisbenedict requested a review from a team as a code owner on March 14, 2022 18:00
@dblock (Member) left a comment

I think the long page of details of how to identify regression tests doesn't belong here. We have a TODO in https://github.com/opensearch-project/opensearch-build/tree/main/src/test_workflow#performance-tests to fill out performance testing, care to move it there as part of this PR?

Signed-off-by: Travis Benedict <benedtra@amazon.com>
@@ -99,7 +99,65 @@ opensearch-dashboards=https://ci.opensearch.org/ci/dbc/bundle-build-dashboards/1

### Performance Tests

TODO
TODO: Add instructions on how to run performance tests with `test.sh`
@travisbenedict (Contributor, Author) commented:

These instructions can be updated with #1671 or after it's merged

Signed-off-by: Travis Benedict <benedtra@amazon.com>



#### How to identify regressions in performance tests
@dblock (Member) commented:

Add to the TOC, clean up capitalization to match other topics.

I would rename to "Identifying Regressions in Performance Tests"


Disclaimer: the guidelines listed below were determined based on empirical testing using OpenSearch Benchmark.
These tests were run against OpenSearch 1.2 build #762 and used the nyc_taxis workload with 2 warmup and 3 test iterations.
The values listed below are **not** applicable to other configurations. More details on the test setup can be found here: https://github.com/opensearch-project/OpenSearch/issues/2461
@dblock (Member) commented:

This disclaimer is scary. It says that you cannot trust results. Try to be more prescriptive, remove that this is a disclaimer. What is one supposed to actually do? Run tests, then compare results. That's what this doc should say.

@travisbenedict (Contributor, Author) replied:

Good points. I'll rewrite this section
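
For reference, the setup described in that section can be reproduced with OpenSearch Benchmark roughly as follows. This is a minimal sketch: the flag values and the `--workload-params` keys are assumptions about the opensearch-benchmark CLI and the nyc_taxis workload, not taken from this PR, and the target host is a placeholder.

```python
import subprocess

# Minimal sketch of the benchmark run described above: nyc_taxis workload against an
# already-running OpenSearch 1.2 cluster, 2 warmup and 3 test iterations.
# The --workload-params keys are assumptions; check the workload's actual parameter
# names before relying on them.
subprocess.run(
    [
        "opensearch-benchmark", "execute-test",
        "--workload=nyc_taxis",
        "--pipeline=benchmark-only",        # benchmark an existing cluster, don't provision one
        "--target-hosts=localhost:9200",    # assumed test cluster endpoint
        '--workload-params={"warmup_iterations": 2, "test_iterations": 3}',
    ],
    check=True,
)
```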


Note that performance regressions are based on decreased indexing throughput and/or increased query latency.

Additionally, error rates on the order of 0.01% are acceptable, though higher ones may be cause for concern.
@dblock (Member) commented:

This should be a section of its own. What happens if the error rates are higher? What does one do?
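
To make the comparison concrete, here is a minimal sketch of how a baseline and a candidate run could be checked against thresholds like the ones above. The result fields and the throughput/latency thresholds are illustrative assumptions, not the actual opensearch-benchmark output format or officially agreed limits.

```python
# Minimal sketch: flag regressions between a baseline and a candidate performance run.
# The result dictionaries and threshold values are illustrative assumptions.

def find_regressions(baseline: dict, candidate: dict,
                     max_throughput_drop_pct: float = 5.0,
                     max_latency_increase_pct: float = 5.0,
                     max_error_rate: float = 0.0001) -> list:
    """Return human-readable descriptions of possible regressions."""
    problems = []

    # Indexing throughput: lower is worse.
    drop = 100.0 * (baseline["index_throughput"] - candidate["index_throughput"]) / baseline["index_throughput"]
    if drop > max_throughput_drop_pct:
        problems.append(f"indexing throughput dropped {drop:.1f}%")

    # Query latency: higher is worse.
    increase = 100.0 * (candidate["query_latency_p90"] - baseline["query_latency_p90"]) / baseline["query_latency_p90"]
    if increase > max_latency_increase_pct:
        problems.append(f"p90 query latency increased {increase:.1f}%")

    # Error rates on the order of 0.01% (0.0001) are acceptable; anything above that
    # deserves a closer look.
    if candidate["error_rate"] > max_error_rate:
        problems.append(f"error rate {candidate['error_rate']:.4%} exceeds {max_error_rate:.4%}")

    return problems


if __name__ == "__main__":
    baseline = {"index_throughput": 52000, "query_latency_p90": 310, "error_rate": 0.0}
    candidate = {"index_throughput": 47000, "query_latency_p90": 355, "error_rate": 0.0002}
    for problem in find_regressions(baseline, candidate):
        print("possible regression:", problem)
```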




#### How to identify issues in longevity tests
@dblock (Member) commented:

There's zero information on what longevity tests are anywhere in these docs. To an uneducated reader it's impossible to understand what one does with it. Please provide context to all these things.

@codecov-commenter commented Mar 16, 2022

Codecov Report

Merging #1752 (1841fc2) into main (faf26d1) will increase coverage by 5.20%.
The diff coverage is n/a.

@@              Coverage Diff              @@
##               main     #1752      +/-   ##
=============================================
+ Coverage     94.79%   100.00%   +5.20%     
=============================================
  Files           169         6     -163     
  Lines          3536       105    -3431     
  Branches         26        19       -7     
=============================================
- Hits           3352       105    -3247     
+ Misses          181         0     -181     
+ Partials          3         0       -3     
Impacted Files Coverage Δ
src/test_workflow/integ_test/integ_test_runner.py
...ests/jenkins/jobs/CopyContainer_docker_Jenkinsfile
...est_workflow/bwc_test/bwc_test_suite_opensearch.py
.../assemble_workflow/bundle_opensearch_dashboards.py
tests/jenkins/jobs/CreateReleaseTag_Jenkinsfile
...workflow/opensearch/build_artifact_check_plugin.py
src/run_manifests.py
src/ci_workflow/ci_check_list_dist.py
src/paths/tree_walker.py
...bs/PrintArtifactDownloadUrlsForStaging_Jenkinsfile
... and 153 more

Legend: Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update faf26d1...1841fc2.

Signed-off-by: Travis Benedict <benedtra@amazon.com>
Signed-off-by: Travis Benedict <benedtra@amazon.com>
@dblock (Member) left a comment

This is a lot better, thank you! I went super nitpicky on the comments, feel free to make some/none/all the changes.

.github/ISSUE_TEMPLATE/release_template.md (review thread outdated, resolved)
src/test_workflow/README.md (review thread outdated, resolved)
@@ -3,6 +3,9 @@
- [Integration Tests](#integration-tests)
- [Backwards Compatibility Tests](#backwards-compatibility-tests)
- [Performance Tests](#performance-tests)
- [Identifying Regressions in Performance Tests](#identifying-regressions-in-performance-tests)
- [Identifying Regressions in Nightly Performance Tests](#identifying-regressions-in-nightly-performance-tests)
@dblock (Member) commented:

Did you mean for all of these to be at the same level?

@travisbenedict (Contributor, Author) replied:

I meant for the Nightly test section to be a subsection of performance tests in general, since it's worth reading that section first to get some background.

src/test_workflow/README.md (review thread outdated, resolved)
src/test_workflow/README.md (review thread outdated, resolved)
src/test_workflow/README.md (review thread outdated, resolved)
Signed-off-by: Travis Benedict <benedtra@amazon.com>
@gaiksaya merged commit 087b64b into opensearch-project:main on Mar 17, 2022
@travisbenedict (Contributor, Author) commented:

Thanks for the help on this @dblock. I really appreciate the feedback

@travisbenedict deleted the release_template branch on March 17, 2022 17:27