Added benchmark output validation #645

stvoutsin · 2022-02-28T15:48:40Z

Modified test configurations to include md5 checksums for cell outputs. Does not include all cells. as some have output that changes per run (For example including current timestamp).
Upgrade version of aglais-testing that we use to 0.1.8 (which checks the output of the cell with the expected value)
Added notes on test deploy

…n/aglais into feature/upgrade-testing

Zarquan

Looks good. Can we add a test to demonstrate that the test fails when it should and showing what the fail output looks like.
Perhaps by making a copy of the quick config in /tmp and deliberately editing it to set the wrong checksum and then running the test to demonstrate a fail.

stvoutsin · 2022-03-04T14:33:03Z

I've added a bit more output to the testing suite, to include completion status (SUCCESS / SLOW / FAILED) as well as output validity of results (VALID/ INVALID). This might not be the clearest way to show the results, open to suggestions. However I've added some notes on testing this version with the valid checksums, as well as with modified checksums to check that we capture the missmatch.

Zarquan · 2022-03-10T00:18:15Z

notes/stv/20220304-test-deploy-validate-checksums.txt

+		'status': 'SLOW',
+		'msg': 'Expected/Actual output missmatch of cell #0! ',
+		'valid': 'FALSE'


This is what we need to avoid.

We need one value that indicates if the test passes or fails. If we are just checking the status, then SLOW suggests that the test passed but was slower than expected. Having to check a combination of both status and valid to determine if the test passed or not is a recipe for confusion.

Other suggestions aside (#649) the overall test result should be represented in a single value, so if the output validation fails, then status should be set to FAIL. Completing the test run but producing the wrong results should not be considered a success.

I've made some modifications to the benchmarker to match your suggestions at (#649) and some notes testing these changes

Zarquan

Looks good 👍

Zarquan · 2022-03-17T14:33:39Z

notes/stv/20220304-test-deploy-validate-checksums.txt

+		'status': 'SLOW',
+		'msg': 'Expected/Actual output missmatch of cell #0! ',
+		'valid': 'FALSE'


stvoutsin added 5 commits February 24, 2022 14:11

Added md5 checksums for individul cell, and upgrade test suite version

f4dbf35

Fixed expected checksums in test configurations

550e1be

Added notes on benchmarks with output validation

7db49c5

Copy expected output from basic to multiuser config

0be0797

Merge branch 'feature/upgrade-testing' of https://github.com/stvoutsi…

444a9c7

…n/aglais into feature/upgrade-testing

Zarquan requested changes Mar 3, 2022

View reviewed changes

stvoutsin added 2 commits March 4, 2022 14:27

Upgrading aglais-benchmark package to 0.1.9

163f12c

Added notes on testing validation

36bf416

Zarquan mentioned this pull request Mar 8, 2022

Verify notebook results #599

Closed

Zarquan requested changes Mar 10, 2022

View reviewed changes

stvoutsin added 3 commits March 15, 2022 16:05

Upgrade aglais-testing version

fa9faf8

Updated aglais testing version

227ce75

Add notes on testing benchmarker

9e2bbff

stvoutsin mentioned this pull request Mar 16, 2022

Enable Apache arrow #675

Merged

Zarquan approved these changes Mar 17, 2022

View reviewed changes

Zarquan merged commit 973819e into wfau:master Mar 17, 2022

stvoutsin deleted the feature/upgrade-testing branch July 4, 2022 17:08

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Added benchmark output validation #645

Added benchmark output validation #645

stvoutsin commented Feb 28, 2022

Zarquan left a comment

stvoutsin commented Mar 4, 2022

Zarquan Mar 10, 2022

stvoutsin Mar 16, 2022

Zarquan Mar 17, 2022

Zarquan left a comment

Zarquan Mar 17, 2022

Added benchmark output validation #645

Added benchmark output validation #645

Conversation

stvoutsin commented Feb 28, 2022

Zarquan left a comment

Choose a reason for hiding this comment

stvoutsin commented Mar 4, 2022

Zarquan Mar 10, 2022

Choose a reason for hiding this comment

stvoutsin Mar 16, 2022

Choose a reason for hiding this comment

Zarquan Mar 17, 2022

Choose a reason for hiding this comment

Zarquan left a comment

Choose a reason for hiding this comment

Zarquan Mar 17, 2022

Choose a reason for hiding this comment