Feature 768 regression tests for use cases #810

georgemccabe · 2021-02-18T01:20:23Z

Note: some changes for #768 are in this PR. These changes have been tested and verified that they work properly. I tried to avoid creating extra work to separate out the changes to implement the regression testing logic from that work.

I created a develop-ref branch from the source branch of this pull request. Branches ending with "-ref" trigger the creation of Docker data volumes that are used as "truth" data to diff the output from a pull request.

Pull Request Testing

Describe testing already performed for these changes:

Tested the logic using feature_768_test_regression-ref to test generation of truth data, then created a pull request from feature_768_test_pr into feature_768_test_regression to test that diffing logic executes properly.

Recommend testing for the reviewer(s) to perform, including the location of input datasets, and any additional instructions:

Review code changes
Verify that Docker data volumes for output data exist for all use case groups. They should all start with output-develop and are named based on the use case category and any subset numbers (if applicable). There should be 13 of them (matches the number of use cases groups run in the matrix)

https://hub.docker.com/repository/docker/dtcenter/metplus-data-dev/tags

Review GitHub Actions log output to ensure everything ran as expected
- Log for creating truth data volumes from develop-ref is here:
  https://github.com/dtcenter/METplus/runs/1923380457?check_suite_focus=true
- Log for pull request diff testing can be found under Actions tab (look for run that mentioned it is a pull request). The output diff logs are found in the same log as running each use case group. The use case job will turn red if any use case fails or if any differences are found in the output.

Note: Output from GempakToCF creates files that have a fill value of -9999 but contains many data points that are NaN. This caused issues in the diffing logic, so I changed it to output a WARNING if this case occurs. I previously added logic to loop over every data point and compare them if they are both not NaN values, but it increased execution time greatly. This could be revisited later to improve the diffing logic.

Do these changes include sufficient documentation and testing updates? [Yes]
Will this PR result in changes to the test suite? [Yes]

It makes the test suite much more robust!

Pull Request Checklist

See the METplus Workflow for details.

Complete the PR definition above.
Ensure the PR title matches the feature or bugfix branch name.
Define the PR metadata, as permissions allow.
Select: Reviewer(s), Project(s), and Milestone
After submitting the PR, select Linked Issues with the original issue number.
After the PR is approved, merge your changes. If permissions do not allow this, request that the reviewer do the merge.
Close the linked issue and delete your feature or bugfix branch from GitHub.

…nd removed function to validate length of config values that can be reported in the MET log output, updated tests to match new format and removed test for function that was removed

…s and removed check for list of newly deprecated variables. Moved this check to another location to produce warnings if an expected METPLUS_ variable is not being utilized instead.

…er when it is optional to use

…void failure

…before

…s been converted (from Gempak) or unzipped

…t aren't handled properly by the fill value

… error if user env vars are not set in environment (i.e. if running wrapper without calling run method, like GempakToCF in met_util)

JohnHalleyGotway

I approve of these changes. Details below.

These changes look great and represent a huge improvement to our automated testing. As discussed during the METplus project meeting on 2/18/21, recommend considering the following enhancements:

When differences are detected, add the files from the reference "truth" branch to the artifact created for that pull request GHA. That'll make it really easy for the PR reviewer to investigate the diffs.
Can we make the docker data volumes more self-describing? If possible, add a "/README" file that lists info like the creation date, source branch or SHA. There's potential to easily lose track of what data volumes were created by what version of the code and when.
Recommend adding a mechanism to list files that should be EXCLUDED from the diffing logic. Could be as simple as a text file listing files to be skipped. I'm sure this will come up and be useful.

These changes do NOT need to be made on this branch but should be considered for the future.

As directed, I checked the dtcenter/metplus-data-dev tags and confirmed that 13 are present, one for each of the groups of tests.
Pulled one and confirmed the data looks reasonable:

docker run -it --rm dtcenter/metplus-data-dev:output-develop-met_tool_wrapper ls /data/truth

Reviewed the code changes and there are tons of them... 82 files. Lots and lots of changes included in this PR.
- Changes for automation.
- Changes to METplus python wrappers.
- Changes to METplus conf files.
- Changes to MET config files.

I reviewed some of the changes, but don't fully appreciate all of them. Recommend in the future trying to break down PR's into smaller, more self-contained pieces.

Co-authored-by: bikegeek <minnawin@ucar.edu> Co-authored-by: bikegeek <3753118+bikegeek@users.noreply.github.com> Co-authored-by: Julie.Prestopnik <jpresto@ucar.edu>

georgemccabe added 30 commits February 4, 2021 11:56

removed commented code

98fb427

removed commented code

7d4d910

added support for new env var format for TCStat, cleaned up wrapper a…

63aa4e2

…nd removed function to validate length of config values that can be reported in the MET log output, updated tests to match new format and removed test for function that was removed

improve testing scripts

c76c739

Merge branch 'develop' into feature_768_met_config

29329cc

clean up code to check for deprecated env vars in MET config variable…

23e80c8

…s and removed check for list of newly deprecated variables. Moved this check to another location to produce warnings if an expected METPLUS_ variable is not being utilized instead.

added check for unused env vars in wrapped MET config file

f00a566

made fix to handle case where MET config file is not used for a wrapp…

5b30a5f

…er when it is optional to use

fixed test conf file to include a path to a real MET config file to a…

8fd071e

…void failure

removed commented line

475494e

enhanced diffing logic to check if a file is NetCDF or image

eef2b70

use INIT_BEG/END if TC_PAIRS_INIT_BEG/END are not set

59fd110

set TC_PAIRS_INIT_BEG/END to INIT_BEG/END to achieve same results as …

45ef830

…before

updated PointStat tests and fixed bug in setting obs_window

dded305

get rid of lines that are no longer used

1d20e09

added unit tests for ascii2nc wrapper

64e5f39

fixed missing f-string

fc7c758

set DO_NOT_RUN_EXE to True and INPUT_MUST_EXIST ti False for unit tests

7e57791

fixed config overrides

45ecfdb

rename and add logic to skip certain files like .zip

b6ac622

added script to get truth artifacts

2aff66a

modified GH-A to remove many use cases to test

20f8310

changed tabs to spaces

03688ac

fixed typo

64e45b5

skip run of use case for testing

1ed44c9

fixed typo

db1ee4b

missing import

51c9e66

print command

192e141

commented out other jobs to speed up testing

bb30050

get abspath of path with .. in it

3d9254d

georgemccabe added 15 commits February 17, 2021 12:29

add print statements if no differences were found in diff test

1430486

skip staging directory since the data is typically input data that ha…

e9c8772

…s been converted (from Gempak) or unzipped

fixed catching error if use case or diff test fails

d2e11b9

fixed name of use case to remove conf so diff test will succeed

1e9e02f

add another print if filecmp shows no diffs in text files

43a6d92

fixed always failure even on success

c092f80

fixed relative path to get miniconda script

b6f5c81

fix relative path

96a9d57

skip diffing if any use cases failed

3343706

make error logs more clear

a060c70

fixed relative path to script

1e6e93f

change path to scripts

47d90ca

report a warning and skip is NaN values are found in NetCDF field tha…

68b0280

…t aren't handled properly by the fill value

set environment variables in GempakToCF wrapper just in case, prevent…

3f2c18d

… error if user env vars are not set in environment (i.e. if running wrapper without calling run method, like GempakToCF in met_util)

fixed image diff logic

503aa06

georgemccabe added this to the METplus-4.0.0 milestone Feb 18, 2021

georgemccabe requested a review from JohnHalleyGotway February 18, 2021 01:20

georgemccabe linked an issue Feb 18, 2021 that may be closed by this pull request

Update setting of environment variables for MET config files to add support for all to METPLUS_ vars #768

Closed

22 tasks

georgemccabe added 2 commits February 18, 2021 07:08

print out pixel difference

24edc04

removed pixel different printing

a04e0df

JohnHalleyGotway approved these changes Feb 18, 2021

View reviewed changes

georgemccabe mentioned this pull request Feb 18, 2021

Improve regression testing logic #812

Open

21 tasks

georgemccabe merged commit a22ad67 into develop Feb 18, 2021

georgemccabe deleted the feature_768_test_pr branch February 18, 2021 19:07

georgemccabe mentioned this pull request Feb 18, 2021

Update develop-ref after #810 #813

Merged

10 tasks

georgemccabe linked an issue Feb 18, 2021 that may be closed by this pull request

Investigate using Docker Data Volumes for testing #567

Closed

19 tasks

georgemccabe mentioned this pull request Feb 18, 2021

Investigate using Docker Data Volumes for testing #567

Closed

19 tasks

georgemccabe added a commit that referenced this pull request Feb 18, 2021

Update develop-ref after #810 (#813)

9e2bf71

Co-authored-by: bikegeek <minnawin@ucar.edu> Co-authored-by: bikegeek <3753118+bikegeek@users.noreply.github.com> Co-authored-by: Julie.Prestopnik <jpresto@ucar.edu>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Feature 768 regression tests for use cases #810

Feature 768 regression tests for use cases #810

georgemccabe commented Feb 18, 2021 •

edited

Loading

JohnHalleyGotway left a comment •

edited

Loading

Feature 768 regression tests for use cases #810

Feature 768 regression tests for use cases #810

Conversation

georgemccabe commented Feb 18, 2021 • edited Loading

Pull Request Testing

Pull Request Checklist

JohnHalleyGotway left a comment • edited Loading

Choose a reason for hiding this comment

georgemccabe commented Feb 18, 2021 •

edited

Loading

JohnHalleyGotway left a comment •

edited

Loading