MYNN fix for numerical stability issues with mixing snow #1754

Merged
merged 22 commits into from
Jun 12, 2023


Contributor

@joeolson42 joeolson42 commented May 13, 2023

Description

This PR simply removes snow mixing, reverting to the RRFS Base E configuration. For reasons not yet understood, mixing snow was contributing to numerical instabilities. The root cause will be investigated in the future; for now, snow mixing is deactivated.

There is also a small change to the Leaf Area Index table values so they better match the monthly LAI climatology used in HRRR.

Also, a 2-line change fixes non-standard Fortran flagged by the Cray compiler (from John Michalakes). There is no known impact when using other compilers.

Input data additions/changes

  • No changes are expected to input data.
  • There will be new input data.
  • Input data will be updated.

Anticipated changes to regression tests:

  • No changes are expected to any regression test.
  • Changes are expected to the following tests:

Subcomponents involved:

  • AQM
  • CDEPS
  • CICE
  • CMEPS
  • CMakeModules
  • FV3
  • GOCART
  • HYCOM
  • MOM6
  • NOAHMP
  • WW3
  • stochastic_physics
  • none

Combined with PR's (If Applicable):

N/A

Commit Queue Checklist:

  • Link PR's from all sub-components involved
  • Confirm reviews completed in sub-component PR's
  • Add all appropriate labels to this PR.
  • Run full RT suite on either Hera/Cheyenne with both Intel/GNU compilers
  • Add list of any failed regression tests to "Anticipated changes to regression tests" section.

Linked PR's and Issues:

Testing Day Checklist:

  • This PR is up-to-date with the top of all sub-component repositories except for those sub-components which are the subject of this PR.
  • Move new/updated input data on RDHPCS Hera and propagate input data changes to all supported systems.

Testing Log (for CM's):

  • RDHPCS
    • Intel
      • [x] Hera
      • Orion
      • Jet
      • Gaea
      • Cheyenne
    • GNU
      • Hera
      • Cheyenne
  • WCOSS2
    • Dogwood/Cactus
    • Acorn
  • CI
    • Completed
  • opnReqTest
    • N/A
    • Log attached to comment

@joeolson42
Contributor Author

Regression test results. The only failures are due to the physics changes, which affect all RAP, HRRR, and regional tests.
RegressionTests_hera.gnu.log
RegressionTests_hera.intel.log

@joeolson42
Contributor Author

Updated regression tests. All tests compile and run. The only failures are due to changes from the baseline in suites that use MYNN or RUC (i.e., HRRR, RAP, regional).
RegressionTests_hera.gnu.log
RegressionTests_hera.intel.log

@zach1221 zach1221 added the Baseline Updates Current baselines will be updated. label Jun 2, 2023
@zach1221 zach1221 self-assigned this Jun 2, 2023
@zach1221
Collaborator

zach1221 commented Jun 2, 2023

@joeolson42 It looks like this PR may be ready: you've provided Hera logs and the CCPP sub-PR has approval. Could you resolve the conflicts and make sure your branch is synced up so we can start testing against it?

@joeolson42
Contributor Author

joeolson42 commented Jun 2, 2023 via email

@zach1221 zach1221 added the Ready for Commit Queue The PR is ready for the Commit Queue. All checkboxes in PR template have been checked. label Jun 2, 2023
@zach1221
Collaborator

zach1221 commented Jun 2, 2023

@BrianCurtis-NOAA I think this PR is ready. I'm going to run jenkins-ci first.

@zach1221 zach1221 added the jenkins-ci Jenkins CI: ORT build/test on docker container label Jun 2, 2023
@zach1221
Collaborator

zach1221 commented Jun 4, 2023

Please see jenkins-ci logs attached. ORTs passed. I'll move forward with the remaining autoRT tests now. @BrianCurtis-NOAA fyi.
ufs-weather-model » ort-docker-pipeline » PR-1754 #1 Console [Jenkins].pdf

@zach1221 zach1221 added hera-BL Run Hera baseline creation cheyenne-BL and removed jenkins-ci Jenkins CI: ORT build/test on docker container hera-BL Run Hera baseline creation cheyenne-BL labels Jun 4, 2023
@zach1221 zach1221 added hera-BL Run Hera baseline creation cheyenne-BL labels Jun 4, 2023
@zach1221 zach1221 added the jet-BL Run Baseline and RT on Jet label Jun 4, 2023
@BrianCurtis-NOAA BrianCurtis-NOAA removed the jet-BL Run Baseline and RT on Jet label Jun 4, 2023
@BrianCurtis-NOAA
Collaborator

Automated RT Failure Notification
Machine: jet
Job: BL
Repo location: /lfs4/HFIP/hfv3gfs/role.epic/autort/pr/1349366313/20230604185457/ufs-weather-model
Please manually delete: /lfs4/HFIP/hfv3gfs/role.epic/RT_RUNDIRS/role.epic/FV3_RT/rt_231427
Please make changes and add the following label back: jet-BL

on-behalf-of @ufs-community <ecc.platform@noaa.gov>
@BrianCurtis-NOAA
Collaborator

@zach1221 I think we're working on #1743 currently

@BrianCurtis-NOAA BrianCurtis-NOAA removed the hera-BL Run Hera baseline creation label Jun 5, 2023
@zach1221
Collaborator

zach1221 commented Jun 6, 2023

@joeolson42 The majority of the regression test cases passed on Gaea when run against your PR. However, cpld_control_p8_faster and cpld_debug_p8 are failing, and the cause appears to be system related. I recommend we turn these two tests off for Gaea in rt.conf for the time being and create a separate issue to have this resolved. This will allow us to keep moving with this PR. Please let me know your thoughts. @jkbk2004

@joeolson42
Contributor Author

joeolson42 commented Jun 6, 2023 via email

@zach1221
Collaborator

Hello, an update on issue #1790: setting tasks per node to TPN=24 in default_vars.sh lets the majority of cases in rt.conf pass, except for about 10 consisting of various hafs, regional, and datm cases. Those failing cases also pass if we keep them at TPN=36 at the tests/tests level. We can make the necessary updates in this PR as a temporary solution while we wait for the Gaea admin to respond to our inquiry and dig further into the cause of the issue. @jkbk2004

@jkbk2004
Collaborator

Gaea tests pass with the default TPN=24, but some cases need to keep TPN=36: datm_cdeps_lnd_gswp3, hafs_regional_specified_moving_1nest_atm, regional_2dwrtdecomp. I agree with finalizing this through issue #1790 after we receive some information from the Gaea admin.
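
The default-plus-override arrangement described in the two comments above could be sketched roughly as follows. This is only an illustration, not the contents of default_vars.sh or of any file under tests/tests: the names DEFAULT_TPN, TPN36_TESTS, and tpn_for_test are hypothetical, and the list holds just the three tests named in the comment above.

```shell
#!/bin/sh
# Hypothetical sketch: a machine-wide default with per-test overrides.
# Only the values 24 and 36 and the three test names come from the thread.

# Default tasks per node on Gaea, as proposed in the thread.
DEFAULT_TPN=24

# Tests reported as still needing TPN=36 (space-separated list).
TPN36_TESTS="datm_cdeps_lnd_gswp3 hafs_regional_specified_moving_1nest_atm regional_2dwrtdecomp"

# tpn_for_test NAME: print the tasks-per-node value to use for test NAME.
tpn_for_test() {
  # Pad both sides with spaces so only whole test names match.
  case " $TPN36_TESTS " in
    *" $1 "*) echo 36 ;;
    *)        echo "$DEFAULT_TPN" ;;
  esac
}

tpn_for_test regional_2dwrtdecomp   # prints 36
tpn_for_test some_other_test        # prints 24 (placeholder test name)
```

Keeping the exceptions in one explicit list would make the temporary workaround easy to remove once issue #1790 is resolved.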

@zach1221
Collaborator

Good morning, @BrianCurtis-NOAA. How's WCOSS2 and Acorn coming along?

@BrianCurtis-NOAA
Collaborator

> Good morning, @BrianCurtis-NOAA. How's WCOSS2 and Acorn coming along?

I was out Friday, so I've been working on baselines so far this AM. Almost ready to start comparisons.

@zach1221
Collaborator

@BrianCurtis-NOAA understood. No problem!

@zach1221
Collaborator

Testing is now complete, so we can proceed with merging the sub-PRs. I'll follow up on CCPP #71

@zach1221
Collaborator

@joeolson42 fv3atm PR #656 has been merged. Please go ahead and update the submodule pointer and revert the change in .gitmodules. @jkbk2004

@jkbk2004 jkbk2004 self-requested a review June 12, 2023 21:01
Labels
Baseline Updates Current baselines will be updated. Ready for Commit Queue The PR is ready for the Commit Queue. All checkboxes in PR template have been checked.
Development

Successfully merging this pull request may close these issues.

binary/unary operator precedence error on Cray compiler
7 participants