You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
In cold-started half-cycle tests the gdaspostanl job completes as successful but upon inspection of log there is an error from exgdas_atmos_nceppost.sh that is not being handled properly:
err_chk doesn't really do anything on RDHPCS machines
Since this is an error code being set directly rather than from a command return ($?), and only run through err_chk, even with set -e on this isn't picked up (no bash statement returns non-zero). Combined with (1), this means the program sets err=9 but doesn't actually exit.
The failure of this script doesn't seem to prevent the cycled run from continuing on normally. Should this even a fatal error for the cold-start half-cycle?
It's trivial to make these scripts exit with an error, which is how they seem to be written. But we need to figure out if they should, at least for that half-cycle.
More long-term, we need to decide what we're going to do with err_chk. I'm thinking we can put it in the postamble, so scripts would just call exit # and then the trap would call err_chk on wcoss. But we can discuss that elsewhere.
Moves the member directory one higher level in the directory hierarchy in anticipation of additionally components being run for the members in addition to atmos. This results in COM directories that are arranged as `memXXX/atmos` instead of `atmos/memXXX`.
This PR also adds a "hack" to allow the success for the `gdaspostanl` job in the first half cycle (see #1034). The "hack" will be removed with the refactoring of the post jobs in the (near) future.
Fixes#1196
Refs #1034
Description
In cold-started half-cycle tests the gdaspostanl job completes as successful but upon inspection of log there is an error from
exgdas_atmos_nceppost.sh
that is not being handled properly:The
err=9
doesn't result in an error exit.Requirements
Bug fix. The script should exit when it can't find the atmanl.nc file.
Acceptance Criteria (Definition of Done)
The script correctly exits when an error occurs.
The text was updated successfully, but these errors were encountered: