When rt.sh is used to run the regression tests, compile jobs that fail go unnoticed unless there are regression tests following that depend on the executable from that compile job.
To Reproduce:
With the current hash of develop (6daad90), run the GNU regression tests on hera.
The compile jobs for the coupled model fail due to recent code changes in CMEPS (that are incompatible with GNU but work with Intel). These errors go unnoticed, because no coupled model tests are run.
Additional context
The error is (thanks to @pjpegion for reporting this):
/scratch2/BMC/gsienkf/Philip.Pegion/ufs-weather-model-develop/CMEPS-interface/CMEPS/mediator/esmFldsExchange_nems_mod.F90:87:46:
87 | 'Sa_u10m','Sa_v10m', 'Sa_t2m ', 'Sa_q2m'/)
| 1
Error: Different CHARACTER lengths (7/6) in array constructor at (1)
make[2]: *** [CMEPS-interface/CMakeFiles/cmeps.dir/CMEPS/mediator/esmFldsExchange_nems_mod.F90.o] Error 1
The solution for this particular problem is to pad the entries with trailing whitespace so that they all have the same length, or alternatively use something like
When a job fails, Slurm reports the job status as '-' (unknown):
Job id 15319645
TEST 10 compile is waiting to enter the queue
TEST 10 compile is submitted
1 min. TEST 10 compile is pending, status: PD jobid 15319645
2 min. TEST 10 compile is running, status: R jobid 15319645
Slurm unknown status -. Check sacct ...
15319645 FAILED compile_10
15319645.ba+ FAILED batch
15319645.ex+ COMPLETED extern
3 min. TEST 10 compile is FAILED, status: - jobid 15319645
In such cases we must check the status_label that sacct returns for the given jobid and set test_status manually.
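A minimal sketch of that check follows; it is not the actual rt.sh code. The helper name `job_state_from_sacct` and the `DONE`/`FAIL`/`UNKNOWN` values assigned to `test_status` are assumptions made for illustration. The sacct output is fed in as text, using the lines from the log above, so the parsing logic can be tried without a scheduler.

```shell
#!/usr/bin/env bash

# Hypothetical helper: read sacct-style lines ("<jobid>[.step] <STATE> <name>")
# on stdin and print the state of the job allocation itself, ignoring the
# ".batch"/".extern" step lines. Prints UNKNOWN if the job id is not found.
job_state_from_sacct() {
  local jobid="$1"
  awk -v id="$jobid" '
    ($1 "") == (id "") { print $2; found = 1; exit }   # force string comparison
    END { if (!found) print "UNKNOWN" }
  '
}

# sacct output copied from the log above. In a real run this text would
# likely come from something like: sacct -n -j "$jobid" -o JobID,State,JobName
sacct_output='15319645 FAILED compile_10
15319645.ba+ FAILED batch
15319645.ex+ COMPLETED extern'

status_label=$(printf '%s\n' "$sacct_output" | job_state_from_sacct 15319645)

# Map the sacct state to a test status (the values on the right are assumed names).
case "$status_label" in
  COMPLETED)                                         test_status='DONE' ;;
  FAILED|CANCELLED*|TIMEOUT|NODE_FAIL|OUT_OF_MEMORY) test_status='FAIL' ;;
  *)                                                 test_status='UNKNOWN' ;;
esac

echo "status_label=$status_label test_status=$test_status"
```

Matching only the line whose first field is exactly the job id keeps a COMPLETED extern step from masking a FAILED compile job.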
This is not the first time (and certainly will not be the last) that this kind of error has caused the GNU build to fail. So I suggest we make the Intel compiler fail on this particular error as well, by adding: -std -diag-error=8208
to all CMAKE_Fortran_FLAGS in all the components we control.