I've got some issues with the following tests (which are part of the pre-alpha tests) :
- SMS_Ld1.f19_f19_mg16.FXSD.hydra_gnu.cam-outfrq1d
When I submit this test everything works normal and the nescessairy input files are being downloaded, however during the RUN PHASE the test errors with the following error:
ERROR: GETFIL: FAILED to get /gpfs/projects/climate/cesm/inputdata/atm/cam/met/MERRA/2000/MERRA_19x2_20000102.nc
Which I find is odd since the test downloaded other nescessairy files such as MERRA_19x2_20000101.nc
I'm puzzled by this since no of the other tests have any issues with missing files
- IRT_N3_PM3_Ld7.f19_g17.BHISTWs.hydra_gnu.allactive-defaultio
This test seems to run properly for most part but in the beginning of the cesm.log file there are some odd messages:
mca_base_component_repository_open: unable to open mca_oob_ud: libosmcomp.so.3: cannot open shared object file: No such file or directory (ignored)
This message is repeated several times for each compute node and core, afterwards the test start giving normal log messages and at the end of the test it fails with the following error:
--------------------------------------------------------------------------
Primary job terminated normally, but 1 process returned
a non-zero exit code. Per user-direction, the job has been aborted.
--------------------------------------------------------------------------
--------------------------------------------------------------------------
mpiexec noticed that process rank 27 with PID 195130 on node node319 exited on signal 9 (Killed).
--------------------------------------------------------------------------
Does anyone have any how to solve these errors? Or potentially what they might be related to?
- SMS_Ld1.f19_f19_mg16.FXSD.hydra_gnu.cam-outfrq1d
When I submit this test everything works normal and the nescessairy input files are being downloaded, however during the RUN PHASE the test errors with the following error:
ERROR: GETFIL: FAILED to get /gpfs/projects/climate/cesm/inputdata/atm/cam/met/MERRA/2000/MERRA_19x2_20000102.nc
Which I find is odd since the test downloaded other nescessairy files such as MERRA_19x2_20000101.nc
I'm puzzled by this since no of the other tests have any issues with missing files
- IRT_N3_PM3_Ld7.f19_g17.BHISTWs.hydra_gnu.allactive-defaultio
This test seems to run properly for most part but in the beginning of the cesm.log file there are some odd messages:
mca_base_component_repository_open: unable to open mca_oob_ud: libosmcomp.so.3: cannot open shared object file: No such file or directory (ignored)
This message is repeated several times for each compute node and core, afterwards the test start giving normal log messages and at the end of the test it fails with the following error:
--------------------------------------------------------------------------
Primary job terminated normally, but 1 process returned
a non-zero exit code. Per user-direction, the job has been aborted.
--------------------------------------------------------------------------
--------------------------------------------------------------------------
mpiexec noticed that process rank 27 with PID 195130 on node node319 exited on signal 9 (Killed).
--------------------------------------------------------------------------
Does anyone have any how to solve these errors? Or potentially what they might be related to?