Additional info - Attached are my version_info.txt, config_machines.xml, config_batch.xml and config_compilers.xml files (with xml files altered to txt files due to uploading restrictions) as requested in the 'Information to include' page. I should also mention that the changes made to the externals are just an addition to the wget command in the config_inputdata.xml file as detailed
here and addition of PES layout as detailed
here. Compiler version is gnu 7.5.0
I should also mention that for other tests the test output is also difficult to decipher, for example the O_TestTestScheduler.test_d_retry test which gives, as the output to the cs.status script the following:
20210922_180654
TESTBUILDFAIL_P1.f19_g16_rx1.A.archer2_gnu (Overall: PASS) details:
PASS TESTBUILDFAIL_P1.f19_g16_rx1.A.archer2_gnu CREATE_NEWCASE
PASS TESTBUILDFAIL_P1.f19_g16_rx1.A.archer2_gnu XML
PASS TESTBUILDFAIL_P1.f19_g16_rx1.A.archer2_gnu SETUP
PASS TESTBUILDFAIL_P1.f19_g16_rx1.A.archer2_gnu SHAREDLIB_BUILD time=0
PASS TESTBUILDFAIL_P1.f19_g16_rx1.A.archer2_gnu MODEL_BUILD time=1
PASS TESTBUILDFAIL_P1.f19_g16_rx1.A.archer2_gnu SUBMIT
PASS TESTBUILDFAIL_P1.f19_g16_rx1.A.archer2_gnu RUN time=1
PASS TESTBUILDFAIL_P1.f19_g16_rx1.A.archer2_gnu MEMLEAK insuffiencient data for memleak test
PASS TESTBUILDFAIL_P1.f19_g16_rx1.A.archer2_gnu SHORT_TERM_ARCHIVER
TESTRUNFAIL_P1.f19_g16_rx1.A.archer2_gnu (Overall: FAIL) details:
PASS TESTRUNFAIL_P1.f19_g16_rx1.A.archer2_gnu CREATE_NEWCASE
PASS TESTRUNFAIL_P1.f19_g16_rx1.A.archer2_gnu XML
PASS TESTRUNFAIL_P1.f19_g16_rx1.A.archer2_gnu SETUP
PASS TESTRUNFAIL_P1.f19_g16_rx1.A.archer2_gnu SHAREDLIB_BUILD time=0
PASS TESTRUNFAIL_P1.f19_g16_rx1.A.archer2_gnu MODEL_BUILD time=1
PASS TESTRUNFAIL_P1.f19_g16_rx1.A.archer2_gnu SUBMIT
FAIL TESTRUNFAIL_P1.f19_g16_rx1.A.archer2_gnu RUN time=1
TESTRUNPASS_P1.f19_g16_rx1.A.archer2_gnu (Overall: PASS) details:
PASS TESTRUNPASS_P1.f19_g16_rx1.A.archer2_gnu CREATE_NEWCASE
PASS TESTRUNPASS_P1.f19_g16_rx1.A.archer2_gnu XML
PASS TESTRUNPASS_P1.f19_g16_rx1.A.archer2_gnu SETUP
PASS TESTRUNPASS_P1.f19_g16_rx1.A.archer2_gnu SHAREDLIB_BUILD time=0
PASS TESTRUNPASS_P1.f19_g16_rx1.A.archer2_gnu MODEL_BUILD time=1
PASS TESTRUNPASS_P1.f19_g16_rx1.A.archer2_gnu SUBMIT
PASS TESTRUNPASS_P1.f19_g16_rx1.A.archer2_gnu RUN time=2
PASS TESTRUNPASS_P1.f19_g16_rx1.A.archer2_gnu MEMLEAK insuffiencient data for memleak test
PASS TESTRUNPASS_P1.f19_g16_rx1.A.archer2_gnu SHORT_TERM_ARCHIVER
In the TESTRUNFAIL dir the main output file contains the following:
Running test for TESTRUNFAIL
WARNING: Found difference in test CHECK_TIMING: case: False original value True
doing an 11 ndays startup test, with restarts every 11 ndays
File /work/n02/n02/csymonds/cesm/CESM2.1.3/runs/scripts_regression_test.20210922_180654/TESTRUNFAIL_P1.f19_g16_rx1.A.archer2_gnu.20210922_180654/LockedFiles/env_build.xml has been modified
Creating component namelists
Calling /lus/cls01095/work/n02/n02/csymonds/cesm/CESM2.1.3/my_cesm_sandbox/cime/src/drivers/mct/cime_config/buildnml
Finished creating component namelists
-------------------------------------------------------------------------
- Prestage required restarts into /work/n02/n02/csymonds/cesm/CESM2.1.3/runs/scripts_regression_test.20210922_180654/TESTRUNFAIL_P1.f19_g16_rx1.A.archer2_gnu.20210922_180654/run
- Case input data directory (DIN_LOC_ROOT) is /work/n02/n02/csymonds/cesm/CESM2.1.3/cesm_inputdata
- Checking for required input datasets in DIN_LOC_ROOT
-------------------------------------------------------------------------
2021-09-22 18:07:18 MODEL EXECUTION BEGINS HERE
run command is srun --distribution=block:block --hint=nomultithread /work/n02/n02/csymonds/cesm/CESM2.1.3/runs/scripts_regression_test.20210922_180654/TESTRUNFAIL_P1.f19_g16_rx1.A.archer2_gnu.20210922_180654/bld/cesm.exe >> cesm.log.$LID 2>&1
ERROR: RUN FAIL: Command 'srun --distribution=block:block --hint=nomultithread /work/n02/n02/csymonds/cesm/CESM2.1.3/runs/scripts_regression_test.20210922_180654/TESTRUNFAIL_P1.f19_g16_rx1.A.archer2_gnu.20210922_180654/bld/cesm.exe >> cesm.log.$LID 2>&1 ' failed
See log file for details: /work/n02/n02/csymonds/cesm/CESM2.1.3/runs/scripts_regression_test.20210922_180654/TESTRUNFAIL_P1.f19_g16_rx1.A.archer2_gnu.20210922_180654/run/cesm.log.523850.210922-180717
and the log indicated merely says:
Insta fail
srun: error: nid001341: task 0: Exited with exit code 255
srun: Terminating job step 523850.0
which is not very informative. Am I looking in the wrong place for information?