ekarlsso@purdue_edu
New Member
Hello, we recently switched from pbs to slurm on our HPC system and I now run into some issues with the scripts_regression_tests.
I get the following error when running the K_TestCimeCase tests:
======================================================================
ERROR: test_cime_case_st_archive_resubmit (__main__.K_TestCimeCase)
----------------------------------------------------------------------
Traceback (most recent call last):
File "./scripts_regression_tests.py", line 1672, in test_cime_case_st_archive_resubmit
case.case_st_archive(resubmit=True)
File "/scratch/brown/ekarlsso/clm5_porting2020/clm5_porting/clm5.0/cime/scripts/tests/../lib/CIME/case/case_st_archive.py", line 760, in case_st_archive
self.submit(resubmit=True)
File "/scratch/brown/ekarlsso/clm5_porting2020/clm5_porting/clm5.0/cime/scripts/tests/../lib/CIME/case/case_submit.py", line 157, in submit
custom_success_msg_functor=verbatim_success_msg)
File "/scratch/brown/ekarlsso/clm5_porting2020/clm5_porting/clm5.0/cime/scripts/tests/../lib/CIME/utils.py", line 1683, in run_and_log_case_status
rv = func()
File "/scratch/brown/ekarlsso/clm5_porting2020/clm5_porting/clm5.0/cime/scripts/tests/../lib/CIME/case/case_submit.py", line 155, in <lambda>
batch_args=batch_args)
File "/scratch/brown/ekarlsso/clm5_porting2020/clm5_porting/clm5.0/cime/scripts/tests/../lib/CIME/case/case_submit.py", line 100, in _submit
mail_type=mail_type, batch_args=batch_args)
File "/scratch/brown/ekarlsso/clm5_porting2020/clm5_porting/clm5.0/cime/scripts/tests/../lib/CIME/case/case.py", line 1203, in submit_jobs
batch_args=batch_args, dry_run=dry_run)
File "/scratch/brown/ekarlsso/clm5_porting2020/clm5_porting/clm5.0/cime/scripts/tests/../lib/CIME/XML/env_batch.py", line 515, in submit_jobs
dry_run=dry_run)
File "/scratch/brown/ekarlsso/clm5_porting2020/clm5_porting/clm5.0/cime/scripts/tests/../lib/CIME/XML/env_batch.py", line 699, in _submit_single_job
output = run_cmd_no_fail(submitcmd, combine_output=True)
File "/scratch/brown/ekarlsso/clm5_porting2020/clm5_porting/clm5.0/cime/scripts/tests/../lib/CIME/utils.py", line 516, in run_cmd_no_fail
expect(False, "Command: '{}' failed with error '{}' from dir '{}'".format(cmd, errput.encode('utf-8'), os.getcwd() if from_dir is None else from_dir))
File "/scratch/brown/ekarlsso/clm5_porting2020/clm5_porting/clm5.0/cime/scripts/tests/../lib/CIME/utils.py", line 130, in expect
raise exc_type(msg)
SystemExit: ERROR: Command: 'cd $CASEROOT ; sbatch .case.run --resubmit' failed with error 'sbatch: error: Unable to open file .case.run' from dir '/scratch/brown/ekarlsso/scripts_regression_test.20201027_153245/st_archive_resubmit_test'
----------------------------------------------------------------------
Ran 20 tests in 119.295s
FAILED (errors=1, skipped=6)
('Detected failures, leaving directory:', '/scratch/brown/ekarlsso/scripts_regression_test.20201027_153245')
In the test output directory, there is no cesm.log and there are no .nc files (i.e. the st_archive_resubmit_test/run directory is empty. The .case.run is not executable, but is in the directory. Any ideas on where it went wrong? Thank you!
Version info: release-clm5.0.34-3-gcdc544df
Processing externals description file : Externals.cfg
Processing externals description file : Externals_CISM.cfg
Processing externals description file : Externals_CLM.cfg
Checking status of externals: cism, source_cism, clm, fates, ptclm, mosart, cime, rtm,
M ./cime
modified sandbox, on cime5.6.33
./components/cism
clean sandbox, on cism-release-cesm2.1.2_02
./components/cism/source_cism
clean sandbox, on release-cism2.1.03
./components/mosart
clean sandbox, on release-cesm2.0.04
./components/rtm
clean sandbox, on release-cesm2.0.04
./src/fates
clean sandbox, on sci.1.30.0_api.8.0.0
./tools/PTCLM
clean sandbox, on PTCLM2_20200121
I get the following error when running the K_TestCimeCase tests:
======================================================================
ERROR: test_cime_case_st_archive_resubmit (__main__.K_TestCimeCase)
----------------------------------------------------------------------
Traceback (most recent call last):
File "./scripts_regression_tests.py", line 1672, in test_cime_case_st_archive_resubmit
case.case_st_archive(resubmit=True)
File "/scratch/brown/ekarlsso/clm5_porting2020/clm5_porting/clm5.0/cime/scripts/tests/../lib/CIME/case/case_st_archive.py", line 760, in case_st_archive
self.submit(resubmit=True)
File "/scratch/brown/ekarlsso/clm5_porting2020/clm5_porting/clm5.0/cime/scripts/tests/../lib/CIME/case/case_submit.py", line 157, in submit
custom_success_msg_functor=verbatim_success_msg)
File "/scratch/brown/ekarlsso/clm5_porting2020/clm5_porting/clm5.0/cime/scripts/tests/../lib/CIME/utils.py", line 1683, in run_and_log_case_status
rv = func()
File "/scratch/brown/ekarlsso/clm5_porting2020/clm5_porting/clm5.0/cime/scripts/tests/../lib/CIME/case/case_submit.py", line 155, in <lambda>
batch_args=batch_args)
File "/scratch/brown/ekarlsso/clm5_porting2020/clm5_porting/clm5.0/cime/scripts/tests/../lib/CIME/case/case_submit.py", line 100, in _submit
mail_type=mail_type, batch_args=batch_args)
File "/scratch/brown/ekarlsso/clm5_porting2020/clm5_porting/clm5.0/cime/scripts/tests/../lib/CIME/case/case.py", line 1203, in submit_jobs
batch_args=batch_args, dry_run=dry_run)
File "/scratch/brown/ekarlsso/clm5_porting2020/clm5_porting/clm5.0/cime/scripts/tests/../lib/CIME/XML/env_batch.py", line 515, in submit_jobs
dry_run=dry_run)
File "/scratch/brown/ekarlsso/clm5_porting2020/clm5_porting/clm5.0/cime/scripts/tests/../lib/CIME/XML/env_batch.py", line 699, in _submit_single_job
output = run_cmd_no_fail(submitcmd, combine_output=True)
File "/scratch/brown/ekarlsso/clm5_porting2020/clm5_porting/clm5.0/cime/scripts/tests/../lib/CIME/utils.py", line 516, in run_cmd_no_fail
expect(False, "Command: '{}' failed with error '{}' from dir '{}'".format(cmd, errput.encode('utf-8'), os.getcwd() if from_dir is None else from_dir))
File "/scratch/brown/ekarlsso/clm5_porting2020/clm5_porting/clm5.0/cime/scripts/tests/../lib/CIME/utils.py", line 130, in expect
raise exc_type(msg)
SystemExit: ERROR: Command: 'cd $CASEROOT ; sbatch .case.run --resubmit' failed with error 'sbatch: error: Unable to open file .case.run' from dir '/scratch/brown/ekarlsso/scripts_regression_test.20201027_153245/st_archive_resubmit_test'
----------------------------------------------------------------------
Ran 20 tests in 119.295s
FAILED (errors=1, skipped=6)
('Detected failures, leaving directory:', '/scratch/brown/ekarlsso/scripts_regression_test.20201027_153245')
In the test output directory, there is no cesm.log and there are no .nc files (i.e. the st_archive_resubmit_test/run directory is empty. The .case.run is not executable, but is in the directory. Any ideas on where it went wrong? Thank you!
Version info: release-clm5.0.34-3-gcdc544df
Processing externals description file : Externals.cfg
Processing externals description file : Externals_CISM.cfg
Processing externals description file : Externals_CLM.cfg
Checking status of externals: cism, source_cism, clm, fates, ptclm, mosart, cime, rtm,
M ./cime
modified sandbox, on cime5.6.33
./components/cism
clean sandbox, on cism-release-cesm2.1.2_02
./components/cism/source_cism
clean sandbox, on release-cism2.1.03
./components/mosart
clean sandbox, on release-cesm2.0.04
./components/rtm
clean sandbox, on release-cesm2.0.04
./src/fates
clean sandbox, on sci.1.30.0_api.8.0.0
./tools/PTCLM
clean sandbox, on PTCLM2_20200121