To Whom this may concern,
I am porting CESM2 to the Niagara supercomputer on SciNet. I have managed to complete builds and submit single jobs (successfully), but the resubmission tool is not working.
I receive a "Jobs can only be submitted from the login node" SBATCH error (using slurm scheduler).
I have tried modifying the batch_submit in config_batch.xml as follows:
<batch_submit>ssh -t nia-login01 "cd $PROJECT/cesm2_1_3_OUT/$CASE; sbatch"</batch_submit>
This fails as the quotes really need to go around the whole submission argument, ie...
ssh nia-login01 "cd $PROJECT/cesm2_1_3_OUT/$CASE; sbatch --time 12:00:00 --mail-user nstant@my.yorku.ca --mail-type all .case.run --resubmit"
But what I currently have it, it does this:
ssh nia-login01 "cd $PROJECT/cesm2_1_3_OUT/$CASE; sbatch" --time 12:00:00 --mail-user nstant@my.yorku.ca --mail-type all .case.run --resubmit
Note the bolded ".
Is there another workaround to modify either the batch_submit command or change batch_config.xml in some other way to allow for resubmission of jobs to occur on the login node (with computation of the job on the compute nodes)? I am currently working with sysadmin to find a workaround as well.
Thank you in advance!
Sincerely,
Noah Stanton
I am porting CESM2 to the Niagara supercomputer on SciNet. I have managed to complete builds and submit single jobs (successfully), but the resubmission tool is not working.
I receive a "Jobs can only be submitted from the login node" SBATCH error (using slurm scheduler).
I have tried modifying the batch_submit in config_batch.xml as follows:
<batch_submit>ssh -t nia-login01 "cd $PROJECT/cesm2_1_3_OUT/$CASE; sbatch"</batch_submit>
This fails as the quotes really need to go around the whole submission argument, ie...
ssh nia-login01 "cd $PROJECT/cesm2_1_3_OUT/$CASE; sbatch --time 12:00:00 --mail-user nstant@my.yorku.ca --mail-type all .case.run --resubmit"
But what I currently have it, it does this:
ssh nia-login01 "cd $PROJECT/cesm2_1_3_OUT/$CASE; sbatch" --time 12:00:00 --mail-user nstant@my.yorku.ca --mail-type all .case.run --resubmit
Note the bolded ".
Is there another workaround to modify either the batch_submit command or change batch_config.xml in some other way to allow for resubmission of jobs to occur on the login node (with computation of the job on the compute nodes)? I am currently working with sysadmin to find a workaround as well.
Thank you in advance!
Sincerely,
Noah Stanton