jihwang@colorado_edu
Member
Hello All:
I was wondering if anyone can help me with the ne240 grid for a CESM model run. I set up a fairly simple CESM run, something like -res FIDEAL, or a CAM-only run with prescribed SST and sea ice. I tested the configurations on the ne30_g16 and ne120_g16 grids and they ran just fine. However, when I tried the same configurations with the ne60_ne60, ne60_g16, ne240_ne240, ne240_g16, ne240_f02_t12, ne240_f02_g16, and ne240_t12 grids, they all failed. In other words, ne60 and ne240 do not work.

I can't understand why the model stops after what seems to be the first time step. The error messages only say that the run was terminated.

-------------------------
srun: error: nid00175: tasks 696,698,700,702,704,706,708,710,714,716,718: Killed
srun: Terminating job step 73821.0
srun: error: nid00076: tasks 0-16,18-23: Killed
srun: error: nid00175: tasks 697,701,703,705,707,709,711,713,715,717: Killed
0000: slurmstepd: *** STEP 73821.0 ON nid00076 CANCELLED AT 2016-01-27T09:20:15 ***
srun: Job step aborted: Waiting up to 32 seconds for job step to finish.
srun: error: nid00204: tasks 1202,1216: Killed
srun: error: nid00175: task 712: Killed
...

I suspect I made a mistake somewhere, even though everything is the same as in the successful ne120_g16 run. Can anyone give me a hint or point me to someone who knows the answer?

By the way, I tried both NERSC's Edison and NOAA's Gaea, and they show the same issue. The same configurations on ne30_g16 and ne120_g16 run fine for a long time.

Thank you so much.