11438023@zju_edu_cn
Member
Hi there,I meet a strange problem during my 4xCO2 sensitivity experiment.My CESM version is 1.2.2, and the compset is E1850C5CN with CN (CLM) mode turned off. Firstly I ran 100 years to spin up from coldstart, and the restart fiies' date is 101-01-01;Then I set up a hybrid run, changing co2 concentration to 4 times its preindustrial value while other parameter stay unchagned, and start my experiment with the restart file described above (RUN_REFDATE=0101-01-01, STOP_OPTION=nyears, STOP_N=6);But the experiment stops every time it reaches year 143 and month 7, and there have no obvious error shown in the log file (see below). I have excluded the cause of nodes, so does anyone know why this problem happened? Thanks very much!
cesm.log######################################## ########################## QNEG3 from TPHYSBCb:m= 5 lat/lchnk= 901 Min. mixing ratio violated at 1 points. Reset to 0.0E+00 Worst =-5.7E-12 at i,k= 1 1 QNEG3 from TPHYSBCb:m= 5 lat/lchnk= 570 Min. mixing ratio violated at 1 points. Reset to 0.0E+00 Worst =-6.1E-12 at i,k= 1 1 QNEG3 from TPHYSBCb:m= 5 lat/lchnk= 415 Min. mixing ratio violated at 1 points. Reset to 0.0E+00 Worst =-2.9E-12 at i,k= 1 1 filew failed, worst i, j, qtmp, q = 1 73 -6.265316431815470E-013 0.000000000000000E+000 filew failed, worst i, j, qtmp, q = 1 73 -7.505473598505249E-015 0.000000000000000E+000 dpcoup cant adjust 3 561 8 -5.080309997666980E-018 0.000000000000000E+000 4.311432476213216E-018 QNEG3 from convect_deep/CLDLIQ:m= 2 lat/lchnk= 302 Min. mixing ratio violated at 1 points. Reset to 0.0E+00 Worst =-2.2E-12 at i,k= 4 27 QNEG3 from TPHYSBCb:m= 5 lat/lchnk= 264 Min. mixing ratio violated at 1 points. Reset to 0.0E+00 Worst =-2.0E-12 at i,k= 1 1 QNEG3 from TPHYSBCb:m= 5 lat/lchnk= 336 Min. mixing ratio violated at 1 points. Reset to 0.0E+00 Worst =-7.8E-10 at i,k= 1 1 QNEG3 from TPHYSBCb:m= 5 lat/lchnk= 334 Min. mixing ratio violated at 1 points. Reset to 0.0E+00 Worst =-1.3E-10 at i,k= 1 1 QNEG3 from TPHYSBCb:m= 5 lat/lchnk= 1034 Min. mixing ratio violated at 1 points. Reset to 0.0E+00 Worst =-5.1E-12 at i,k= 1 1 QNEG3 from TPHYSBCb:m= 5 lat/lchnk= 886 Min. mixing ratio violated at 1 points. Reset to 0.0E+00 Worst =-6.2E-09 at i,k= 1 1 QNEG3 from TPHYSBCb:m= 5 lat/lchnk= 564 Min. mixing ratio violated at 1 points. Reset to 0.0E+00 Worst =-1.1E-11 at i,k= 1 1 QNEG3 from TPHYSBCb:m= 5 lat/lchnk= 833 Min. mixing ratio violated at 1 points. Reset to 0.0E+00 Worst =-1.4E-10 at i,k= 1 1 QNEG3 from TPHYSBCb:m= 5 lat/lchnk= 563 Min. mixing ratio violated at 1 points. Reset to 0.0E+00 Worst =-1.1E-10 at i,k= 1 1 QNEG3 from TPHYSBCb:m= 5 lat/lchnk= 563 Min. mixing ratio violated at 1 points. Reset to 0.0E+00 Worst =-3.8E-11 at i,k= 1 1 QNEG3 from TPHYSBCb:m= 5 lat/lchnk= 564 Min. mixing ratio violated at 1 points. Reset to 0.0E+00 Worst =-2.4E-10 at i,k= 1 1 QNEG3 from TPHYSBCb:m= 5 lat/lchnk= 836 Min. mixing ratio violated at 1 points. Reset to 0.0E+00 Worst =-3.8E-11 at i,k= 1 1 QNEG3 from TPHYSBCb:m= 5 lat/lchnk= 227 Min. mixing ratio violated at 1 points. Reset to 0.0E+00 Worst =-8.0E-11 at i,k= 1 1forrtl: error (65): floating invalidImage PC Routine Line Sourcecesm.exe 00000000006AB42F mo_usrrxt_mp_usrr 581 mo_usrrxt.F90cesm.exe 000000000061D8E6 mo_gas_phase_chem 724 mo_gas_phase_chemdr.F90cesm.exe 00000000005B1DFE chemistry_mp_chem 1473 chemistry.F90cesm.exe 0000000000743FEE physpkg_mp_tphysa 1396 physpkg.F90cesm.exe 0000000000741B45 physpkg_mp_phys_r 1131 physpkg.F90cesm.exe 0000000000550F7D cam_comp_mp_cam_r 300 cam_comp.F90cesm.exe 000000000054515C atm_comp_mct_mp_a 539 atm_comp_mct.F90cesm.exe 00000000004BDD2D ccsm_comp_mod_mp_ 4079 ccsm_comp_mod.F90cesm.exe 00000000004E6B35 MAIN__ 91 ccsm_driver.F90cesm.exe 00000000004B733C Unknown Unknown Unknownlibc.so.6 0000003BD661ECDD Unknown Unknown Unknowncesm.exe 00000000004B7239 Unknown Unknown Unknownyhrun: error: cn2980: task 54: Abortedyhrun: First task exited 60s agoyhrun: tasks 0-53,55-119: runningyhrun: task 54: exited abnormallyyhrun: Terminating job step 5127560.0yhrun: Job step aborted: Waiting up to 2 seconds for job step to finish.slurmd[cn2655]: *** STEP 5127560.0 KILLED AT 2016-06-17T17:44:02 WITH SIGNAL 9 ***slurmd[cn2655]: *** STEP 5127560.0 KILLED AT 2016-06-17T17:44:02 WITH SIGNAL 9 ***yhrun: error: cn4155: tasks 108-119: Killed###################################################################
Best regard,Duan
cesm.log######################################## ########################## QNEG3 from TPHYSBCb:m= 5 lat/lchnk= 901 Min. mixing ratio violated at 1 points. Reset to 0.0E+00 Worst =-5.7E-12 at i,k= 1 1 QNEG3 from TPHYSBCb:m= 5 lat/lchnk= 570 Min. mixing ratio violated at 1 points. Reset to 0.0E+00 Worst =-6.1E-12 at i,k= 1 1 QNEG3 from TPHYSBCb:m= 5 lat/lchnk= 415 Min. mixing ratio violated at 1 points. Reset to 0.0E+00 Worst =-2.9E-12 at i,k= 1 1 filew failed, worst i, j, qtmp, q = 1 73 -6.265316431815470E-013 0.000000000000000E+000 filew failed, worst i, j, qtmp, q = 1 73 -7.505473598505249E-015 0.000000000000000E+000 dpcoup cant adjust 3 561 8 -5.080309997666980E-018 0.000000000000000E+000 4.311432476213216E-018 QNEG3 from convect_deep/CLDLIQ:m= 2 lat/lchnk= 302 Min. mixing ratio violated at 1 points. Reset to 0.0E+00 Worst =-2.2E-12 at i,k= 4 27 QNEG3 from TPHYSBCb:m= 5 lat/lchnk= 264 Min. mixing ratio violated at 1 points. Reset to 0.0E+00 Worst =-2.0E-12 at i,k= 1 1 QNEG3 from TPHYSBCb:m= 5 lat/lchnk= 336 Min. mixing ratio violated at 1 points. Reset to 0.0E+00 Worst =-7.8E-10 at i,k= 1 1 QNEG3 from TPHYSBCb:m= 5 lat/lchnk= 334 Min. mixing ratio violated at 1 points. Reset to 0.0E+00 Worst =-1.3E-10 at i,k= 1 1 QNEG3 from TPHYSBCb:m= 5 lat/lchnk= 1034 Min. mixing ratio violated at 1 points. Reset to 0.0E+00 Worst =-5.1E-12 at i,k= 1 1 QNEG3 from TPHYSBCb:m= 5 lat/lchnk= 886 Min. mixing ratio violated at 1 points. Reset to 0.0E+00 Worst =-6.2E-09 at i,k= 1 1 QNEG3 from TPHYSBCb:m= 5 lat/lchnk= 564 Min. mixing ratio violated at 1 points. Reset to 0.0E+00 Worst =-1.1E-11 at i,k= 1 1 QNEG3 from TPHYSBCb:m= 5 lat/lchnk= 833 Min. mixing ratio violated at 1 points. Reset to 0.0E+00 Worst =-1.4E-10 at i,k= 1 1 QNEG3 from TPHYSBCb:m= 5 lat/lchnk= 563 Min. mixing ratio violated at 1 points. Reset to 0.0E+00 Worst =-1.1E-10 at i,k= 1 1 QNEG3 from TPHYSBCb:m= 5 lat/lchnk= 563 Min. mixing ratio violated at 1 points. Reset to 0.0E+00 Worst =-3.8E-11 at i,k= 1 1 QNEG3 from TPHYSBCb:m= 5 lat/lchnk= 564 Min. mixing ratio violated at 1 points. Reset to 0.0E+00 Worst =-2.4E-10 at i,k= 1 1 QNEG3 from TPHYSBCb:m= 5 lat/lchnk= 836 Min. mixing ratio violated at 1 points. Reset to 0.0E+00 Worst =-3.8E-11 at i,k= 1 1 QNEG3 from TPHYSBCb:m= 5 lat/lchnk= 227 Min. mixing ratio violated at 1 points. Reset to 0.0E+00 Worst =-8.0E-11 at i,k= 1 1forrtl: error (65): floating invalidImage PC Routine Line Sourcecesm.exe 00000000006AB42F mo_usrrxt_mp_usrr 581 mo_usrrxt.F90cesm.exe 000000000061D8E6 mo_gas_phase_chem 724 mo_gas_phase_chemdr.F90cesm.exe 00000000005B1DFE chemistry_mp_chem 1473 chemistry.F90cesm.exe 0000000000743FEE physpkg_mp_tphysa 1396 physpkg.F90cesm.exe 0000000000741B45 physpkg_mp_phys_r 1131 physpkg.F90cesm.exe 0000000000550F7D cam_comp_mp_cam_r 300 cam_comp.F90cesm.exe 000000000054515C atm_comp_mct_mp_a 539 atm_comp_mct.F90cesm.exe 00000000004BDD2D ccsm_comp_mod_mp_ 4079 ccsm_comp_mod.F90cesm.exe 00000000004E6B35 MAIN__ 91 ccsm_driver.F90cesm.exe 00000000004B733C Unknown Unknown Unknownlibc.so.6 0000003BD661ECDD Unknown Unknown Unknowncesm.exe 00000000004B7239 Unknown Unknown Unknownyhrun: error: cn2980: task 54: Abortedyhrun: First task exited 60s agoyhrun: tasks 0-53,55-119: runningyhrun: task 54: exited abnormallyyhrun: Terminating job step 5127560.0yhrun: Job step aborted: Waiting up to 2 seconds for job step to finish.slurmd[cn2655]: *** STEP 5127560.0 KILLED AT 2016-06-17T17:44:02 WITH SIGNAL 9 ***slurmd[cn2655]: *** STEP 5127560.0 KILLED AT 2016-06-17T17:44:02 WITH SIGNAL 9 ***yhrun: error: cn4155: tasks 108-119: Killed###################################################################
Best regard,Duan