Scheduled Downtime
On Tuesday 24 October 2023 @ 5pm MT the forums will be in read only mode in preparation for the downtime. On Wednesday 25 October 2023 @ 5am MT, this website will be down for maintenance and expected to return online later in the morning.
Normal Operations
The forums are back online with normal operations. If you notice any issues or errors related to the forums, please reach out to help@ucar.edu

BalanceCheck: soil balance error nstep

I am running a case F_2000_CAM5, ne120_ne120, and model has ran for 5 days without a problem. Then I set a continue run, but it crased.  3280: BalanceCheck: soil balance error nstep =      1860 point =437724 imbalance =   -0.000000 W/m22791: BalanceCheck: soil balance error nstep =      1860 point =372228 imbalance =   -0.000005 W/m23084: BalanceCheck: soil balance error nstep =      1860 point =410627 imbalance =   -0.000004 W/m21918: BalanceCheck: soil balance error nstep =      1860 point =259906 imbalance =   -0.000004 W/m22866: QNEG3 from vertical diffusion/SO2:m=  8 lat/lchnk= 109282 Min. mixing ratio violated at    1 points.  Reset to  1.0E-36 Worst =-1.8E-12 at i,k=  13 30INFO: 0031-029  Caught signal 12 (User defined signal 2), sending to tasks...ERROR: 0031-161  EOF on socket connection with node ys1303-ib My questions are: - "ERROR: 0031-161  EOF on socket connection with node xxx" Is this the system problem?- "BalanceCheck: soil balance error nstep xxx" Where does this error come from? Here is the case directory:/glade/u/home/yingli/cesm/runs/f.F2000C5.ne120_ne120.test.008
 

santos

Member
This usually means that the process ran out of time before it completed. So you may have to increase the time at the top of your run script, or reduce the number of days you run for (STOP_N and STOP_OPTION. The maximum on yellowstone is 12 hours, which would be written like this in your run script:#BSUB -W 12:00I do not know what BalanceCheck is. It is not really an error, but a very common warning. It doesn't seem to have anything to do with your run stopping.
 

santos

Member
This usually means that the process ran out of time before it completed. So you may have to increase the time at the top of your run script, or reduce the number of days you run for (STOP_N and STOP_OPTION. The maximum on yellowstone is 12 hours, which would be written like this in your run script:#BSUB -W 12:00I do not know what BalanceCheck is. It is not really an error, but a very common warning. It doesn't seem to have anything to do with your run stopping.
 
Top