Crash of CAM

Dear all,

I am trying to use CCSM to simulate paleoclimate, and I have set new boundary conditions for it. For CAM, I changed only the topography and initial conditions; the trop_mozart and trop_mozart_aero conditions are still the present-day ones.

However, when I tested the paleoclimate simulation, it crashed.

The error message from ccsm.log is:
_pmii_daemon(SIGCHLD): PE 54 exit signal Floating point exception
[NID 01048] 2010-06-25 00:45:48 Apid 2118108: initiated application termination
_pmii_daemon(SIGCHLD): PE 20 exit signal Floating point exception
_pmii_daemon(SIGCHLD): PE 32 exit signal Floating point exception
Application 2118108 exit codes: 136
Application 2118108 exit signals: Killed
Application 2118108 resources: utime 0, stime 0

I set INFO_DEBUG to 3 in the env_run file. The resulting messages in cpl.log are:
……
comm_diag xxx sorr 18 4.7975531588880208000E+16 send atm Sl_ram1
comm_diag xxx sorr 19-1.3315035590722964000E+12 send atm Faxx_taux
comm_diag xxx sorr 20 1.0149440212635571000E+11 send atm Faxx_tauy
comm_diag xxx sorr 21-9.2904287728854384000E+16 send atm Faxx_lat
comm_diag xxx sorr 22-8.4450112160212050000E+14 send atm Faxx_sen
comm_diag xxx sorr 23-2.1675009705992397000E+17 send atm Faxx_lwup
comm_diag xxx sorr 24-3.7146856349002159000E+10 send atm Faxx_evap
comm_diag xxx sorr 25 0.0000000000000000000E+00 send atm Fall_flxdst1
comm_diag xxx sorr 26 0.0000000000000000000E+00 send atm Fall_flxdst2
comm_diag xxx sorr 27 0.0000000000000000000E+00 send atm Fall_flxdst3
comm_diag xxx sorr 28 0.0000000000000000000E+00 send atm Fall_flxdst4

It looks like the coupler sends fluxes to CAM, but CAM never sends fluxes back to the coupler. I therefore think the crash happens in CAM.
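To make the cpl.log excerpt easier to inspect, here is a minimal sketch of how the comm_diag lines could be parsed in Python. The function name `parse_comm_diag` and the regular expression are my own assumptions based only on the lines shown above, not part of CCSM. Note that the index and a negative value run together in the log (for example `19-1.33...`), so the sign has to be treated as part of the value.

```python
import re

# Pattern inferred from the cpl.log lines quoted above (an assumption, not a CCSM spec).
# The field index and a negative value may be printed without a space between them.
COMM_DIAG = re.compile(
    r"comm_diag\s+\S+\s+\S+\s+(\d+)\s*([-+]?\d\.\d+E[-+]\d+)\s+(\w+)\s+(\w+)\s+(\S+)"
)

def parse_comm_diag(lines):
    """Return a list of (index, value, direction, component, field) tuples."""
    records = []
    for line in lines:
        match = COMM_DIAG.search(line)
        if match:
            index, value, direction, component, field = match.groups()
            records.append((int(index), float(value), direction, component, field))
    return records

if __name__ == "__main__":
    sample = [
        "comm_diag xxx sorr 18 4.7975531588880208000E+16 send atm Sl_ram1",
        "comm_diag xxx sorr 19-1.3315035590722964000E+12 send atm Faxx_taux",
    ]
    for index, value, direction, component, field in parse_comm_diag(sample):
        print(f"{index:3d} {field:12s} {direction} {component} {value: .6e}")
```

With the values in a list, it is straightforward to compare the fields between a working present-day run and the failing paleoclimate run.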

The messages from atm.log are:
……
nstep, te 3 0.38276165984194736E+10 0.38273731590133700E+10 -0.13526895068366029E-01 0.98263002008615993E+05
NSTEP = 3 8.815842196185842E-05 8.137970423932620E-06 294.992 9.82642E+04 4.303263006289341E+00 0.51 0.24
nstep, te 4 0.38276450761196642E+10 0.38277882803329806E+10 0.79571535215858802E-02 0.98264198488480193E+05
NSTEP = 4 8.804683011840985E-05 8.564528212907504E-06 294.961 9.82654E+04 4.425280757015373E+00 0.51 0.41
nstep, te 5 0.38276786536282935E+10 0.38278139344566159E+10 0.75167982696905250E-02 0.98265395007074709E+05
NSTEP = 5 8.796038325599769E-05 9.670888711689632E-06 294.930 9.82666E+04 4.547291286758965E+00 0.51 0.44

CAM crashed at the fifth or sixth time step.

I also tested different values of dtime for CAM. The run crashed at time step 5 with dtime set to 900, at time step 5 with dtime set to 1800, and at time step 3 with dtime set to 3600.
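As a quick check on these three tests, the elapsed model time at each crash can be worked out as dtime times the step number. This is only a rough sketch: it assumes dtime is in seconds and that the reported step is the last one reached, and the helper name `elapsed_seconds` is mine.

```python
# (dtime, time step at which the run crashed), taken from the three tests above.
# Assumption: dtime is given in seconds.
runs = [(900, 5), (1800, 5), (3600, 3)]

def elapsed_seconds(dtime, nstep):
    """Model time covered after nstep steps of length dtime."""
    return dtime * nstep

for dtime, nstep in runs:
    total = elapsed_seconds(dtime, nstep)
    print(f"dtime={dtime:4d}  crash step={nstep}  elapsed={total:5d} s ({total / 3600:.2f} h)")
```

The elapsed times come out as 4500 s, 9000 s and 10800 s, so the crash does not seem to occur at one fixed model time.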

Could you tell me what might cause the crash in CAM? How can I read the core file that is written after the crash? And how can I turn off the chemistry and aerosol models in CAM?

Regards

Zhongshi
 