I had a stand-alone FV CAM run going last night; this was the second restart, and about 6 months had already been simulated successfully. This morning I found this email:
...
Exited with exit code 134.
Resource usage summary:
CPU time : 369921.59 sec.
Max Memory : 61252 MB
Max Swap : 59964 MB
Max Processes : 262
Max Threads : 6280
Read file for stdout output of this job.
Read file for stderr output of this job.
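A note on the exit code: assuming the batch system follows the usual shell convention of reporting 128 + N when a process is killed by signal N, exit code 134 would correspond to signal 6, which is SIGABRT on most Unix systems. That would hint that something in the job aborted rather than finishing normally. A minimal Python sketch of that decoding (the helper name `decode_exit_code` is made up for illustration):

```python
import signal


def decode_exit_code(code):
    """Return the signal name implied by a shell-style exit code, or None.

    Assumes the 128 + N convention; a batch system is free to report
    exit codes differently, so treat the result as a hint only.
    """
    if code <= 128:
        return None  # ordinary exit status, not a signal
    try:
        return signal.Signals(code - 128).name
    except ValueError:
        return None  # no signal with that number on this platform


print(decode_exit_code(134))
```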
So I checked poe.stderr.473359 and it's blank. Then I checked poe.stdout.473359, and it says:
...
CCSM PRESTAGE SCRIPT STARTING
- CCSM input data directory, DIN_LOC_ROOT_CSMDATA, is /fis/cgd/cseg/csm/inputdata
- Case input data directory, DIN_LOC_ROOT, is /fis/cgd/cseg/csm/inputdata
- Checking the existence of input datasets in DIN_LOC_ROOT
CCSM PRESTAGE SCRIPT HAS FINISHED SUCCESSFULLY
Fri Mar 30 02:53:05 MDT 2012 -- CSM EXECUTION BEGINS HERE
Fri Mar 30 04:49:17 MDT 2012 -- CSM EXECUTION HAS FINISHED
Model did not complete - see /ptmp/whannah/raspe_00/run/cpl.log.120330-025300
Then I checked cpl.log.120330-025300 as well as atm.log.120330-025300, but neither mentions any error or problem. The model just stopped.
So my question is: what can I do to figure out what the problem is?
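One thing I could try is widening the search beyond cpl.log and atm.log to every log in the run directory that carries the same timestamp, looking at both the last few lines and any lines that match common failure words. Below is a rough Python sketch of that idea. The function name `scan_logs`, the keyword list, and the assumption that all component logs are named `*.log.<timestamp>` are mine, not anything the model guarantees.

```python
import re
from pathlib import Path

# Assumed keywords; the real failure message may be worded differently.
PATTERN = re.compile(r"error|abort|\bnan\b|segmentation|core dump", re.IGNORECASE)


def scan_logs(run_dir, stamp, tail=20):
    """Map each log file name to (matching lines, last `tail` lines)."""
    report = {}
    for path in sorted(Path(run_dir).glob(f"*.log.{stamp}")):
        # errors="replace" keeps the scan going if a log has stray bytes
        lines = path.read_text(errors="replace").splitlines()
        hits = [ln for ln in lines if PATTERN.search(ln)]
        report[path.name] = (hits, lines[-tail:])
    return report
```

For this run the call would be `scan_logs("/ptmp/whannah/raspe_00/run", "120330-025300")`. If no log has a matching line, the tail of each log at least shows which component wrote last before the model stopped.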