CESM run stopped with MPT ERROR

Hi All,

I ran CESM successfully for 10 years, resubmitting the job every 5 years. But when I resubmitted for another 5 years (because I want 15 years in total), the run stopped.

In the cesm log file, I found this error: MPT ERROR: MPI_COMM_WORLD rank 347 has terminated without calling MPI_Finalize(). Can anyone help me fix it?

Thanks a lot.

Yaqian
 

Erik Kluzek
CSEG and Liaisons
Staff member
Unfortunately, that's a fairly generic error, so any number of problems could cause it. I suggest looking for errors in the log files of each component to see if something stands out. Also check your batch output for other clues about the abort, and carefully go through the entire cesm.log that you quote above. The problem could be as simple as running out of disk quota or running out of time in your batch job.
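
As a rough illustration of that kind of log search, here is a minimal Python sketch that scans every log file in a run directory for suspicious lines. The file pattern (*.log.*), the keyword list, and the function names are assumptions made for this example, not anything CESM itself provides, so adjust them to match your own run directory and batch system.

```python
import gzip
import re
import sys
from pathlib import Path

# Keyword list is an assumption; extend it with whatever your
# machine or batch system prints on failure.
PATTERN = re.compile(r"error|abort|quota|time limit|killed", re.IGNORECASE)


def open_log(path):
    """Open a plain-text or gzip-compressed log file in text mode."""
    if path.suffix == ".gz":
        return gzip.open(path, "rt", errors="replace")
    return open(path, "r", errors="replace")


def scan_logs(run_dir, glob="*.log.*"):
    """Return (file name, line number, line) for each line matching PATTERN.

    The default glob assumes logs are named like cesm.log.<id> or
    atm.log.<id>; change it if your logs are named differently.
    """
    hits = []
    for path in sorted(Path(run_dir).glob(glob)):
        if not path.is_file():
            continue
        with open_log(path) as fh:
            for lineno, line in enumerate(fh, start=1):
                if PATTERN.search(line):
                    hits.append((path.name, lineno, line.rstrip()))
    return hits


if __name__ == "__main__":
    # Usage: python scan_logs.py /path/to/run/directory
    run_dir = sys.argv[1] if len(sys.argv) > 1 else "."
    for name, lineno, line in scan_logs(run_dir):
        print(f"{name}:{lineno}: {line}")
```

Matches are only a starting point: the first error in time, often in a component log rather than in cesm.log, is usually more informative than the final MPT ERROR line.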
 