Hi all,
I have a CLM5 case which sometimes runs, but mostly doesn't. I wanted to ask if there is anything I can do to reduce the high crash rate. I am running over a custom domain (sub-Saharan Africa) at a 0.5 degree resolution and outputting CLM5 history fields per PFT as well as on the lon/lat grid. When the model fails, it seem to be after the completion of the first timesteps, implying that the error is related to the process of writing the history files.
The error message in the CESM log is
1369:MPT ERROR: Rank 1369(g:1369) received signal SIGSEGV(11).
1369: Process ID: 28314, Host: r14i7n22, Program: /glade/scratch/jamesking/i.clm5.AfrSSP126_allforcings.000/bld/cesm.exe
1369: MPT Version: HPE MPT 2.21 11/28/19 04:21:40
There are also some NetCDF: variable not found errors but I don't think these are pointing me towards the cause of the problem. I'm running CESM2.2.0 on Cheyenne and have attached my log files. Any insight into what's problematic about this case would be much appreciated.
Thanks,
James
I have a CLM5 case which sometimes runs, but mostly doesn't. I wanted to ask if there is anything I can do to reduce the high crash rate. I am running over a custom domain (sub-Saharan Africa) at a 0.5 degree resolution and outputting CLM5 history fields per PFT as well as on the lon/lat grid. When the model fails, it seem to be after the completion of the first timesteps, implying that the error is related to the process of writing the history files.
The error message in the CESM log is
1369:MPT ERROR: Rank 1369(g:1369) received signal SIGSEGV(11).
1369: Process ID: 28314, Host: r14i7n22, Program: /glade/scratch/jamesking/i.clm5.AfrSSP126_allforcings.000/bld/cesm.exe
1369: MPT Version: HPE MPT 2.21 11/28/19 04:21:40
There are also some NetCDF: variable not found errors but I don't think these are pointing me towards the cause of the problem. I'm running CESM2.2.0 on Cheyenne and have attached my log files. Any insight into what's problematic about this case would be much appreciated.
Thanks,
James
Attachments
-
atm.log.3137410.chadmin1.ib0.cheyenne.ucar.edu.220304-053704.txt111 KB · Views: 3
-
cesm.log.3137410.chadmin1.ib0.cheyenne.ucar.edu.220304-053704.txt81 KB · Views: 1
-
cpl.log.3137410.chadmin1.ib0.cheyenne.ucar.edu.220304-053704.txt53 KB · Views: 0
-
lnd.log.3137410.chadmin1.ib0.cheyenne.ucar.edu.220304-053704.txt305.3 KB · Views: 1