Continuing my struggles to get a global 0.5° run working, I'm now specifying
It works for a few years, but then crashes on October 29, 1980 after less than two realtime hours. I've tried twice, and it doesn't crash on the same timestep (once was at 3600s and the second was at 72000s), which makes me think it might not be a problem with the code.
The first time it crashed, I got the following error in cesm.log:
No error was given the second time it crashed.
Any ideas of where I should start looking for the issue? Thanks as always.
NTASKS: ['CPL:3600', 'ATM:3600', 'LND:3600', 'ICE:3600', 'OCN:3600', 'ROF:3600', 'GLC:3600', 'WAV:3600', 'ESP:100']
. (Suggestions appreciated if any of those seem off to you!)It works for a few years, but then crashes on October 29, 1980 after less than two realtime hours. I've tried twice, and it doesn't crash on the same timestep (once was at 3600s and the second was at 72000s), which makes me think it might not be a problem with the code.
The first time it crashed, I got the following error in cesm.log:
Code:
18:MPT ERROR: Rank 18(g:18) received signal SIGBUS(7).
18: Process ID: 13137, Host: r3i1n7, Program: /glade/scratch/samrabin/halfdeg_test_20220425/bld/cesm.exe
18: MPT Version: HPE MPT 2.22 03/31/20 15:59:10
18:
18:MPT: --------stack traceback-------
-1:MPT ERROR: MPI_COMM_WORLD rank 30 has terminated without calling MPI_Finalize()
-1: aborting job
MPT: Received signal 7
No error was given the second time it crashed.
Any ideas of where I should start looking for the issue? Thanks as always.