Issue with negative block size

jeani@uio_no

New Member
Hi,

I fail to run CESM2.1.1 with --compset B1850 --res f19_g17 for no obvious reason, on several machines, with various number of CPUs (from 16 to 1024) and different compilers (including GNU 7.3.0 and Intel 2018.1 / 2019.5).

The simulation simply crashes, and when looking at the cesm.log I noticed that there were sometimes strange values for the block sizes, for instance:

1 IMOD, NAPROC, NBLKRS, NSPEC, RSBLKS= 1 32 538976288 600 538976288
or
1 IMOD, NAPROC, NBLKRS, NSPEC, RSBLKS= 1 32 -536870912 600 -956378981

This seem to come from components/ww3/src/source/w3odatmd.f90 where NBLKRS is the "Number of blocks in communication of spectra" and RSBLKS is the "Corresponding block size".

Can that be the cause of the failure?

Can these number be negative?

Regards,

Jean
 

jedwards

CSEG and Liaisons
Staff member
I don't recognize this error. What is the pelayout? Run script ./pelayout in your case directory.
 

jeani@uio_no

New Member
Hi Jim,

The error occurs whatever the PE layout, but for this example it was:

./pelayout
Comp NTASKS NTHRDS ROOTPE
CPL : 32/ 1; 0
ATM : 32/ 1; 0
LND : 32/ 1; 0
ICE : 32/ 1; 0
OCN : 32/ 1; 0
ROF : 32/ 1; 0
GLC : 32/ 1; 0
WAV : 32/ 1; 0
ESP : 1/ 1; 0

Does this help?

Mvh

Jean
 
Back
Top