cesm.exe terminated with signal 11 at PC SP Backtrace

acgvar

Alvin C.G. Varquez
Member
Dear everyone,

I have reached the point where the model builds successfully, but the run stops about 2 minutes after ./case.submit with the error in the title, expanded below (note: DEBUG is also set to True).
FFLAGS = -qno-opt-dynamic-align -convert big_endian -assume byterecl -ftz -traceback -assume realloc_lhs -fp-model source -dynamic -mkl=sequential -no-fma -O -fPIC -m64
CFLAGS = -dynamic -mkl=sequential -no-fma -O -fPIC -m64 -std=c99

The case was based on the tutorial: ./create_newcase --case ~/cases/b.day1.0 --res f19_g17 --compset B1850
./preview_run shows:
mpirun -ppn 28 -n 168 cesm.exe
./xmlquery NTASKS,NTHRDS,ROOTPE (automatically generated after running ./create_newcase)
NTASKS: ['CPL:112', 'ATM:112', 'LND:56', 'ICE:56', 'OCN:56', 'ROF:56', 'GLC:28', 'WAV:28', 'ESP:1']
NTHRDS: ['CPL:1', 'ATM:1', 'LND:1', 'ICE:1', 'OCN:1', 'ROF:1', 'GLC:1', 'WAV:1', 'ESP:1']
ROOTPE: ['CPL:0', 'ATM:0', 'LND:0', 'ICE:56', 'OCN:112', 'ROF:0', 'GLC:0', 'WAV:0', 'ESP:0']
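
As a cross-check of the layout above, the following minimal sketch derives the MPI rank range of each component and the total task count from the NTASKS and ROOTPE values. It assumes one thread per task and a rank stride of 1; the function names pe_ranges and total_tasks are hypothetical helpers, not part of CIME.

```python
# Values copied from the xmlquery output in the post.
NTASKS = {"CPL": 112, "ATM": 112, "LND": 56, "ICE": 56, "OCN": 56,
          "ROF": 56, "GLC": 28, "WAV": 28, "ESP": 1}
ROOTPE = {"CPL": 0, "ATM": 0, "LND": 0, "ICE": 56, "OCN": 112,
          "ROF": 0, "GLC": 0, "WAV": 0, "ESP": 0}


def pe_ranges(ntasks, rootpe):
    """Return {component: (first_rank, last_rank)}.

    Assumes NTHRDS=1 and a stride of 1 between ranks (hypothetical helper).
    """
    return {c: (rootpe[c], rootpe[c] + ntasks[c] - 1) for c in ntasks}


def total_tasks(ntasks, rootpe):
    """Total MPI tasks = highest rank used by any component, plus one."""
    return max(rootpe[c] + ntasks[c] for c in ntasks)


ranges = pe_ranges(NTASKS, ROOTPE)
total = total_tasks(NTASKS, ROOTPE)

for comp, (first, last) in ranges.items():
    print(f"{comp}: ranks {first}-{last}")
print(f"total MPI tasks: {total}")
print(f"nodes at 28 tasks per node: {total // 28}")
```

With these values the ocean occupies the highest ranks (ROOTPE 112 plus 56 tasks), so the total agrees with the 168 tasks requested by the mpirun line from ./preview_run.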

The resulting cesm.log is truncated below:
Invalid PIO rearranger comm max pend req (comp2io), 0
Resetting PIO rearranger comm max pend req (comp2io) to 64
PIO rearranger options:
comm type = p2p
comm fcd = 2denable

max pend req (comp2io) = 0
enable_hs (comp2io) = T
enable_isend (comp2io) = F
max pend req (io2comp) = 64
enable_hs (io2comp) = F
enable_isend (io2comp) = T
(seq_comm_setcomm) init ID ( 1 GLOBAL ) pelist = 0 167 1 ( npes = 168) ( nthreads = 1)( suffix =)
(seq_comm_setcomm) init ID ( 2 CPL ) pelist = 0 111 1 ( npes = 112) ( nthreads = 1)( suffix =)
(seq_comm_setcomm) init ID ( 5 ATM ) pelist = 0 111 1 ( npes = 112) ( nthreads = 1)( suffix =)
....
WARNING: Rearr optional argument is a pio2 feature, ignored in pio1
WARNING: Rearr optional argument is a pio2 feature, ignored in pio1
WARNING: Rearr optional argument is a pio2 feature, ignored in pio1
WARNING: Rearr optional argument is a pio2 feature, ignored in pio1
....
init_overflows_kmt: KMT = 33 at global (i,j) = 19 370 changed to 32
init_overflows_kmt: KMT = 33 at global (i,j) = 19 371 changed to 32
init_overflows_kmt: KMT = 33 at global (i,j) = 19 372 changed to 32
Overflow: Denmark Strait Inflow region mask at global (ij)= 10 360
Overflow: Denmark Strait Inflow region mask at global (ij)= 11 360
Overflow: Denmark Strait Inflow region mask at global (ij)= 12 360
Overflow: Denmark Strait Inflow region mask at global (ij)= 13 360
Overflow: Denmark Strait Inflow region mask at global (ij)= 14 360
....
no dedicated output process, any file system
starttype: initial

Output requests :
--------------------------------------------------
no dedicated output process, any file system
1 IMOD, NAPROC, NBLKRS, NSPEC, RSBLKS= 1 28 0
600 0
2 IMOD, NAPROC, NBLKRS, NSPEC, RSBLKS= 1 28 10
600 11
1 IMOD, NAPROC, NBLKRS, NSPEC, RSBLKS= 1 28 -414264587
600 1105240874
...
MCT::m_Router::initp_: RGSMap indices not increasing...Will correct
MCT::m_Router::initp_: RGSMap indices not increasing...Will correct
MCT::m_Router::initp_: GSMap indices not increasing...Will correct
(seq_domain_areafactinit) : min/max mdl2drv 0.999565346641447 1.00000000000000 areafact_o_OCN
(seq_domain_areafactinit) : min/max drv2mdl 1.00000000000000 1.00043484236425 areafact_o_OCN
(seq_domain_areafactinit) : min/max mdl2drv 0.999565346641447 1.00000000000000 areafact_i_ICE
(seq_domain_areafactinit) : min/max drv2mdl 1.00000000000000 1.00043484236425 areafact_i_ICE
calcsize j,iq,jac, lsfrm,lstoo 1 1 1 26 21
calcsize j,iq,jac, lsfrm,lstoo 1 1 2 26 21
calcsize j,iq,jac, lsfrm,lstoo 1 2 1 22 15
calcsize j,iq,jac, lsfrm,lstoo 1 2 2 22 15
calcsize j,iq,jac, lsfrm,lstoo 1 3 1 24 17
calcsize j,iq,jac, lsfrm,lstoo 1 3 2 24 17
calcsize j,iq,jac, lsfrm,lstoo 1 4 1 25 20
calcsize j,iq,jac, lsfrm,lstoo 1 4 2 25 20
calcsize j,iq,jac, lsfrm,lstoo 1 5 1 23 19
calcsize j,iq,jac, lsfrm,lstoo 1 5 2 23 19
calcsize j,iq,jac, lsfrm,lstoo 2 1 1 21 26
calcsize j,iq,jac, lsfrm,lstoo 2 1 2 21 26
calcsize j,iq,jac, lsfrm,lstoo 2 2 1 15 22
calcsize j,iq,jac, lsfrm,lstoo 2 2 2 15 22
calcsize j,iq,jac, lsfrm,lstoo 2 3 1 17 24
calcsize j,iq,jac, lsfrm,lstoo 2 3 2 17 24
calcsize j,iq,jac, lsfrm,lstoo 2 4 1 20 25
calcsize j,iq,jac, lsfrm,lstoo 2 4 2 20 25
calcsize j,iq,jac, lsfrm,lstoo 2 5 1 19 23
calcsize j,iq,jac, lsfrm,lstoo 2 5 2 19 23

cesm.exe:46568 terminated with signal 11 at PC=460e3b SP=7fffffff02a0. Backtrace:

cesm.exe:46569 terminated with signal 11 at PC=460e3b SP=7fffffff02a0. Backtrace:

cesm.exe:46570 terminated with signal 11 at PC=460e3b SP=7fffffff02a0. Backtrace:
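
When many ranks report the same failure, it can help to group the crash lines by signal and program counter to see whether they all stop at the same instruction. The sketch below does this for lines in the format shown above; summarize_crashes is a hypothetical helper, and the sample text is just the three lines from this log.

```python
import re
from collections import Counter

# Matches lines such as:
# cesm.exe:46568 terminated with signal 11 at PC=460e3b SP=7fffffff02a0. Backtrace:
CRASH_RE = re.compile(
    r"cesm\.exe:(\d+) terminated with signal (\d+) "
    r"at PC=([0-9a-fA-F]+) SP=([0-9a-fA-F]+)"
)


def summarize_crashes(log_text):
    """Count crashed processes per (signal, PC) pair (hypothetical helper)."""
    counts = Counter()
    for match in CRASH_RE.finditer(log_text):
        _pid, signal, pc, _sp = match.groups()
        counts[(int(signal), pc)] += 1
    return counts


sample = """\
cesm.exe:46568 terminated with signal 11 at PC=460e3b SP=7fffffff02a0. Backtrace:
cesm.exe:46569 terminated with signal 11 at PC=460e3b SP=7fffffff02a0. Backtrace:
cesm.exe:46570 terminated with signal 11 at PC=460e3b SP=7fffffff02a0. Backtrace:
"""

summary = summarize_crashes(sample)
for (signal, pc), n in summary.items():
    print(f"signal {signal} at PC=0x{pc}: {n} process(es)")
```

In the excerpt above, all three processes stop at the same PC. If the executable carries debug symbols, a tool such as addr2line (for example, addr2line -e cesm.exe 0x460e3b) may map that address to a source line; whether this build includes the needed symbols is not clear from the flags shown.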
 

acgvar

Alvin C.G. Varquez
Member
In addition, the stack size was already set in '.bashrc':
ulimit -s unlimited
export MP_STACK_SIZE=64000000
export OMP_STACKSIZE=64000000
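
Settings in '.bashrc' only apply to shells that read that file, and whether the ranks started by the batch job inherit them depends on the scheduler and launcher, which are not described here. One way to check the limit a process actually sees is the small sketch below, which uses Python's standard resource module; running it from inside the same job script is an assumption, and describe_limit is a hypothetical helper.

```python
import resource


def describe_limit(value):
    """Render an rlimit value (in bytes) as text (hypothetical helper)."""
    if value == resource.RLIM_INFINITY:
        return "unlimited"
    # ulimit -s reports KiB, so use the same unit here.
    return f"{value // 1024} KiB"


# Soft and hard stack limits of the current process, in bytes.
soft, hard = resource.getrlimit(resource.RLIMIT_STACK)
print(f"stack soft limit: {describe_limit(soft)}")
print(f"stack hard limit: {describe_limit(hard)}")
```

If the soft limit printed inside the job is not "unlimited", the ulimit line in '.bashrc' is probably not reaching the compute nodes.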