worleyph@gmail_com
New Member
This is probably a compiler/runtime bug rather than a code error, but building CAM5 with cosp (either version 1.3 or 1.4) and OpenMP, and then running with more than one thread on Mira, fails during the first call to cospsimulator_intr_run with the following error (in the core file):
***FAULT Encountered unhandled signal 0x0000000b (11) (SIGSEGV)
Generated by interrupt..................0x00000008 (Data TLB Miss Exception DEAR=0x0000001c09ebf160 ESR=0x0000000000800000)
This happens every time. It occurs with multiple versions of CESM, but I verified it most recently with cesm1_4_beta03:
./create_newcase -case XXX -compset FAMIPC5 -mach mira -res ne30_g16
(in env_build.xml)
(in env_mach_pes.xml, making sure that there is plenty of memory available)
...
(in user_nl_cam, and with any other cosp settings, including disabling most of the individual options)
docosp = .true.
I also tried varying the setting setenv XLSMPOPTS "stack=XXX" in env_mach_specific, up to as large as "stack=1024000000".