Scheduled Downtime
On Tuesday 24 October 2023 @ 5pm MT the forums will be in read only mode in preparation for the downtime. On Wednesday 25 October 2023 @ 5am MT, this website will be down for maintenance and expected to return online later in the morning.
Normal Operations
The forums are back online with normal operations. If you notice any issues or errors related to the forums, please reach out to help@ucar.edu

Program received signal SIGSEGV: Segmentation fault - invalid memory reference.

I'm attempting to port CESM 2.1.3 to my local cluster and run a CIAF compset. The model builds successfully but fails at runtime with the SIGSEGV error above. I am using a GNU compiler with netcdf-c v4.8, netcdf-fortran v4.8, and openmpi v5.1.0. I've attached the cesm.log file, cesm.bldlog, pio.bldlog as well as my config_machines and config_compiler xml files. See the last entry in both xml files for my machine named "marvin". I have run with DEBUG=TRUE but still can't figure out the traceback of errors.
 

Attachments

  • config_machines.xml.txt
    109.2 KB · Views: 6
  • config_compilers.xml.txt
    41.3 KB · Views: 4
  • cesm.log.9483.220329-092915.txt
    44.1 KB · Views: 20
  • cesm.bldlog.220329-092708.txt
    52.2 KB · Views: 8
  • pio.bldlog.220329-092708.txt
    214.1 KB · Views: 5

jedwards

CSEG and Liaisons
Staff member
Traceback shows that you are failing in the call to mpi_init. Have you run an mpi_hello_world to confirm that
mpi is working on your system? Also check your ulimits and make sure that the stack value is set to unlimited or
the maximum allowed value. If nether of these work please consult with your system administators for further suggestions.
 

taoliu_tech

Tao Liu
Member
I'm attempting to port CESM 2.1.3 to my local cluster and run a CIAF compset. The model builds successfully but fails at runtime with the SIGSEGV error above. I am using a GNU compiler with netcdf-c v4.8, netcdf-fortran v4.8, and openmpi v5.1.0. I've attached the cesm.log file, cesm.bldlog, pio.bldlog as well as my config_machines and config_compiler xml files. See the last entry in both xml files for my machine named "marvin". I have run with DEBUG=TRUE but still can't figure out the traceback of errors.
Did you solve the error? I received the same error.
 
Top