Scheduled Downtime
On Tuesday 24 October 2023 @ 5pm MT the forums will be in read only mode in preparation for the downtime. On Wednesday 25 October 2023 @ 5am MT, this website will be down for maintenance and expected to return online later in the morning.
Normal Operations
The forums are back online with normal operations. If you notice any issues or errors related to the forums, please reach out to help@ucar.edu

F1850 f19g16 cesm1.2 program stucked

cesm1_2_0 F1850 f19g16
The program stuck at Initialize ocn component OCN,but with no error in the log.
I instrument some print into the source code, find that the program stopped at shr_mpi_mod.F90, SUBROUTINE shr_mpi_bcastc0(vec,comm,string,pebcast) call MPI_BCAST(vec, lsize,MPI_CHARACTER,lpebcast,comm,ierr).

I also try F1850 T31g37 and F1850 f19f19 and F2000f19g16, the problem are the same!
And I have no idea how to fix it. CALL FOR HELP!! Thanks a lot.
my log is in the attachment.
 

jedwards

CSEG and Liaisons
Staff member
First try updating to 1.2.2 and see if your problem has been resolved.   If that doesn't work let us know what compiler and mpi library you are using and we'll see if we can reproduce the problem.
 

jedwards

CSEG and Liaisons
Staff member
I think that your compiler may be too old.   But it's difficult to tell - can you send the cesm.log output?
 
Many thanks. The log  is in the attachment. I am not sure it is the compiler's problem. I update the compiler to  icc version 14.0.2 mpi 3.2. It's  still not work.
 

jedwards

CSEG and Liaisons
Staff member
If the program is hanging in an mpi_bcast it's becasue at least one process in the communicator is not arriving at the call.   I can't tell from your logs which it might be or why.    Try adding the task id to your debugging and see if you can figure out which process.
 
Top