Scheduled Downtime
On Tuesday 24 October 2023 @ 5pm MT the forums will be in read only mode in preparation for the downtime. On Wednesday 25 October 2023 @ 5am MT, this website will be down for maintenance and expected to return online later in the morning.
Normal Operations
The forums are back online with normal operations. If you notice any issues or errors related to the forums, please reach out to help@ucar.edu

errors occur when the cesm2.1.3 start running

Dear sir
When i submit the cesm.exe to run the cesm2.1.3with the command
yhbatch -N 4 -n 128 -p TH_HPC3N submit.sh
i meet the pio problems.
Invalid PIO rearranger comm max pend req (comp2io), 0
Resetting PIO rearranger comm max pend req (comp2io) to 64
PIO rearranger options:
comm type =
p2p

comm fcd =
2denable

max pend req (comp2io) = 0
enable_hs (comp2io) = T
enable_isend (comp2io) = F
max pend req (io2comp) = 64
enable_hs (io2comp) = F
enable_isend (io2comp) = T
(seq_comm_setcomm) init ID ( 1 GLOBAL ) pelist = 0 127 1 ( npes = 128) ( nthreads = 1)( suffix =)
(seq_comm_setcomm) init ID ( 2 CPL ) pelist = 0 63 1 ( npes = 64) ( nthreads = 1)( suffix =)
(seq_comm_setcomm) init ID ( 5 ATM ) pelist = 0 127 1 ( npes = 128) ( nthreads = 1)( suffix =)
(seq_comm_joincomm) init ID ( 6 CPLATM ) join IDs = 2 5 ( npes = 128) ( nthreads = 1)
(seq_comm_jcommarr) init ID ( 3 ALLATMID ) join multiple comp IDs ( npes = 128) ( nthreads = 1)
(seq_comm_joincomm) init ID ( 4 CPLALLATMID ) join IDs = 2 3 ( npes = 128) ( nthreads = 1)
(seq_comm_setcomm) init ID ( 9 LND ) pelist = 0 63 1 ( npes = 64) ( nthreads = 1)( suffix =)
(seq_comm_joincomm) init ID ( 10 CPLLND ) join IDs = 2 9 ( npes = 64) ( nthreads = 1)
(seq_comm_jcommarr) init ID ( 7 ALLLNDID ) join multiple comp IDs ( npes = 64) ( nthreads = 1)
(seq_comm_joincomm) init ID ( 8 CPLALLLNDID ) join IDs = 2 7 ( npes = 64) ( nthreads = 1)
(seq_comm_setcomm) init ID ( 13 ICE ) pelist = 0 63 1 ( npes = 64) ( nthreads = 1)( suffix =)
(seq_comm_joincomm) init ID ( 14 CPLICE ) join IDs = 2 13 ( npes = 64) ( nthreads = 1)
(seq_comm_jcommarr) init ID ( 11 ALLICEID ) join multiple comp IDs ( npes = 64) ( nthreads = 1)
(seq_comm_joincomm) init ID ( 12 CPLALLICEID ) join IDs = 2 11 ( npes = 64) ( nthreads = 1)
(seq_comm_setcomm) init ID ( 17 OCN ) pelist = 0 63 1 ( npes = 64) ( nthreads = 1)( suffix =)
(seq_comm_joincomm) init ID ( 18 CPLOCN ) join IDs = 2 17 ( npes = 64) ( nthreads = 1)
(seq_comm_jcommarr) init ID ( 15 ALLOCNID ) join multiple comp IDs ( npes = 64) ( nthreads = 1)
(seq_comm_joincomm) init ID ( 16 CPLALLOCNID ) join IDs = 2 15 ( npes = 64) ( nthreads = 1)
........................
I only submit the job with no batch system, directly submiting with the cesm.exe. I have checked out the previous posts to find answers to this questions.
I don't quite understand how to solve this problem, I need some help, thanks for your attention and help.
 

fischer

CSEG and Liaisons
Staff member
You need to use the case.submit script to run your jobs. It sets several environment variables that are needed to run.
To run locally you do case.submit --no-batch.

Chris
 
Dear sir
I think the two statements are the errors points. One is GPTLstopf thread 0: timer for "a:PIO:pre_pio_write_nf" had not been started, the other is
newchild: child "a:PIO:pio_write_darray" can't be a parent of itself.
I don't search the relevant content in the forum. Could i have your help?
Thanks a lot.
Yuhang
 

fischer

CSEG and Liaisons
Staff member
I'm not sure what's happening. Can you provide information that's requested in

Thanks
Chris
 

jeffhu

jeffhu
New Member
Dear sir
I think the two statements are the errors points. One is GPTLstopf thread 0: timer for "a:PIO:pre_pio_write_nf" had not been started, the other is
newchild: child "a:PIO:pio_write_darray" can't be a parent of itself.
I don't search the relevant content in the forum. Could i have your help?
Thanks a lot.
Yuhang
Hi Yuhang, have you solved that? I have similar errors.
 

Sreerag

Sreerag
New Member
Do you find any resolution for this problem, I am also facing the same. If anyone know please command on it
Dear sir
I think the two statements are the errors points. One is GPTLstopf thread 0: timer for "a:PIO:pre_pio_write_nf" had not been started, the other is
newchild: child "a:PIO:pio_write_darray" can't be a parent of itself.
I don't search the relevant content in the forum. Could i have your help?
Thanks a lot.
Yuhang

Thanks in advance
Sreerag
 

katec

CSEG and Liaisons
Staff member
Hi Sreerag,
I don't know specifically what this error means, but it could be related to how the PIO library is linked or loaded by the machine. The first poster was not using case.submit, which sets up many libraries and environment configurations. If you are not using case.submit to submit your job, then you should do that. If you are using case.submit and still getting this error than please send more information about your machine and experiment set up as in this: Information to include in help requests
 
Top