While running B1850, f19_g17 case, my processing speed remains slow no matter how I change the pelayout. I let the components run in sequence with ROOTPE=0, NTHRDS=1 for all, increased NTASKS from 168 (28pes/node) to 420 and found that the processing speed remain less than 5 years/wallclock day.
Then I changed the OCN to be ran concurrently with other components, I found the best performance is NTASKS_OCN=168 with other NTASKS=336 under NTHRDS=1, the speed is almost 9 years/wallclockday. However, setting NTASKS_OCN=168, other NTASKS=336 as a reference, no matter how I increase the NTASKS of OCN or other components, the speed remains lower than 9.5 years/day. I have checked the timing provided by CESM2's official website and found that they could process with more than 20yrs/d with more than 1000 or even 2000pes at B1850 f19_g17, does anyone know why I couldn't get a higher speed by increasing pes? (My total pes is only no more than several hundreds and the speed won't increase!)
Thanks a lot! Attached is a cpl log for NTASKS=336 for components other than OCN while NTASKS_OCN=168.
Then I changed the OCN to be ran concurrently with other components, I found the best performance is NTASKS_OCN=168 with other NTASKS=336 under NTHRDS=1, the speed is almost 9 years/wallclockday. However, setting NTASKS_OCN=168, other NTASKS=336 as a reference, no matter how I increase the NTASKS of OCN or other components, the speed remains lower than 9.5 years/day. I have checked the timing provided by CESM2's official website and found that they could process with more than 20yrs/d with more than 1000 or even 2000pes at B1850 f19_g17, does anyone know why I couldn't get a higher speed by increasing pes? (My total pes is only no more than several hundreds and the speed won't increase!)
Thanks a lot! Attached is a cpl log for NTASKS=336 for components other than OCN while NTASKS_OCN=168.