Hello DiscussCESM users:
I ran a simulation on 4608 cores on Eiger (a machine similar to Derecho). My understanding is that CESM should scale well until the hard limits for CAM and POP are reached. However, the performance numbers look inconsistent to me: the slowest component was ATM (12.41 myears/wday), yet the overall throughput was only 6.35 myears/wday, roughly half of that. I was not expecting this outcome. The timing profile is below.
Cheers,
-Jonathan
---------------- TIMING PROFILE ---------------------
  Case        : intel_cesm3_0_beta01_BLT1850_v0c_O4608_01
  LID         : 3210216.240715-160339
  Machine     : eiger
  Caseroot    : /capstor/scratch/cscs/jbuzan/cesm3_0_beta01/cases/intel_cesm3_0_beta01_BLT1850_v0c_O4608_01
  Timeroot    : /capstor/scratch/cscs/jbuzan/cesm3_0_beta01/cases/intel_cesm3_0_beta01_BLT1850_v0c_O4608_01/Tools
  User        : jbuzan
  Curr Date   : Mon Jul 15 16:15:04 2024
  Driver      : CMEPS
  grid        : a%ne30np4.pg3_l%ne30np4.pg3_oi%tx2_3v2_r%r05_g%gris4_w%null_z%null_m%tx2_3v2
  compset     : 1850_CAM%DEV%LT%GHGMAM4_CLM51%BGC-CROP_CICE_MOM6_MOSART_CISM2%GRIS-NOEVOLVE_SWAV_SESP
  run type    : startup, continue_run = FALSE (inittype = TRUE)
  stop option : ndays, stop_n = 5
  run length  : 5 days (4.958333333333333 for ocean)

  component       comp_pes   root_pe   tasks  x threads   instances (stride)
  ---------       --------   -------   -----    -------   --------- --------
  cpl = cpl       4608       0         4608   x 1         1         (1)
  atm = cam       3840       0         3840   x 1         1         (1)
  lnd = clm       1280       0         1280   x 1         1         (1)
  ice = cice      2304       1280      2304   x 1         1         (1)
  ocn = mom       768        3840      768    x 1         1         (1)
  rof = mosart    1024       0         1024   x 1         1         (1)
  glc = cism      64         3712      64     x 1         1         (1)
  wav = swav      64         3776      64     x 1         1         (1)
  esp = sesp      1          0         1      x 1         1         (1)

  total pes active           : 4608
  mpi tasks per node         : 128
  pe count for cost estimate : 4608

  Overall Metrics:
    Model Cost:        17427.71   pe-hrs/simulated_year
    Model Throughput:      6.35   simulated_years/day

    Init Time   :     345.573 seconds
    Run Time    :     186.512 seconds       37.302 seconds/day
    Final Time  :      14.170 seconds

Runs Time in total seconds, seconds/model-day, and model-years/wall-day
CPL Run Time represents time in CPL pes alone, not including time associated with data exchange with other components

    TOT Run Time:     186.512 seconds       37.302 seconds/mday         6.35 myears/wday
    CPL Run Time:       7.561 seconds        1.512 seconds/mday       156.54 myears/wday
    ATM Run Time:      95.364 seconds       19.073 seconds/mday        12.41 myears/wday
    LND Run Time:      14.947 seconds        2.989 seconds/mday        79.18 myears/wday
    ICE Run Time:      10.890 seconds        2.178 seconds/mday       108.69 myears/wday
    OCN Run Time:      65.010 seconds       13.002 seconds/mday        18.21 myears/wday
    ROF Run Time:       0.577 seconds        0.115 seconds/mday      2051.59 myears/wday
    GLC Run Time:       0.000 seconds        0.000 seconds/mday         0.00 myears/wday
    WAV Run Time:       0.000 seconds        0.000 seconds/mday         0.00 myears/wday
    ESP Run Time:       0.000 seconds        0.000 seconds/mday         0.00 myears/wday
    CPL COMM Time:     80.675 seconds       16.135 seconds/mday        14.67 myears/wday

NOTE: min:max driver timers (seconds/day):
    CPL (pes 0 to 4607)
    ATM (pes 0 to 3839)
    LND (pes 0 to 1279)
    ICE (pes 1280 to 3583)
    OCN (pes 3840 to 4607)
    ROF (pes 0 to 1023)
    GLC (pes 3712 to 3775)
    WAV (pes 3776 to 3839)
    ESP (pes 0 to 0)

More info on coupler timing:
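
To make the mismatch concrete, here is a minimal Python sketch of the arithmetic behind the figures in the profile. It assumes a 365-day model year and an 86400-second wall day; under those assumptions it reproduces the reported TOT, ATM and OCN throughputs to two decimals. The function and variable names are my own and are not part of any CESM tool. It also prints how much of the total run time falls outside the ATM run timer, next to the reported CPL COMM time. The two are of similar size, which hints (but does not prove) that coupler communication or waiting, rather than any single component's run timer, makes up most of the gap.

```python
# Sketch only: recompute throughput from the run times in the timing profile.
# Assumptions (not stated in the profile): 365-day model year, 86400 s per wall day.
SECONDS_PER_WALL_DAY = 86400.0
DAYS_PER_MODEL_YEAR = 365.0
RUN_DAYS = 5  # stop_n = 5 with stop option ndays

# Total run-phase seconds per timer, copied from the profile.
run_seconds = {
    "TOT": 186.512,
    "ATM": 95.364,
    "OCN": 65.010,
    "LND": 14.947,
    "ICE": 10.890,
    "CPL": 7.561,
    "CPL COMM": 80.675,
}


def myears_per_wday(seconds, run_days=RUN_DAYS):
    """Convert wall-clock seconds for `run_days` model days to model-years per wall-day."""
    seconds_per_model_day = seconds / run_days
    seconds_per_model_year = seconds_per_model_day * DAYS_PER_MODEL_YEAR
    return SECONDS_PER_WALL_DAY / seconds_per_model_year


for name, secs in run_seconds.items():
    print(f"{name:9s} {secs:8.3f} s  {myears_per_wday(secs):8.2f} myears/wday")

# Run time not covered by the ATM run timer, compared with CPL COMM time.
outside_atm = run_seconds["TOT"] - run_seconds["ATM"]
print(f"TOT - ATM = {outside_atm:.3f} s; CPL COMM = {run_seconds['CPL COMM']:.3f} s")
```

Because throughput is inversely proportional to wall time, the component throughputs cannot simply be compared with the total: the total reflects every second of the run loop, including time the timers attribute to coupling rather than to a component.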