ONLY ERROR, I am suffering while running CAM6_03_80 one-year simulation, it is stopped after 11 months in previous and now after 8 months after the following ERROR
Do you have some expert to respond me
About this
What is MPI_COMM_WORLD Rank
Why my case run is stopped after 8 months of successful run (one year simulation), not running in September, October, November, December 2019.
The path is as follows
/glade/scratch/dksingh/anual_2019_contrail_runtest
The detailed about error is as follows
55:MPT ERROR: Rank 55(g:55) received signal SIGFPE(8).
280:MPT ERROR: Rank 280(g:280) received signal SIGFPE(8).
25:MPT ERROR: Rank 25(g:25) received signal SIGFPE(8).
55:MPT: header=header@entry=0x7fff5be9cc50 "MPT ERROR: Rank 55(g:55) received signal SIGFPE(8).\n\tProcess ID: 11926, Host: r11i2n5, Program: /glade/scratch/dksingh/anual_2019_contrail_runtest/bld/cesm.exe\n\tMPT Version: HPE MPT 2.25 08/14/21 03:"...) at sig.c:340
280:MPT: header=header@entry=0x7ffd134089d0 "MPT ERROR: Rank 280(g:280) received signal SIGFPE(8).\n\tProcess ID: 20831, Host: r11i6n2, Program: /glade/scratch/dksingh/anual_2019_contrail_runtest/bld/cesm.exe\n\tMPT Version: HPE MPT 2.25 08/14/21 0"...) at sig.c:340
55:MPT: cam_in=<error reading variable: value requires 97248 bytes, which is more than max-value-size>,
55:MPT: cam_out=<error reading variable: value requires 107808 bytes, which is more than max-value-size>)
55:MPT: cam_in=<error reading variable: value requires 97248 bytes, which is more than max-value-size>,
55:MPT: cam_out=<error reading variable: value requires 107808 bytes, which is more than max-value-size>)
55:MPT: index=<error reading variable: Cannot access memory at address 0x0>,
55:MPT: existflag=<error reading variable: Cannot access memory at address 0x0>,
55:MPT: syncflag=<error reading variable: Cannot access memory at address 0x0>,
55:MPT: port=<error reading variable: Cannot access memory at address 0x0>,
55:MPT: keywordenforcer=<error reading variable: Cannot access memory at address 0x0>, importstate=..., exportstate=..., clock=...,
55:MPT: syncflag=<error reading variable: Cannot access memory at address 0x0>,
55:MPT: timeout=<error reading variable: Cannot access memory at address 0x0>,
55:MPT: timeoutflag=<error reading variable: Cannot access memory at address 0x0>,
55:MPT: index=<error reading variable: Cannot access memory at address 0x0>,
55:MPT: existflag=<error reading variable: Cannot access memory at address 0x0>,
280:MPT: cam_in=<error reading variable: value requires 97248 bytes, which is more than max-value-size>,
280:MPT: cam_out=<error reading variable: value requires 107808 bytes, which is more than max-value-size>)
280:MPT: cam_in=<error reading variable: value requires 97248 bytes, which is more than max-value-size>,
280:MPT: cam_out=<error reading variable: value requires 107808 bytes, which is more than max-value-size>)
280:MPT: index=<error reading variable: Cannot access memory at address 0x0>,
280:MPT: existflag=<error reading variable: Cannot access memory at address 0x0>,
55:MPT: syncflag=<error reading variable: Cannot access memory at address 0x0>,
55:MPT: port=<error reading variable: Cannot access memory at address 0x0>,
55:MPT: keywordenforcer=<error reading variable: Cannot access memory at address 0x0>, importstate=..., exportstate=..., clock=...,
55:MPT: syncflag=<error reading variable: Cannot access memory at address 0x0>,
55:MPT: timeout=<error reading variable: Cannot access memory at address 0x0>,
55:MPT: timeoutflag=<error reading variable: Cannot access memory at address 0x0>,
55:MPT: index=<error reading variable: Cannot access memory at address 0x0>,
55:MPT: existflag=<error reading variable: Cannot access memory at address 0x0>,
280:MPT: syncflag=<error reading variable: Cannot access memory at address 0x0>,
280:MPT: port=<error reading variable: Cannot access memory at address 0x0>,
280:MPT: keywordenforcer=<error reading variable: Cannot access memory at address 0x0>, importstate=..., exportstate=..., clock=...,
280:MPT: syncflag=<error reading variable: Cannot access memory at address 0x0>,
280:MPT: timeout=<error reading variable: Cannot access memory at address 0x0>,
280:MPT: timeoutflag=<error reading variable: Cannot access memory at address 0x0>,
280:MPT: index=<error reading variable: Cannot access memory at address 0x0>,
280:MPT: existflag=<error reading variable: Cannot access memory at address 0x0>,
55:MPT: importstate=<error reading variable: Location address is not set.>,
55:MPT: exportstate=<error reading variable: Location address is not set.>,
55:MPT: clock=<error reading variable: Cannot access memory at address 0x0>,
55:MPT: syncflag=<error reading variable: Cannot access memory at address 0x0>,
55:MPT: phase=<error reading variable: Cannot access memory at address 0x0>,
55:MPT: port=<error reading variable: Cannot access memory at address 0x0>,
55:MPT: keywordenforcer=<error reading variable: Cannot access memory at address 0x0>, importstate=<error reading variable: Location address is not set.>,
55:MPT: exportstate=<error reading variable: Location address is not set.>,
55:MPT: clock=<error reading variable: Cannot access memory at address 0x0>,
55:MPT: syncflag=<error reading variable: Cannot access memory at address 0x0>,
55:MPT: phase=<error reading variable: Cannot access memory at address 0x0>,
55:MPT: timeout=<error reading variable: Cannot access memory at address 0x0>,
55:MPT: timeoutflag=<error reading variable: Cannot access memory at address 0x0>,
280:MPT: syncflag=<error reading variable: Cannot access memory at address 0x0>,
280:MPT: port=<error reading variable: Cannot access memory at address 0x0>,
280:MPT: keywordenforcer=<error reading variable: Cannot access memory at address 0x0>, importstate=..., exportstate=..., clock=...,
280:MPT: syncflag=<error reading variable: Cannot access memory at address 0x0>,
280:MPT: timeout=<error reading variable: Cannot access memory at address 0x0>,
280:MPT: timeoutflag=<error reading variable: Cannot access memory at address 0x0>,
280:MPT: index=<error reading variable: Cannot access memory at address 0x0>,
280:MPT: existflag=<error reading variable: Cannot access memory at address 0x0>,
280:MPT: importstate=<error reading variable: Location address is not set.>,
280:MPT: exportstate=<error reading variable: Location address is not set.>,
280:MPT: clock=<error reading variable: Cannot access memory at address 0x0>,
280:MPT: syncflag=<error reading variable: Cannot access memory at address 0x0>,
280:MPT: phase=<error reading variable: Cannot access memory at address 0x0>,
280:MPT: port=<error reading variable: Cannot access memory at address 0x0>,
280:MPT: keywordenforcer=<error reading variable: Cannot access memory at address 0x0>, importstate=<error reading variable: Location address is not set.>,
280:MPT: exportstate=<error reading variable: Location address is not set.>,
280:MPT: clock=<error reading variable: Cannot access memory at address 0x0>,
280:MPT: syncflag=<error reading variable: Cannot access memory at address 0x0>,
280:MPT: phase=<error reading variable: Cannot access memory at address 0x0>,
280:MPT: timeout=<error reading variable: Cannot access memory at address 0x0>,
280:MPT: timeoutflag=<error reading variable: Cannot access memory at address 0x0>,
-1:MPT ERROR: MPI_COMM_WORLD rank 55 has terminated without calling MPI_Finalize()
Do you have some expert to respond me
About this
What is MPI_COMM_WORLD Rank
Why my case run is stopped after 8 months of successful run (one year simulation), not running in September, October, November, December 2019.
The path is as follows
/glade/scratch/dksingh/anual_2019_contrail_runtest
The detailed about error is as follows
55:MPT ERROR: Rank 55(g:55) received signal SIGFPE(8).
280:MPT ERROR: Rank 280(g:280) received signal SIGFPE(8).
25:MPT ERROR: Rank 25(g:25) received signal SIGFPE(8).
55:MPT: header=header@entry=0x7fff5be9cc50 "MPT ERROR: Rank 55(g:55) received signal SIGFPE(8).\n\tProcess ID: 11926, Host: r11i2n5, Program: /glade/scratch/dksingh/anual_2019_contrail_runtest/bld/cesm.exe\n\tMPT Version: HPE MPT 2.25 08/14/21 03:"...) at sig.c:340
280:MPT: header=header@entry=0x7ffd134089d0 "MPT ERROR: Rank 280(g:280) received signal SIGFPE(8).\n\tProcess ID: 20831, Host: r11i6n2, Program: /glade/scratch/dksingh/anual_2019_contrail_runtest/bld/cesm.exe\n\tMPT Version: HPE MPT 2.25 08/14/21 0"...) at sig.c:340
55:MPT: cam_in=<error reading variable: value requires 97248 bytes, which is more than max-value-size>,
55:MPT: cam_out=<error reading variable: value requires 107808 bytes, which is more than max-value-size>)
55:MPT: cam_in=<error reading variable: value requires 97248 bytes, which is more than max-value-size>,
55:MPT: cam_out=<error reading variable: value requires 107808 bytes, which is more than max-value-size>)
55:MPT: index=<error reading variable: Cannot access memory at address 0x0>,
55:MPT: existflag=<error reading variable: Cannot access memory at address 0x0>,
55:MPT: syncflag=<error reading variable: Cannot access memory at address 0x0>,
55:MPT: port=<error reading variable: Cannot access memory at address 0x0>,
55:MPT: keywordenforcer=<error reading variable: Cannot access memory at address 0x0>, importstate=..., exportstate=..., clock=...,
55:MPT: syncflag=<error reading variable: Cannot access memory at address 0x0>,
55:MPT: timeout=<error reading variable: Cannot access memory at address 0x0>,
55:MPT: timeoutflag=<error reading variable: Cannot access memory at address 0x0>,
55:MPT: index=<error reading variable: Cannot access memory at address 0x0>,
55:MPT: existflag=<error reading variable: Cannot access memory at address 0x0>,
280:MPT: cam_in=<error reading variable: value requires 97248 bytes, which is more than max-value-size>,
280:MPT: cam_out=<error reading variable: value requires 107808 bytes, which is more than max-value-size>)
280:MPT: cam_in=<error reading variable: value requires 97248 bytes, which is more than max-value-size>,
280:MPT: cam_out=<error reading variable: value requires 107808 bytes, which is more than max-value-size>)
280:MPT: index=<error reading variable: Cannot access memory at address 0x0>,
280:MPT: existflag=<error reading variable: Cannot access memory at address 0x0>,
55:MPT: syncflag=<error reading variable: Cannot access memory at address 0x0>,
55:MPT: port=<error reading variable: Cannot access memory at address 0x0>,
55:MPT: keywordenforcer=<error reading variable: Cannot access memory at address 0x0>, importstate=..., exportstate=..., clock=...,
55:MPT: syncflag=<error reading variable: Cannot access memory at address 0x0>,
55:MPT: timeout=<error reading variable: Cannot access memory at address 0x0>,
55:MPT: timeoutflag=<error reading variable: Cannot access memory at address 0x0>,
55:MPT: index=<error reading variable: Cannot access memory at address 0x0>,
55:MPT: existflag=<error reading variable: Cannot access memory at address 0x0>,
280:MPT: syncflag=<error reading variable: Cannot access memory at address 0x0>,
280:MPT: port=<error reading variable: Cannot access memory at address 0x0>,
280:MPT: keywordenforcer=<error reading variable: Cannot access memory at address 0x0>, importstate=..., exportstate=..., clock=...,
280:MPT: syncflag=<error reading variable: Cannot access memory at address 0x0>,
280:MPT: timeout=<error reading variable: Cannot access memory at address 0x0>,
280:MPT: timeoutflag=<error reading variable: Cannot access memory at address 0x0>,
280:MPT: index=<error reading variable: Cannot access memory at address 0x0>,
280:MPT: existflag=<error reading variable: Cannot access memory at address 0x0>,
55:MPT: importstate=<error reading variable: Location address is not set.>,
55:MPT: exportstate=<error reading variable: Location address is not set.>,
55:MPT: clock=<error reading variable: Cannot access memory at address 0x0>,
55:MPT: syncflag=<error reading variable: Cannot access memory at address 0x0>,
55:MPT: phase=<error reading variable: Cannot access memory at address 0x0>,
55:MPT: port=<error reading variable: Cannot access memory at address 0x0>,
55:MPT: keywordenforcer=<error reading variable: Cannot access memory at address 0x0>, importstate=<error reading variable: Location address is not set.>,
55:MPT: exportstate=<error reading variable: Location address is not set.>,
55:MPT: clock=<error reading variable: Cannot access memory at address 0x0>,
55:MPT: syncflag=<error reading variable: Cannot access memory at address 0x0>,
55:MPT: phase=<error reading variable: Cannot access memory at address 0x0>,
55:MPT: timeout=<error reading variable: Cannot access memory at address 0x0>,
55:MPT: timeoutflag=<error reading variable: Cannot access memory at address 0x0>,
280:MPT: syncflag=<error reading variable: Cannot access memory at address 0x0>,
280:MPT: port=<error reading variable: Cannot access memory at address 0x0>,
280:MPT: keywordenforcer=<error reading variable: Cannot access memory at address 0x0>, importstate=..., exportstate=..., clock=...,
280:MPT: syncflag=<error reading variable: Cannot access memory at address 0x0>,
280:MPT: timeout=<error reading variable: Cannot access memory at address 0x0>,
280:MPT: timeoutflag=<error reading variable: Cannot access memory at address 0x0>,
280:MPT: index=<error reading variable: Cannot access memory at address 0x0>,
280:MPT: existflag=<error reading variable: Cannot access memory at address 0x0>,
280:MPT: importstate=<error reading variable: Location address is not set.>,
280:MPT: exportstate=<error reading variable: Location address is not set.>,
280:MPT: clock=<error reading variable: Cannot access memory at address 0x0>,
280:MPT: syncflag=<error reading variable: Cannot access memory at address 0x0>,
280:MPT: phase=<error reading variable: Cannot access memory at address 0x0>,
280:MPT: port=<error reading variable: Cannot access memory at address 0x0>,
280:MPT: keywordenforcer=<error reading variable: Cannot access memory at address 0x0>, importstate=<error reading variable: Location address is not set.>,
280:MPT: exportstate=<error reading variable: Location address is not set.>,
280:MPT: clock=<error reading variable: Cannot access memory at address 0x0>,
280:MPT: syncflag=<error reading variable: Cannot access memory at address 0x0>,
280:MPT: phase=<error reading variable: Cannot access memory at address 0x0>,
280:MPT: timeout=<error reading variable: Cannot access memory at address 0x0>,
280:MPT: timeoutflag=<error reading variable: Cannot access memory at address 0x0>,
-1:MPT ERROR: MPI_COMM_WORLD rank 55 has terminated without calling MPI_Finalize()