Hi MOM6 experts:
Two questions:
(1) When the code exits with a Fatal error, is there normally an MPI_Barrier before the MPI_Finalize?
I am a little concerned about whether I am seeing all the output in my mom6.o* log file. Is it possible that the lead proc detects the Fatal error and shuts down before all the output has been flushed to the mom6.o* log file?
(2) The Fatal error is due to a disagreement of checksums in an input file. What could cause a corrupt checksum?
86593:FATAL from PE 0: SIS_restart(restore_state): Checksum of input field part_size AFF344408441C19A does not match value 231344408441C19A stored in RESTART/ice_model.res.nc
There are two oddities about this: (a) the checksums differ only in the first three hex digits, and (b) the md5sum of the entire file matches that of the same file on another computer where the code is known to work.
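To make oddity (a) concrete, here is a quick standalone Python sketch (not part of MOM6 or SIS2; the variable names are my own) that XORs the two values from the error message and lists which hex digits disagree:

```python
# Checksum values copied from the FATAL message above.
computed = 0xAFF344408441C19A  # checksum of the field as read in
stored = 0x231344408441C19A    # value stored in RESTART/ice_model.res.nc

# XOR leaves a 1 bit wherever the two 64-bit values disagree.
diff = computed ^ stored
print(f"{diff:016X}")

# Positions (0 = leftmost) of the hex digits that contain a difference.
differing_digits = [i for i, c in enumerate(f"{diff:016X}") if c != "0"]
print(differing_digits)
```

Only the three leading hex digits (the top 12 bits) differ; the remaining 52 bits of the two checksums are identical.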
-Ed