egil_storen@met_no
New Member
Hello,
I am trying to port CCSM3 to a Linux cluster based on AMD Opteron processors using PGI compilers. I have managed to build the software, and it runs for some time (about 5 minutes) on 28 nodes before it stops by segmentation violation. The output to stdout/stderr just before it stops is:
(cpl_bundle_copy) WARNING: bundle aoflux_o has accum count = 0
(flux_atmOcn) FYI: this routine is not threaded
print_memusage iam 4 stepon after dynpkg. -1 in the next line means unavailable
print_memusage: size, rss, share, text, datastack= 21282 21280 859 5957 0
... (the above two lines are repeated 12 times, one time from each of the cam allocated nodes)
--- mpimon --- Aborting run after process-3 terminated abnormally Childprocess 873 got signal SIGSEGV(11): segmentation violation ---
I am not sure how to track down this error, and I hope you could give me some advice.
Best regards,
Egil Støren
The Norwegian Meteorological Institute
Norway
I am trying to port CCSM3 to a Linux cluster based on AMD Opteron processors using PGI compilers. I have managed to build the software, and it runs for some time (about 5 minutes) on 28 nodes before it stops by segmentation violation. The output to stdout/stderr just before it stops is:
(cpl_bundle_copy) WARNING: bundle aoflux_o has accum count = 0
(flux_atmOcn) FYI: this routine is not threaded
print_memusage iam 4 stepon after dynpkg. -1 in the next line means unavailable
print_memusage: size, rss, share, text, datastack= 21282 21280 859 5957 0
... (the above two lines are repeated 12 times, one time from each of the cam allocated nodes)
--- mpimon --- Aborting run after process-3 terminated abnormally Childprocess 873 got signal SIGSEGV(11): segmentation violation ---
I am not sure how to track down this error, and I hope you could give me some advice.
Best regards,
Egil Støren
The Norwegian Meteorological Institute
Norway