andrew_kren@colorado_edu
Member
I started getting an error when I submitted my waccm simulation to the NASA pleiades-san system. Once it began to run, it came up with this error in the cesm log file and stopped:
asremexec (host 'r307i3n3'): request failed - unable to start interactive
connection
/nasa/sgi/mpt/2.08r7/bin/mpiexec_mpt.real: line 335: 21243
Killed $mpicmdline_prefix -f $paramfile
I contacted NAS support at NASA and they said this:
There is an unresolved issue on pleiades that causes MPI jobs to fail at startup. The only know workaround is to retry. The following knowledge base article has more details.
http://www.nas.nasa.gov/hecc/support/kb/MPT-Startup-Failures_469.html
I tried the link suggestions but to no avail. They said there is currently not a timetable for this resolution to be fixed. Does anyone know of a workaround to get it to run still?
Thanks,
asremexec (host 'r307i3n3'): request failed - unable to start interactive
connection
/nasa/sgi/mpt/2.08r7/bin/mpiexec_mpt.real: line 335: 21243
Killed $mpicmdline_prefix -f $paramfile
I contacted NAS support at NASA and they said this:
There is an unresolved issue on pleiades that causes MPI jobs to fail at startup. The only know workaround is to retry. The following knowledge base article has more details.
http://www.nas.nasa.gov/hecc/support/kb/MPT-Startup-Failures_469.html
I tried the link suggestions but to no avail. They said there is currently not a timetable for this resolution to be fixed. Does anyone know of a workaround to get it to run still?
Thanks,