strombrg@dcs_nac_uci_edu
New Member
Hi folks.
I've been tasked with getting CCSM 3.0.1 beta 14 (probably just the most recent version, really) building on lightning in preparation for the arrival of a similar compute cluster here at UCI.
It's building, but I'm not sure it's running as it should. If I'm not mistaken, it appears to be thinking that it should run on only one node - is that what it should be doing? The final output is:
Shared memory file: /tmp/gmpi_shmem-9504250:[0-9]*.tmp
/usr/bin/ssh -l strombrg ln0310en "cd /ptmp/strombrg/T31x3/all && exec env GMPI_MASTER=ln0310en GMPI_PORT=47854 GMPI_SHMEM=1 GMPI_SHMEM_PREFIX=/tmp/gmpi_shmem- GMPI_VERBOSE=1 LD_LIBRARY_PATH=/contrib/2.6/pathscale/2.2.1/lib/2.2.1:/contrib/2.6/mpich-gm/1.2.6..14a-pathscale-2.2.1-64/lib:/opt/gm/lib64 DISPLAY=ln0127en:16.0 GMPI_MAGIC=9504250 GMPI_ID=0 GMPI_NP=1 GMPI_BOARD=-1 GMPI_SLAVE=192.168.150.80 /usr/local/lsf/6.2/linux2.6-glibc2.3-x86_64/bin/TaskStarter -p ln0310en:47853 -c /usr/local/lsf/conf -a X86_64 /ptmp/strombrg/T31x3/all/cpl "
All processes have been spawned
Warning: Permanently added the RSA host key for IP address '192.168.150.80' to the list of known hosts.^M
(main) =========================================================================
(main) CCSM Coupler, version 6 (cpl6)
(main) CVS tag $Name: ccsm3_0_1_beta14 $
(main) date & time: 2006-05-18 16:11:13
(main) =========================================================================
(cpl_comm_init) setting up communicators, name = cpl
===================================
warning: global processor 0 is overlapped
(cpl_comm_init) cpl_comm_comp, size: 137 1
User defined signal 2
Job /usr/local/lsf/6.2/linux2.6-glibc2.3-x86_64/bin/gmmpirun_wrapper -v -pg mpirun.pgfile ./cpl
TID HOST_NAME COMMAND_LINE STATUS TERMINATION_TIME
===== ========== ================ ======================= ===================
00000 ln0310en /ptmp/strombrg/T Exit (status unknown)
Thanks!
I've been tasked with getting CCSM 3.0.1 beta 14 (probably just the most recent version, really) building on lightning in preparation for the arrival of a similar compute cluster here at UCI.
It's building, but I'm not sure it's running as it should. If I'm not mistaken, it appears to be thinking that it should run on only one node - is that what it should be doing? The final output is:
Shared memory file: /tmp/gmpi_shmem-9504250:[0-9]*.tmp
/usr/bin/ssh -l strombrg ln0310en "cd /ptmp/strombrg/T31x3/all && exec env GMPI_MASTER=ln0310en GMPI_PORT=47854 GMPI_SHMEM=1 GMPI_SHMEM_PREFIX=/tmp/gmpi_shmem- GMPI_VERBOSE=1 LD_LIBRARY_PATH=/contrib/2.6/pathscale/2.2.1/lib/2.2.1:/contrib/2.6/mpich-gm/1.2.6..14a-pathscale-2.2.1-64/lib:/opt/gm/lib64 DISPLAY=ln0127en:16.0 GMPI_MAGIC=9504250 GMPI_ID=0 GMPI_NP=1 GMPI_BOARD=-1 GMPI_SLAVE=192.168.150.80 /usr/local/lsf/6.2/linux2.6-glibc2.3-x86_64/bin/TaskStarter -p ln0310en:47853 -c /usr/local/lsf/conf -a X86_64 /ptmp/strombrg/T31x3/all/cpl "
All processes have been spawned
Warning: Permanently added the RSA host key for IP address '192.168.150.80' to the list of known hosts.^M
(main) =========================================================================
(main) CCSM Coupler, version 6 (cpl6)
(main) CVS tag $Name: ccsm3_0_1_beta14 $
(main) date & time: 2006-05-18 16:11:13
(main) =========================================================================
(cpl_comm_init) setting up communicators, name = cpl
===================================
warning: global processor 0 is overlapped
(cpl_comm_init) cpl_comm_comp, size: 137 1
User defined signal 2
Job /usr/local/lsf/6.2/linux2.6-glibc2.3-x86_64/bin/gmmpirun_wrapper -v -pg mpirun.pgfile ./cpl
TID HOST_NAME COMMAND_LINE STATUS TERMINATION_TIME
===== ========== ================ ======================= ===================
00000 ln0310en /ptmp/strombrg/T Exit (status unknown)
Thanks!