ep2764@columbia_edu
I am running into a runtime error coming from PIO. It shows up in the CESM log as follows:
...
...
Opened existing file
/nobackup/mjmills2/ccsmdata/inputdata/atm/waccm/ic/waccm_geos5_2x_88L_2005-01-0
1_c110419.nc -1
Opened existing file
/nobackup/mjmills2/ccsmdata/inputdata/atm/cam/met/USGS-gtopo30_1.9x2.5_phys_geo
s5_c100929.nc -1
pio_support::pio_die:: myrank= -1 : ERROR: nf_mod.F90: 1229 :
NETCDF not enabled in the build
pio_support::pio_die:: myrank= -1 : ERROR: nf_mod.F90: 1229 :
NETCDF not enabled in the build
pio_support::pio_die:: myrank= -1 : ERROR: nf_mod.F90: 1229 :
NETCDF not enabled in the build
pio_support::pio_die:: myrank= -1 : ERROR: nf_mod.F90: 1229 :
NETCDF not enabled in the build
MPT: Global rank 5 is aborting with error code 1.
Process ID: 5985, Host: r329i5n16, Program: /nobackupp8/epeck/sdwaccm_codrescu_1_2_2/bld/cesm.exe
MPT: --------stack traceback-------
pio_support::pio_die:: myrank= -1 : ERROR: nf_mod.F90: 1229 :
NETCDF not enabled in the build
pio_support::pio_die:: myrank= -1 : ERROR: nf_mod.F90: 1229 :
NETCDF not enabled in the build
pio_support::pio_die:: myrank= -1 : ERROR: nf_mod.F90: 1229 :
NETCDF not enabled in the build
pio_support::pio_die:: myrank= -1 : ERROR: nf_mod.F90: 1229 :
NETCDF not enabled in the build
pio_support::pio_die:: myrank= -1 : ERROR: nf_mod.F90: 1229 :
NETCDF not enabled in the build
pio_support::pio_die:: myrank= -1 : ERROR: nf_mod.F90: 1229 :
NETCDF not enabled in the build
MPT: Global rank 93 is aborting with error code 1.
Process ID: 94596, Host: r329i6n3, Program: /nobackupp8/epeck/sdwaccm_codrescu_1_2_2/bld/cesm.exe
...
... this repeats for a while followed by other messages
...
...
MPT: #13 0x00007fffffff84fc in ?? ()
MPT: #14 0x00007fffffff8500 in ?? ()
MPT: #15 0x00007fffffff8504 in ?? ()
MPT: #16 0x00007fffffff80a8 in ?? ()
MPT: #17 0x00007fffffff80ac in ?? ()
MPT: #18 0x00000000055b1340 in ?? ()
MPT: #19 0x0000000000000050 in ?? ()
MPT: #20 0x0000000000000050 in ?? ()
MPT: #21 0x0000000000000020 in ?? ()
MPT: #22 0x000000000b7ae440 in ?? ()
MPT: #23 0x0000000000000020 in ?? ()
MPT: #24 0x000000000b7ae440 in ?? ()
MPT: #25 0xffffffee136a6201 in ?? ()
MPT: #26 0x0000000000000004 in ?? ()
MPT: #27 0xffffffee137e0001 in ?? ()
MPT: #28 0x0000001900000018 in ?? ()
MPT: #29 0x08000cd80b7b5b30 in ?? ()
MPT: #30 0x00002aaa00040007 in ?? ()
MPT: #31 0x0849ab9c0131f0b5 in ?? ()
MPT: #32 0x00002aaaac64a598 in mlx4_free_srq_wqe (srq=0x1b68d5c, ind=-46260)
MPT: at src/srq.c:55
MPT: #33 0x00002aaaac648303 in mlx4_poll_cq (ibcq=0x246, ne=1, wc=0x600000000)
MPT: at src/cq.c:370
MPT: #34 0x00002aaaaad5aa69 in ibv_poll_cq (wc=,
MPT: num_entries=, cq=)
MPT: at ../../../../include/verbs_1.2.h:940
MPT: #35 MPI_SGI_ib_progress (wc=,
MPT: num_entries=, cq=)
MPT: at ibdev_multirail.c:4475
MPT: #36 0x00002aaaaad7ca5d in MPI_SGI_progress () at progress.c:215
MPT: #37 0x00002aaaaad87221 in MPI_SGI_request_wait (request=0x7fffffff5ce4,
MPT: status=0x5592a60, set=0x7fffffff5ce0, gen_rc=0x7fffffff5cdc) at req.c:1608
MPT: #38 0x00002aaaaad8f8fd in MPI_SGI_recv (buf=,
MPT: count=, type=,
MPT: des=, tag=,
MPT: comm=, status=0x7fffffff88b0) at sugar.c:40
MPT: #39 0x0000000000000000 in ?? ()
MPT: (gdb) A debugging session is active.
MPT:
MPT: Inferior 1 [process 64775] will be detached.
MPT:
MPT: Quit anyway? (y or n) [answered Y; input not from terminal]
MPT: Detaching from program: /proc/64775/exe, process 64775
MPT: -----stack traceback ends-----
MPT: -----stack traceback ends-----
MPT: -----stack traceback ends-----
MPT: MPI_COMM_WORLD rank 49 has terminated without calling MPI_Finalize()
aborting job
The only lines that say ERROR come from PIO (nf_mod.F90, line 1229). Any ideas on what is wrong? Thanks!
-Ethan
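For anyone triaging a similar log, here is a minimal sketch of how the distinct ERROR messages could be collected and counted, so that a message repeated by many ranks is reported once with its count. The function name `summarize_errors` and the `sample` list are illustrative assumptions, not part of CESM or PIO, and the sketch assumes the message text continues on the line after the `ERROR:` header, as it does in the log above.

```python
from collections import Counter


def summarize_errors(lines):
    """Count distinct ERROR messages in an iterable of log lines.

    Hypothetical helper: each line containing "ERROR:" is paired with the
    line that follows it, on the assumption that the message continues there.
    """
    lines = [ln.strip() for ln in lines]
    counts = Counter()
    for i, line in enumerate(lines):
        if "ERROR:" not in line:
            continue
        # Take the next line as the message detail, unless it is itself
        # another ERROR header or the log ends here.
        detail = lines[i + 1] if i + 1 < len(lines) else ""
        if "ERROR:" in detail:
            detail = ""
        counts[(line, detail)] += 1
    return counts


# Sample lines copied from the log above.
sample = [
    "pio_support::pio_die:: myrank= -1 : ERROR: nf_mod.F90: 1229 :",
    "NETCDF not enabled in the build",
    "pio_support::pio_die:: myrank= -1 : ERROR: nf_mod.F90: 1229 :",
    "NETCDF not enabled in the build",
    "MPT: Global rank 5 is aborting with error code 1.",
]

for (header, detail), n in summarize_errors(sample).items():
    print(f"{n}x {header} {detail}")
```

To run it on a real log, the `sample` list could be replaced with an open file handle, since the function accepts any iterable of lines.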