wlee@purdue_edu
New Member
Dear CCSM developers,
I am running a test case "TER.01a.T31_gx3v5.B" for 10 days.
When I set walltime to 30 minutes and 90 minutes, I had the same errors and the same output messages.
Error message looks like this
----------------------------------------------------
print_memusage: size, rss, share, text, datastack= 43294 16178 1615 643 0
print_memusage iam 1 End aerosol_initialize. -1 in the next line means unavailable
print_memusage: size, rss, share, text, datastack= 28028 15546 453 643 0
PGFIO-F-253/unformatted read/unit=4/attempt to read non-existent record (direct access).
File name = /grp/tgportal/CCSM/ccsm-work/TER.01a.T31_gx3v5.B.steele.231142/ocn/input/chl_dat unformatted, direct access record = 1
In source file /grp/tgportal/CCSM/ccsm-work/TER.01a.T31_gx3v5.B.steele.231142/ocn/obj/source/io.F, at line number 854
print_memusage iam 0 stepon after dynpkg. -1 in the next line means unavailable
print_memusage: size, rss, share, text, datastack= 55050 34677 2577 643 0
print_memusage iam 1 stepon after dynpkg. -1 in the next line means unavailable
print_memusage: size, rss, share, text, datastack= 39667 34186 663 643 0
=>> PBS: job killed: walltime 651 exceeded limit 1800
Terminated
-----------------------------------------------------------------
Tail of Output is as follows
-----------------------------------------------------------------
(tStamp_write) cpl model date 0001-01-01 72000s wall clock 2008-08-08 01:34:41 avg dt 3s dt 2s
(tStamp_write) cpl model date 0001-01-01 75600s wall clock 2008-08-08 01:34:43 avg dt 3s dt 2s
(tStamp_write) cpl model date 0001-01-01 79200s wall clock 2008-08-08 01:34:46 avg dt 3s dt 2s
(tStamp_write) cpl model date 0001-01-01 82800s wall clock 2008-08-08 01:34:48 avg dt 3s dt 2s
Start of time coordinated integration
(tStamp_write) cpl model date 0001-01-02 00000s wall clock 2008-08-08 01:34:57 avg dt 3s dt 9s
(cpl_control_update) cpl_control_stopEOD = .true.
Terminated
-----------------------------------------------------------------
For another test case, I had the same output/errors for 30 minutes and 24 hours. What's happening here???
Am I doing okay?
I am running a test case "TER.01a.T31_gx3v5.B" for 10 days.
When I set walltime to 30 minutes and 90 minutes, I had the same errors and the same output messages.
Error message looks like this
----------------------------------------------------
print_memusage: size, rss, share, text, datastack= 43294 16178 1615 643 0
print_memusage iam 1 End aerosol_initialize. -1 in the next line means unavailable
print_memusage: size, rss, share, text, datastack= 28028 15546 453 643 0
PGFIO-F-253/unformatted read/unit=4/attempt to read non-existent record (direct access).
File name = /grp/tgportal/CCSM/ccsm-work/TER.01a.T31_gx3v5.B.steele.231142/ocn/input/chl_dat unformatted, direct access record = 1
In source file /grp/tgportal/CCSM/ccsm-work/TER.01a.T31_gx3v5.B.steele.231142/ocn/obj/source/io.F, at line number 854
print_memusage iam 0 stepon after dynpkg. -1 in the next line means unavailable
print_memusage: size, rss, share, text, datastack= 55050 34677 2577 643 0
print_memusage iam 1 stepon after dynpkg. -1 in the next line means unavailable
print_memusage: size, rss, share, text, datastack= 39667 34186 663 643 0
=>> PBS: job killed: walltime 651 exceeded limit 1800
Terminated
-----------------------------------------------------------------
Tail of Output is as follows
-----------------------------------------------------------------
(tStamp_write) cpl model date 0001-01-01 72000s wall clock 2008-08-08 01:34:41 avg dt 3s dt 2s
(tStamp_write) cpl model date 0001-01-01 75600s wall clock 2008-08-08 01:34:43 avg dt 3s dt 2s
(tStamp_write) cpl model date 0001-01-01 79200s wall clock 2008-08-08 01:34:46 avg dt 3s dt 2s
(tStamp_write) cpl model date 0001-01-01 82800s wall clock 2008-08-08 01:34:48 avg dt 3s dt 2s
Start of time coordinated integration
(tStamp_write) cpl model date 0001-01-02 00000s wall clock 2008-08-08 01:34:57 avg dt 3s dt 9s
(cpl_control_update) cpl_control_stopEOD = .true.
Terminated
-----------------------------------------------------------------
For another test case, I had the same output/errors for 30 minutes and 24 hours. What's happening here???
Am I doing okay?