Scheduled Downtime
On Tuesday 24 October 2023 @ 5pm MT the forums will be in read only mode in preparation for the downtime. On Wednesday 25 October 2023 @ 5am MT, this website will be down for maintenance and expected to return online later in the morning.
Normal Operations
The forums are back online with normal operations. If you notice any issues or errors related to the forums, please reach out to help@ucar.edu

Using a restart file

Greetings, I successfully built and ran a CESM version 1.2.0 B_2000_WACCM_CN model for one year, archived monthly. I also would like to make two other runs for 120 hours each, archived hourly. One of these will be run using a restart file, while the other one won't be. Will I need to change the CONTINUE_RUN and RESUBMIT variables for the env_run.xml file in order to use the restart file correctly? What would I need to change to create the run without the restart file? Thank you. 
 

santos

Member
CONTINUE_RUN is used to signify that a case has already been started, and you want to continue it. If you have a brand new case and you want it to start from the restart from a different case, you should set the RUN_TYPE to "branch", and set RUN_REFCASE and other "RUN_REF" variables appropriately. You may also have to move or link the restart files to your run directory by hand, if they are from your own run and not our inputdata.The default values of STOP_OPTION and STOP_N for a new case will give you a 5 day run, which is 120 hours.Assuming that you are primarily interested in CAM's output, the namelist options "nhtfreq" and "mfilt" are the main controls for the output frequency:http://www.cesm.ucar.edu/models/cesm1.2/cesm/doc/modelnl/nl_cam.html
 
When I have a job running on the background of our supported machine, it runs and produces the output properly until I logoff and check the output the following day. Even when I use the 'top' function to check for cesm.exe, it will not show up, making me feel the job is not running properly. What should I do in order to check for this? 
 

santos

Member
Is there an error in any log, or does the model just stop? What script/command are you using to run the model?
 

santos

Member
This is not related to CESM. It's a normal attribute of UNIX systems to send a signal to close processes started by a terminal in this case:http://en.wikipedia.org/wiki/SIGHUP#SIGHUPThe CESM scripts are designed to work with a machine's batch system. If you have a supported machine, you should run the "submit" script to submit a job for you, rather than running the "run" script directly.If you use the run script from a terminal, you'll have to either leave the terminal window open or find a way to "daemonize" CESM. Bash has a "disown" command for this.
 
Whenever I call the $CASE.submit script I receive an error message which states USERDEFINED_optional_run: Command not found; ERROR: cesm_submit problem sourcing templar. I'm not sure what this means.
 

santos

Member
This means that the port to your machine is incomplete. Assuming that you are running this on a cluster with a batch control system, the BATCHSUBMIT variable needs to be set to use the batch system on that machine. Otherwise, you will have to find your own method to run the model in a way appropriate for that machine.
 
When I'm prestaging the restart data into my new case's run directory, the hsi command on p. 58 of the User's Manual is unrecognizable when I type it in my system. Can I just manually move my files to the new run directory? 
 

santos

Member
Yes, you can just copy it by hand. The hsi command is just an example of what you would do to get data from NCAR 's HPSS.
 

hannay

Cecile Hannay
AMWG Liaison
Staff member
You can use $CASE.run to submit your job to the batch queue.The command to use will depend on the computer:for a Cray computer, you would use:qsub $CASE.run
 
I successfully ran a case for one year archived monthly, and had the RESUBMIT value set to 9 (so the model could continue this startup run for another 10 years). How come I received this message below once the run finished? Tue Nov  5 11:11:04 CST 2013 -- CSM EXECUTION BEGINS HEREFri Nov 15 21:48:48 CST 2013 -- CSM EXECUTION HAS FINISHED(seq_mct_drv): ===============       SUCCESSFUL TERMINATION OF CPL7-CCSM ===============cp: No match.RESUBMIT is now 9USERDEFINED_optional_run: Command not found. ccsm_postrun error: problem sourcing tempres
 
I also created a new case for a hybrid run, using a startup case from a previous run as a reference. When I tried to run that case, I get this error message: ------------------------------------------------------------------------- CESM PRESTAGE SCRIPT STARTING - Case input data directory, DIN_LOC_ROOT, is /tng4/users/ldiao/CESM/data - Checking the existence of input datasets in DIN_LOC_ROOT The following files were not found, this is informational onlyInput Data List Files Found:/tng1/users/jrodrigu/CESM/cesm1_2_0/scripts/11_18_hybrid/Buildconf/cam.input_data_list/tng1/users/jrodrigu/CESM/cesm1_2_0/scripts/11_18_hybrid/Buildconf/cpl.input_data_list/tng1/users/jrodrigu/CESM/cesm1_2_0/scripts/11_18_hybrid/Buildconf/pop2.input_data_list/tng1/users/jrodrigu/CESM/cesm1_2_0/scripts/11_18_hybrid/Buildconf/cice.input_data_list/tng1/users/jrodrigu/CESM/cesm1_2_0/scripts/11_18_hybrid/Buildconf/rtm.input_data_list/tng1/users/jrodrigu/CESM/cesm1_2_0/scripts/11_18_hybrid/Buildconf/clm.input_data_listFile status unknown: B_2000_WACCM_CN_11_05_test_on_slave_node_n11.cam.i.0003-01-01-00000.nc File status unknown: B_2000_WACCM_CN_11_05_test_on_slave_node_n11.rtm.r.0003-01-01-00000.nc File status unknown: B_2000_WACCM_CN_11_05_test_on_slave_node_n11.clm2.r.0003-01-01-00000.nc   CESM PRESTAGE SCRIPT HAS FINISHED SUCCESSFULLY-------------------------------------------------------------------------Mon Nov 18 12:03:02 CST 2013 -- CSM EXECUTION BEGINS HEREMon Nov 18 12:06:34 CST 2013 -- CSM EXECUTION HAS FINISHEDModel did not complete - see /tng1/users/jrodrigu/CESM/cesm1_2_0/exe_11_18/run/cesm.log.131118-120233How come when I prestage those restart files that the model claims to be an "unkown file status" I get this error message? Attached is the cesm.log.131118-120233.
 

jedwards

CSEG and Liaisons
Staff member

Code:
PGFIO-F-209/OPEN/unit=93/'OLD' specified for file which does not exist.
 File name = rpointer.ocn.ovf
 
When I set up a new case for a branch run, I received this error message: ------------------------------------------------------------------------- CESM BUILDNML SCRIPT STARTING - To prestage restarts, untar a restart.tar file into /tng1/users/jrodrigu/CESM/cesm1_2_0/exe_11_25_a/run infile is /tng1/users/jrodrigu/CESM/cesm1_2_0/scripts/11_25_a/Buildconf/cplconf/cesm_namelist CAM writing dry deposition namelist to drv_flds_in CAM writing namelist to atm_in CLM configure done.CLM adding use_case 2000_control defaults for var sim_year with val 2000 CLM adding use_case 2000_control defaults for var sim_year_range with val constant CLM adding use_case 2000_control defaults for var stream_year_first_ndep with val 2000 CLM adding use_case 2000_control defaults for var stream_year_last_ndep with val 2000 CLM adding use_case 2000_control defaults for var use_case_desc with val Conditions to simulate 2000 land-use CICE configure done.POP2 build-namelist: ocn_grid is gx1v6 POP2 build-namelist: ocn_tracer_modules are  iage ERROR(Build::Namelist::_parse_next): expect a variable instead got:  11_25.rtm.r.0001-01-03-00000.nc ERROR: rtm.buildnml.csh failedERROR from preview namelist - EXITING  I don't understand how the river runoff component for this case inhibits the model's ability to run. Would I need to edit a specific file in the rtm? Thank you. 
 
If I have a year's worth of data archived monthly and I want to create a plot using NCL to make a year's worth of that data, can I use a function to merge the monthly archived data into one file, so I can upload it to NCL and mark the months onto one axis? 
 

hannay

Cecile Hannay
AMWG Liaison
Staff member
You can concatenate the files using the nco operator.
ncrcat allows to concatenate files along the record dimension, and store the results in a new file that you can open with ncl.

For instance,
 
Last edited:

sjbourguet

Stephen Bourguet
New Member
Hi Cecile,

I am currently having what I think is the same issue as what is being described here. I've tried doing a CESM1.2 branch run, but it fails while initializing RTM. My cesm.log file ends with:

(shr_sys_abort) ERROR: GETFIL: FAILED to get .nc
(shr_sys_abort) WARNING: calling shr_mpi_abort() and stopping

and my rof.log file contains:

Read in rtm_inparm namelist from: rof_in
define run:
run type = branch
RTM :: flooding is off
RTM :: use default effective velocity (4.0)
RTM :: water isotope runoff is on
RTM river data =
/n/holylfs/INTERNAL_REPOS/CLIMATE_MODELS/cesm_2_1/inputdata/lnd/clm2/rtmdata/rdirc_0.5x0.5_simyr2000_slpmxvl_c120717.nc
(GETFIL): attempting to find local file .nc
(GETFIL): failed getting file from full path:
.nc

I assume this is an issue with my rof_in file, which does not contain the nrevsn_rtm file that should be branched from (which is the same file contained in my rpointer.rof file). I've played around with the rtm.buildnml.csh file to try to create the correct rof_in file, but nothing seems to work. I know this topic is from over 6 years ago, but any help you could offer would be greatly appreciated.

Thanks!
 
Top