
runtime errors in clm4_5_00

Hi! I am modifying code in the clm4_5_00 tag and am getting the following runtime errors in my cesm.log file. Similar code works perfectly in earlier versions of CLM (e.g., clm4_0_60).
I double-checked my input files and they look good. Has anybody else encountered these types of errors while working on the code? I am curious what their source could be.
I would be very grateful for any advice you may think of.
ERROR: 0031-300 Forcing all remote tasks to exit due to exit code 1 in task 1
122:forrtl: error (78): process killed (SIGTERM)
122:Image PC Routine Line Source
122:libpthread.so.0 00002AE285C58245 Unknown Unknown Unknown
122:libpoe.so 00002AE28E4FBBA3 Unknown Unknown Unknown
122:INFO: 0031-306 pm_atexit: pm_exit_value is 1.
124:forrtl: error (78): process killed (SIGTERM)
124:Image PC Routine Line Source
124:libpthread.so.0 00002B870E94F245 Unknown Unknown Unknown
124:libpoe.so 00002B87171F2BA3 Unknown Unknown Unknown
124:INFO: 0031-306 pm_atexit: pm_exit_value is 1.
131:forrtl: error (78): process killed (SIGTERM)
131:Image PC Routine Line Source
131:libpthread.so.0 00002B25A68AA245 Unknown Unknown Unknown
131:libpoe.so 00002B25AF14DBA3 Unknown Unknown Unknown
131:INFO: 0031-306 pm_atexit: pm_exit_value is 1.
126:forrtl: error (78): process killed (SIGTERM)
126:Image PC Routine Line Source
126:libpthread.so.0 00002B9B1B246245 Unknown Unknown Unknown
126:libpoe.so 00002B9B23AE9BA3 Unknown Unknown Unknown
126:INFO: 0031-306 pm_atexit: pm_exit_value is 1.
121:forrtl: error (78): process killed (SIGTERM)
121:Image PC Routine Line Source
121:libpthread.so.0 00002B871AB09245 Unknown Unknown Unknown
121:libpoe.so 00002B87233ACBA3 Unknown Unknown Unknown
121:INFO: 0031-306 pm_atexit: pm_exit_value is 1.
125:forrtl: error (78): process killed (SIGTERM)
125:Image PC Routine Line Source
125:libpthread.so.0 00002B8C44BFE245 Unknown Unknown Unknown
125:libpoe.so 00002B8C4D4A1BA3 Unknown Unknown Unknown
125:INFO: 0031-306 pm_atexit: pm_exit_value is 1.
Thanks,
Rajendra
 

slevis

Moderator
Staff member
Did you check whether there were error messages in the other log files (land, atm, etc.) and in the cesm.stderr and stdout files? If none of these contains another error, we'll need to see whether a software engineer can help with the error you're getting.
Sam
 

jedwards

CSEG and Liaisons
Staff member
These errors are something the system outputs any time a run exits abnormally. Please look further up in this file and/or at the other log files to find the actual error message.
If you post your case and run directories we might be able to help.
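As a rough illustration of the advice above, here is a minimal Python sketch that skips the generic teardown lines and reports the first lines that look like real errors. The noise patterns are taken only from the log excerpt in this thread, and the function name `first_real_errors` and the keyword list are assumptions made for illustration; other systems will print different teardown messages.

```python
import re
import sys

# Teardown lines seen in the excerpt above; treated here as noise (an assumption).
NOISE_PATTERNS = [
    r"forrtl: error \(78\): process killed \(SIGTERM\)",
    r"Image\s+PC\s+Routine\s+Line\s+Source",
    r"Unknown\s+Unknown\s+Unknown",
    r"INFO: 0031-306 pm_atexit",
    r"ERROR: 0031-300 Forcing all remote tasks to exit",
]
NOISE_RE = re.compile("|".join(NOISE_PATTERNS))

# Hypothetical keyword list for lines worth reading; extend as needed.
ERROR_RE = re.compile(r"error|abort|pio_die", re.IGNORECASE)


def first_real_errors(lines, limit=5):
    """Return up to `limit` (line_number, text) pairs that look like
    errors and are not part of the generic teardown output."""
    hits = []
    for number, line in enumerate(lines, start=1):
        if NOISE_RE.search(line):
            continue  # skip SIGTERM tracebacks and pm_atexit messages
        if ERROR_RE.search(line):
            hits.append((number, line.rstrip()))
            if len(hits) == limit:
                break
    return hits


if __name__ == "__main__" and len(sys.argv) > 1:
    # Usage: python find_errors.py cesm.log.<timestamp>
    with open(sys.argv[1], errors="replace") as handle:
        for number, text in first_real_errors(handle):
            print(f"{number}: {text}")
```

Running it on each log file in the run directory in turn (cesm, lnd, atm, and so on) should point to the earliest suspicious line, which is usually closer to the cause than the SIGTERM messages at the end.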
 
Hi Jim and Sam,
Thanks for the swift response. Below are the case and run directories:
Case: /glade/u/home/rpaudel/constant_work/clm4500_const/scripts/f19_g16_I1850CLM45CN4Me_clm4500_bfb_const_01
Run: /glade/scratch/rpaudel/f19_g16_I1850CLM45CN4Me_clm4500_bfb_const_01
I made the same code modification in clm4_0_60, and it works there.
Rajendra
 

jedwards

CSEG and Liaisons
Staff member
The following in your cesm.log file indicates that you are trying to write an invalid value (NaN or INF) to the clm history file:
   1: ./f19_g16_I1850CLM45CN4Me_clm4500_bfb_const_01.clm2.h0.0001-01-01-00000.nc
   1:  to write      131072
   1: pio_support::pio_die:: myrank=          -1 : ERROR:
   1: pionfwrite_mod::write_nfdarray_double:         234 :
   1: NetCDF: Numeric conversion not representable
I can't tell for sure which variable, but perhaps it's ACTUAL_IMMOB.
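To narrow down which variable holds the invalid value, one approach is to test each candidate field for NaN or infinite entries before it is written. The sketch below shows the idea in Python on plain lists; the function name `find_nonfinite` and the sample data are hypothetical, and in the model itself the equivalent check would have to be added to the modified Fortran code.

```python
import math


def find_nonfinite(fields):
    """Map each field name to the indices of its NaN or infinite values.

    `fields` is a dict of name -> sequence of floats. Fields whose
    values are all finite are left out of the result.
    """
    bad = {}
    for name, values in fields.items():
        indices = [i for i, v in enumerate(values) if not math.isfinite(v)]
        if indices:
            bad[name] = indices
    return bad


# Made-up sample values, for illustration only.
sample_fields = {
    "ACTUAL_IMMOB": [0.0, float("nan"), 1.5, float("inf")],
    "OTHER_FIELD": [1.0, 2.0],
}
for name, indices in find_nonfinite(sample_fields).items():
    print(f"{name}: non-finite values at indices {indices}")
```

Knowing the first field and index where a non-finite value appears makes it easier to trace back to the calculation that produced it.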
 

mai

Member
Perhaps your new code is producing NaNs? Did you notice these errors in your cesm.log.130507-133146 file:
   1: Opened file
   1: ./f19_g16_I1850CLM45CN4Me_clm4500_bfb_const_01.clm2.h0.0001-01-01-00000.nc
   1:  to write      131072
   1: pio_support::pio_die:: myrank=          -1 : ERROR:
   1: pionfwrite_mod::write_nfdarray_double:         234 :
   1: NetCDF: Numeric conversion not representable

 

slevis

Moderator
Staff member
Thank you, Jim and Andy.
Dear Rajendra, please be aware that our CESM support generally does not extend beyond the default model. Users who modify code or datasets may ask questions in this forum but should generally work on problem-solving on their own rather than expecting others to do it. In your case, for example, we ask that you look for the errors caused by your code changes yourself. And of course you are responsible for your own debugging.
Again, if you feel stumped at times, this is the appropriate forum to ask for help, but I needed to clarify the extent of the CESM support that we offer.
Thank you for your understanding,
Sam
 
Could you tell me how you solved this problem? I am getting an error similar to yours:
  pio_support::pio_die:: myrank=          -1 : ERROR:
  pionfwrite_mod::write_nfdarray_double:         234 :
  NetCDF: Numeric conversion not representable
Thank you!
 