
NetCDF: Numeric conversion not representable

Hello, after switching to mpt/2.19 and updating pio/, I can run the model (cesm1.1.1_ccmi34). However, it crashes right away with the following errors:
49: NetCDF: Numeric conversion not representable
29: NetCDF: Numeric conversion not representable
33: NetCDF: Numeric conversion not representable
41: NetCDF: Numeric conversion not representable
45: NetCDF: Numeric conversion not representable
17: pio_support::pio_die:: myrank=          -1 : ERROR:
17: pionfwrite_mod::write_nfdarray_double:         250 :
17: NetCDF: Numeric conversion not representable
57: NetCDF: Numeric conversion not representable
9: pio_support::pio_die:: myrank=          -1 : ERROR:
9: pionfwrite_mod::write_nfdarray_double:         250 :
9: NetCDF: Numeric conversion not representable
61: NetCDF: Numeric conversion not representable
237:Image              PC                Routine            Line        Source
237:cesm.exe           0000000001B79AAD  Unknown               Unknown  Unknown
237:cesm.exe           0000000001548D61  pio_support_mp_pi         118  pio_support.F90
237:cesm.exe           000000000154041E  pio_utils_mp_chec          74  pio_utils.F90
237:cesm.exe           0000000001657563  pionfwrite_mod_mp         250  pionfwrite_mod.F90.in
237:cesm.exe           000000000162409C  piodarray_mp_writ         650  piodarray.F90.in
237:cesm.exe           000000000162213C  piodarray_mp_writ         223  piodarray.F90.in
237:cesm.exe           00000000016272B3  piodarray_mp_writ         293  piodarray.F90.in
237:cesm.exe           00000000004B397C  cam_history_mp_du        4352  cam_history.F90
237:cesm.exe           00000000004950D0  cam_history_mp_ws        4669  cam_history.F90
237:cesm.exe           00000000004896FC  cam_comp_mp_cam_r         379  cam_comp.F90
237:cesm.exe           000000000047A4AE  atm_comp_mct_mp_a         541  atm_comp_mct.F90
237:cesm.exe           000000000040A79A  ccsm_comp_mod_mp_        3485  ccsm_comp_mod.F90
237:cesm.exe           0000000000425A98  MAIN__                     91  ccsm_driver.F90
237:cesm.exe           0000000000407F1E  Unknown               Unknown  Unknown
237:libc-2.19.so       00002AAAAC60EB25  __libc_start_main     Unknown  Unknown
237:cesm.exe           0000000000407E29  Unknown               Unknown  Unknown
237:MPT ERROR: Rank 237(g:237) is aborting with error code 1.
237:    Process ID: 31314, Host: r1i6n24, Program: /gpfs/fs1/scratch/yunqian/Kelut_addhetnuc_16_21km_log0.5um1.5um20um_Gaussian_1.5Tg/bld/cesm.exe
237:    MPT Version: HPE MPT 2.19  12/07/18 05:31:15

I also tried running the model in DEBUG mode, and it shows the following:

1:MPT ERROR: Rank 1(g:1) received signal SIGFPE(8).
1:MPT: #13 0x0000000001905e1e in pionfwrite_mod::write_nfdarray_double (file=...,
1:MPT:     iobuf=..., vardesc=..., iodesc=...,
1:MPT:     start=,
1:MPT:     count=..., request=0)

There are more messages in this file: /gpfs/fs1/scratch/yunqian/Kelut_addhetnuc_16_21km_log0.5um1.5um20um_Gaussian_1.5Tg/run/cesm.log.190320-183710

Could you help me with this?

Thanks,
 

jedwards

CSEG and Liaisons
Staff member
This indicates that you are trying to write bad values to the output file. You should also use the modules netcdf-mpi/4.6.1 and ncarcompilers/0.4.1, and try changing PIO_TYPENAME in env_run.xml to pnetcdf.
 
Thank you very much. To clarify: in env_mach_specific, I have:

module load intel/17.0.1
module load ncarenv/1.0
module load mkl
module load ncarcompilers/0.4.1
module load mpt/2.19
if($MPILIB== "mpi-serial" ) then
  module load netcdf/4.6.1
else
  module load netcdf-mpi/4.6.1
  module load pnetcdf/1.8.0
endif

In env_run.xml, I have:
But the model crashes right away with a different error, which says that the model variables it read in are NaN.
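
For readers who reach this thread with the same message: "Numeric conversion not representable" is netCDF's range error, which fits the diagnosis above that bad values are being written. The following is a minimal Python sketch, not part of CESM or PIO, of one way to scan a list of double-precision values for entries that cannot be stored in single precision. It assumes the output variable is written as a 32-bit float, which the thread does not state. The function name `find_bad_values` and the sample data are hypothetical.

```python
import math
import struct

# Largest finite single-precision value (bit pattern 0x7f7fffff).
FLOAT32_MAX = struct.unpack("<f", bytes.fromhex("ffff7f7f"))[0]


def find_bad_values(values):
    """Return (index, value, reason) for entries that cannot be stored as float32.

    Assumption: the on-disk variable type is single precision.
    """
    bad = []
    for i, v in enumerate(values):
        if math.isnan(v):
            bad.append((i, v, "NaN"))
        elif math.isinf(v):
            bad.append((i, v, "infinite"))
        elif abs(v) > FLOAT32_MAX:
            bad.append((i, v, "exceeds float32 range"))
    return bad


# Hypothetical field values, for illustration only.
sample = [1.0, float("nan"), 2.5e-3, 1.0e300, float("-inf")]
for index, value, reason in find_bad_values(sample):
    print(index, value, reason)
```

A check like this only locates the bad entries. Because the follow-up error reports NaN in the variables that were read in, the input or initial-condition data may be the place to look first.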
 