Scheduled Downtime
On Tuesday 24 October 2023 @ 5pm MT the forums will be in read only mode in preparation for the downtime. On Wednesday 25 October 2023 @ 5am MT, this website will be down for maintenance and expected to return online later in the morning.
Normal Operations
The forums are back online with normal operations. If you notice any issues or errors related to the forums, please reach out to help@ucar.edu

ERROR in ./case.submit

Jeline

New Member
Dear CESM scientists,

I ran CESM2.1.5 on the Intel classic compiler with the following case settings:

  1. ./create_newcase --compset IHistClm50BgcCrop --res f09_g17 --case $HOME/cases/testfast --mach mytest --compiler intel --run-unsupported
  2. vi env_run.xml # chenge STOP_N and STOP_OPTION to 5 and nyears ; change REST_N and REST_OPTION to 1 and month
  3. ./case.setup
  4. vi env_mach_pes.xml # change NTASKS to -1, change COST_PES and TOTALPES to 48 cause the machine only has 48 cores.
  5. ./case.setup --reset
  6. ./preview_run
  7. ./case.build
  8. ./case.submit
Then an error occurred:

ERROR: RUN FAIL: Command 'mpirun -np 48 /home/loopy/CESM/scratch/testfast/bld/cesm.exe >> cesm.log.$LID 2>&1 ' failed
See log file for details: /home/loopy/CESM/scratch/testfast/run/cesm.log.250802-143708

The log file mentioned in the error has been attached, but I didn't find any ERROR-related prompts in it. Additionally, I have also attached the modified env_run.xml and env_mach_pes.xml files. Why is this error occurring? How should I modify it? Thanks for your reply.
 

Attachments

  • cesmlog.txt
    32 KB · Views: 1
  • env_mach_pes.txt
    6.9 KB · Views: 0
  • env_run.txt
    59.1 KB · Views: 0
Solution
It seems to be a memory issue. The computer I'm using has 48 cores. After changing the number of CESM usage nodes from 48 to 24, this problem was resolved. However, a new issue occurred in ./case.submit. It still reported "ERROR: RUN FAIL: Command 'mpirun -np 48 /home/loopy/CESM/scratch/testfast/bld/cesm.exe >> cesm.log.$LID 2>&1 ' failed" as an error, but the content of the log file increased. Could you please explain why this new issue occurred?

Jeline

New Member
It seems to be a memory issue. The computer I'm using has 48 cores. After changing the number of CESM usage nodes from 48 to 24, this problem was resolved. However, a new issue occurred in ./case.submit. It still reported "ERROR: RUN FAIL: Command 'mpirun -np 48 /home/loopy/CESM/scratch/testfast/bld/cesm.exe >> cesm.log.$LID 2>&1 ' failed" as an error, but the content of the log file increased. Could you please explain why this new issue occurred?
 

Attachments

  • cesmlog250802-160949.txt
    36.5 KB · Views: 1
Vote Upvote 0 Downvote
Solution

dbailey

CSEG and Liaisons
Staff member
Looks like you might have some initial files missing?

NetCDF: Invalid dimension ID or name
NetCDF: Invalid dimension ID or name
NetCDF: Invalid dimension ID or name
NetCDF: Invalid dimension ID or name
NetCDF: Invalid dimension ID or name
NetCDF: Variable not found
NetCDF: Invalid dimension ID or name
NetCDF: Invalid dimension ID or name
NetCDF: Invalid dimension ID or name
NetCDF: Invalid dimension ID or name
NetCDF: Invalid dimension ID or name
NetCDF: Invalid dimension ID or name
NetCDF: Invalid dimension ID or name
NetCDF: Invalid dimension ID or name
NetCDF: Invalid dimension ID or name
NetCDF: Invalid dimension ID or name
NetCDF: Variable not found
 
Vote Upvote 0 Downvote

Jeline

New Member
Looks like you might have some initial files missing?

NetCDF: Invalid dimension ID or name
NetCDF: Invalid dimension ID or name
NetCDF: Invalid dimension ID or name
NetCDF: Invalid dimension ID or name
NetCDF: Invalid dimension ID or name
NetCDF: Variable not found
NetCDF: Invalid dimension ID or name
NetCDF: Invalid dimension ID or name
NetCDF: Invalid dimension ID or name
NetCDF: Invalid dimension ID or name
NetCDF: Invalid dimension ID or name
NetCDF: Invalid dimension ID or name
NetCDF: Invalid dimension ID or name
NetCDF: Invalid dimension ID or name
NetCDF: Invalid dimension ID or name
NetCDF: Invalid dimension ID or name
NetCDF: Variable not found
Thank you for your reply. How can I determine exactly which NetCDF files are missing?
 
Vote Upvote 0 Downvote
Top