Hi Xiang,
- The values over the ocean are there so that I don't have to update the input files if the land/ocean mask ever changes (as I would if I had applied the mask). They are not used in any gridcell that has no land (and therefore no crop area). In case you were wondering, they were generated with a nearest-neighbor extrapolation.
- Yes, sorry, that information is outdated. (It still applies for anyone who turns off the new default crop calendar system, though.)
- The original crop calendar data come from the Global Gridded Crop Model Intercomparison / ISIMIP-Crop input dataset. The original dataset and more information are available here. It has input data even in ridiculous places because GGCMI/ISIMIP runs require that we simulate every crop in every gridcell. As with the data over the ocean, a crop's values won't be used if CLM has no area of that crop there.
- Yes, you would change the values only in the gridcells that have rice in your simulations. (Again, you could change the values in gridcells without rice, but that would have no effect.) You would use the surface dataset if using an 1850 or 2000 compset, but if using a Hist or SSP compset, you would want to look at the appropriate landuse timeseries file to identify gridcells that EVER have rice.
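
In case it helps, here is a rough sketch of that last point: find the gridcells that have rice in at least one year, then change the crop calendar values only there. It uses plain nested lists so it runs on its own. In practice you would read the arrays from the landuse timeseries (or surface dataset) and the crop calendar file with a NetCDF library. The function names, array layout, and numbers below are all made up for illustration and are not taken from the actual files.

```python
# Illustrative sketch only: names, array layout, and values are hypothetical.

def ever_has_crop(pct_ts):
    """Return mask[lat][lon], True where the crop's area is > 0 in any year.

    pct_ts[year][lat][lon] is the percent of the gridcell occupied by the
    crop. For a surface dataset (1850 or 2000 compset), pass a list holding
    a single year.
    """
    nlat = len(pct_ts[0])
    nlon = len(pct_ts[0][0])
    return [
        [any(year[j][i] > 0 for year in pct_ts) for i in range(nlon)]
        for j in range(nlat)
    ]


def set_where(field, mask, new_value):
    """Return a copy of field with new_value wherever mask is True."""
    return [
        [new_value if m else old for old, m in zip(row, mask_row)]
        for row, mask_row in zip(field, mask)
    ]


# Two years of made-up rice percentages on a 2x2 grid.
pct_rice = [
    [[0.0, 5.0], [0.0, 0.0]],  # year 0
    [[0.0, 0.0], [2.5, 0.0]],  # year 1
]

# Made-up sowing dates (day of year) on the same grid.
sdates = [[120, 120], [130, 130]]

rice_mask = ever_has_crop(pct_rice)
new_sdates = set_where(sdates, rice_mask, 150)
```

Here the two gridcells that have rice in either year get the new sowing date, and the others keep their original values. Changing those others too would be harmless, since they are never used.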