Lingering restart files can cause rerun cases to fail - TerrenceMcGuinness-NOAA/global-workflow GitHub Wiki
Looking at the JGLOBAL_FORECAST script, I can see how the paths are constructed, but you're right - we need to understand what specific case/run is causing the issue and where exactly it's looking for and finding these problematic restart files.
Let's analyze how the paths are determined:
- For non-ensemble runs:
  export DATAjob="${DATAROOT}/${RUN}fcst.${PDY:-}${cyc}"
- For ensemble members:
  export DATAjob="${DATAROOT}/${RUN}efcs${ENSMEM}.${PDY:-}${cyc}"
- Then the restart directory is set as:
  export DATArestart="${DATAjob}/restart"
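To make the path construction concrete, here is a minimal sketch that assembles the directories using the same patterns. All values assigned at the top, including the `ENSMEM` label and the `/tmp` root, are placeholders chosen for illustration. The `_det` and `_ens` variable names are likewise hypothetical and only keep the two cases apart.

```bash
#!/usr/bin/env bash
# Placeholder values for illustration only; they are assumptions, not
# settings read from a real experiment.
DATAROOT="/tmp/RUNDIRS/example"
RUN="gefs"
PDY="20210323"
cyc="12"
ENSMEM="001"   # hypothetical member label

# Non-ensemble pattern.
DATAjob_det="${DATAROOT}/${RUN}fcst.${PDY:-}${cyc}"
# Ensemble-member pattern.
DATAjob_ens="${DATAROOT}/${RUN}efcs${ENSMEM}.${PDY:-}${cyc}"

# The restart directory is a subdirectory of the job directory.
DATArestart="${DATAjob_det}/restart"

echo "${DATAjob_det}"
echo "${DATAjob_ens}"
echo "${DATArestart}"
```

Nothing is created on disk here; the script only prints the composed paths.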
From the error log, we can extract some information to identify the specific case:
- Looking at the paths in the error log, I can see:
  /gpfs/f6/drsa-precip3/scratch/role.glopara/RUNDIRS/C48_S2SWA_gefs_28431ae/gefs.2021032312/gefsfcst.2021032312
- Breaking this down:
  - DATAROOT appears to be /gpfs/f6/drsa-precip3/scratch/role.glopara/RUNDIRS/C48_S2SWA_gefs_28431ae/gefs.2021032312
  - RUN appears to be gefs
  - PDY is 20210323
  - cyc is 12
- The job is finding restart files with timestamp 20210325.120000 in:
  /gpfs/f6/drsa-precip3/scratch/role.glopara/RUNDIRS/C48_S2SWA_gefs_28431ae/gefs.2021032312/gefsfcst.2021032312/restart/FV3_RESTART
These restart files are stamped two days later than the current cycle date (March 25 vs. March 23). Because they are already present in the restart directory when the job starts, the job tries to use them for a warm start, even though they shouldn't be used.
This means we're looking at a GEFS (Global Ensemble Forecast System) run for March 23, 2021, 12Z, but the restart directory contains files from a future date (March 25, 2021), which is causing the issue.
The configuration name C48_S2SWA_gefs_28431ae suggests this is a low-resolution (C48) Subseasonal-to-Seasonal (S2S) configuration with wave and aerosol components (S2SWA) for GEFS. The trailing 28431ae is most likely a commit ID or branch identifier.
To summarize:
- The run is a GEFS forecast for 2021-03-23 12Z
- It's using the C48_S2SWA_gefs configuration
- The restart directory contains files from 2021-03-25 12Z
- The code is finding these restart files and trying to warm start from them
- This is causing the forecast to fail
The cleanup code you've added to JGLOBAL_FORECAST should prevent this by removing any existing restart directories at the start of the run, ensuring that old restart files won't be detected and incorrectly used.
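For reference, a cleanup step of this kind could look roughly like the following. This is a sketch of the idea only, not the actual change made to JGLOBAL_FORECAST. It runs against a throwaway directory, and the leftover file name is hypothetical.

```bash
#!/usr/bin/env bash
set -eu

# Throwaway directory standing in for ${DATAjob}; nothing real is touched.
DATAjob="$(mktemp -d)/gefsfcst.2021032312"
DATArestart="${DATAjob}/restart"

# Simulate leftovers from an earlier attempt (hypothetical file name).
mkdir -p "${DATArestart}/FV3_RESTART"
touch "${DATArestart}/FV3_RESTART/20210325.120000.coupler.res"

# Remove any leftover restart directory, then recreate it empty.
if [ -d "${DATArestart}" ]; then
  echo "Removing lingering restart directory: ${DATArestart}"
  rm -rf "${DATArestart}"
fi
mkdir -p "${DATArestart}"
```

Running with `set -u` guards against an unset `DATArestart` expanding to an empty string inside `rm -rf`. A blanket removal like this would also discard restart files that a deliberate restart of an interrupted forecast might need, so a real implementation may have to be more selective.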