OptClim‐UKESM problems - optclim/ModelOptimisation GitHub Wiki

It suggested this page develop as problems arise and are recognised.

Diagnostics

Check what is running on Archer2

squeue -u $USER -l

Check what is running on PUMA

cylc gscan &

right click on suites showing an error - red square - to open that suite in gcylc.
ungroup the tasks (RUN_MAIN and HPC_SERIAL usually)
right click on  a task that failed to see the log.err and log.out

if nothing is running, has it finished?

  • all suites concluded, no pending SLURM job
  • study json with _final in path name and with costd with in it.
  • last jobOutput/RE file concludes with statement about reaching: ** its target ** maximum number of models allowed to be run (maxfun in the suite's json)

was there a problem with environment or the suite's json?

On ARCHER2

  • was ". ~/setup_optclim.sh" run and the expected python environment set in the PATH ("which python")

  • Did runAlgorthm.py work?

  • check trace from initial run the latest suitedir/jobOutput/RE... file

  • check the expected interface directories exist with *** runParams.txt set up? *** observables,json hold ing simulate dobservables ( Are there error messages in the jobOutput/PP... files?)

  • check the latest suitedir/jobOutput/RE... file