Translations:Using node-local storage/11/en

From Alliance Doc
Revision as of 18:29, 31 July 2020 by FuzzyBot (talk | contribs) (Importing a new version from external source)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to navigation Jump to search

Output data must be copied from $SLURM_TMPDIR back to some permanent storage before the job ends. If a job times out, then the last few lines of the job script might not be executed. This can be addressed two ways:

  • First, obviously, request enough run time to let the application finish. We understand that this isn't always possible.
  • Write checkpoints to network storage, not to $SLURM_TMPDIR.