Translations:Using node-local storage/11/en

From Alliance Doc
Revision as of 20:58, 20 March 2023 by FuzzyBot (talk | contribs) (Importing a new version from external source)

Output data must be copied from $SLURM_TMPDIR back to some permanent storage before the job ends. If a job times out, the last few lines of the job script might not be executed. This can be addressed in three ways:

  • request enough runtime to let the application finish, although we understand that this isn't always possible;
  • write checkpoints to network storage, not to $SLURM_TMPDIR;
  • write a signal-trapping function.
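
The third option can be sketched as follows. This is a minimal illustration, not a tested recipe for any particular cluster. It assumes Slurm's `--signal` option is used to deliver SIGUSR1 to the batch shell shortly before the time limit. `RESULTS_DIR`, `save_output`, and `run_application` are hypothetical names introduced for this example, and the fallback values only exist so the script can be tried outside a job.

```shell
#!/bin/bash
#SBATCH --time=01:00:00
#SBATCH --signal=B:USR1@120   # ask Slurm to signal the batch shell about 120 s before the time limit

# Outside a Slurm job, fall back to a scratch directory (assumption, for trying the sketch by hand).
SLURM_TMPDIR="${SLURM_TMPDIR:-$(mktemp -d)}"
# RESULTS_DIR is a hypothetical destination on permanent storage.
RESULTS_DIR="${RESULTS_DIR:-$PWD/results}"

# Copy the output directory from node-local storage to permanent storage.
save_output() {
    mkdir -p "$RESULTS_DIR"
    cp -r "$SLURM_TMPDIR/output" "$RESULTS_DIR/"
}

# Hypothetical stand-in for the real application; it writes into $SLURM_TMPDIR/output.
run_application() {
    echo "done" > "$SLURM_TMPDIR/output/result.txt"
}

# If SIGUSR1 arrives, save whatever exists so far and stop.
trap 'save_output; exit 1' USR1

mkdir -p "$SLURM_TMPDIR/output"

# Run in the background and wait: the shell can only run the trap
# while it is not blocked on a foreground command.
run_application &
wait $!

# Normal completion: copy the results back.
save_output
```

The lead time of 120 seconds is arbitrary. It should be long enough for the copy to finish, and Slurm may deliver the signal somewhat earlier than requested.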