Translations:Running jobs/75/en
Jump to navigation
Jump to search
When a computation is going to require a long time to complete, so long that it cannot be done within the time limits on the system, the software has to support checkpointing. The software should be able to save its complete state to a file, called a checkpoint, and then it should be able to restart and continue the computation from that saved state.