Known issues

== Scheduler issues == <!--T:6-->
* The CC Slurm configuration encourages whole-node jobs. When appropriate, users should request whole-node rather than per-core resources; see [[Job_scheduling_policies#Whole_nodes_versus_cores|Job Scheduling - Whole Node Scheduling]] and the first example script after this list.
* CPU and GPU backfill partitions have been created on the Cedar and Graham clusters. A job submitted with a requested runtime of less than 24 hours is automatically entered into the cluster-wide backfill partition. This partition has low priority, but allows increased utilization of the cluster by serial jobs (see the second example script after this list).
* Slurm epilog does not fully clean up processes from ended jobs, especially if the job did not exit normally.
** This has been greatly improved since the addition of the epilog.clean script, but nodes are still occasionally marked down for epilog failure. A sketch of this kind of cleanup appears after this list.
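
A minimal sketch of a whole-node request, assuming a node with 32 cores; the core count, runtime, account name, and application name below are placeholders and vary by cluster and job:

<pre>
#!/bin/bash
#SBATCH --nodes=1                # request one whole node rather than individual cores
#SBATCH --ntasks-per-node=32     # placeholder: match the core count of the target node
#SBATCH --mem=0                  # request all of the memory on the node
#SBATCH --time=03:00:00          # placeholder runtime
#SBATCH --account=def-someuser   # placeholder account name

srun ./my_application            # my_application is a placeholder
</pre>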
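A job qualifies for the backfill partitions simply by requesting a runtime under 24 hours; placement is automatic, so no extra directives are needed. The resource values below are placeholders:

<pre>
#!/bin/bash
#SBATCH --time=23:00:00          # under 24 hours, so the job is eligible for backfill
#SBATCH --ntasks=1               # a serial job
#SBATCH --mem=4G                 # placeholder memory request
#SBATCH --account=def-someuser   # placeholder account name

./serial_task                    # serial_task is a placeholder
</pre>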
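For illustration only, a minimal sketch of the kind of cleanup an epilog script performs; this is an assumption about the general approach, not the actual contents of epilog.clean. SLURM_JOB_USER is set in Slurm's epilog environment:

<pre>
#!/bin/bash
# Epilog sketch: kill any processes the job's user left behind on this node.
# A real script must first check that the user has no other jobs still
# running on this node (not shown here).
if [ -n "$SLURM_JOB_USER" ] && [ "$SLURM_JOB_USER" != "root" ]; then
    pkill -9 -u "$SLURM_JOB_USER"
fi
exit 0
</pre>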