Known issues: Difference between revisions

From Alliance Doc
Jump to navigation Jump to search
No edit summary
No edit summary
Line 31: Line 31:
* <span style="color: red; text-decoration: line-through;">Intel compilers are not producing executables [[User:Mboisson|Maxime Boissonneault]] ([[User talk:Mboisson|talk]]) 12:53, 15 December 2017 (UTC)
* <span style="color: red; text-decoration: line-through;">Intel compilers are not producing executables [[User:Mboisson|Maxime Boissonneault]] ([[User talk:Mboisson|talk]]) 12:53, 15 December 2017 (UTC)
** Fixed as of [[User:Mboisson|Maxime Boissonneault]] ([[User talk:Mboisson|talk]]) 14:48, 15 December 2017 (UTC)</span>
** Fixed as of [[User:Mboisson|Maxime Boissonneault]] ([[User talk:Mboisson|talk]]) 14:48, 15 December 2017 (UTC)</span>
* <tt>$SLURM_TMPDIR</tt> directory is not being created [[User:Mboisson|Maxime Boissonneault]] ([[User talk:Mboisson|talk]]) 12:53, 15 December 2017 (UTC)
* <span style="color: red; text-decoration: line-through;"><tt>$SLURM_TMPDIR</tt> directory is not being created [[User:Mboisson|Maxime Boissonneault]] ([[User talk:Mboisson|talk]]) 12:53, 15 December 2017 (UTC)
* Slurm database has been repaired and jobs are being accepted with all correct accounts. 21:02, 14 December 2017 (UTC)
** Fixed as of [[User:Mboisson|Maxime Boissonneault]] ([[User talk:Mboisson|talk]]) 20:34, 16 December 2017 (UTC)</span>
** <span style="text-decoration: line-through;">Slurm commands (sbatch, salloc, etc) are not working while we restart the database to repair the accounts problem. Expected downtime is about an hour. 17:45, 14 December 2017 (UTC) </span>
** <span style="text-decoration: line-through;">All non-default Slurm accounts are broken; fixing this may require another outage. </span>
* <span style="color: red; text-decoration: line-through;"> /home is on an NFS appliance that does not support ACLs, so setfacl/getfacl doesn't work there. </span> (FIXED as of 2017-12-13)
* <span style="color: red; text-decoration: line-through;"> /home is on an NFS appliance that does not support ACLs, so setfacl/getfacl doesn't work there. </span> (FIXED as of 2017-12-13)
* <span style="color: red; text-decoration: line-through;"> diskusage_report (and alias 'quota') do not report on Graham /home </span>  (FIXED as of 2017-11-27)
* <span style="color: red; text-decoration: line-through;"> diskusage_report (and alias 'quota') do not report on Graham /home </span>  (FIXED as of 2017-11-27)

Revision as of 20:34, 16 December 2017

Other languages:

Report an issue

Shared issues

Scheduler issues

  • Interactive jobs started via salloc do not support X11 forwarding. (reported 13 December 2017 18:37 UTC) (FIXED as of 2017-12-13)
  • The CC Slurm configuration encourages whole-node jobs. When appropriate, users should request whole-node rather than per-core resources. See Job Scheduling - Whole Node Scheduling.
  • By default, the job receives environment settings from the submitting shell. This can lead to irreproducible results if it's not what you expect. To force the job to run with a fresh-like login environment, you can submit with --export=none or add #SBATCH --export=NONE to your job script.

Quota and filesystem problems

Quota errors on /project filesystem

Nearline

Missing symbolic links to project folders

Cedar only

Nothing to report at this time.

Graham only

  • There is an issue with the /project filesystem Maxime Boissonneault (talk) 14:11, 15 December 2017 (UTC)
  • Intel compilers are not producing executables Maxime Boissonneault (talk) 12:53, 15 December 2017 (UTC)
  • $SLURM_TMPDIR directory is not being created Maxime Boissonneault (talk) 12:53, 15 December 2017 (UTC)
  • /home is on an NFS appliance that does not support ACLs, so setfacl/getfacl doesn't work there. (FIXED as of 2017-12-13)
  • diskusage_report (and alias 'quota') do not report on Graham /home (FIXED as of 2017-11-27)
  • Compute nodes cannot access Internet
    • Solution: Contact technical support to request exceptions to be made; describe what you need to access and why.
  • Crontab is not offered on Graham.

Other issues

  1. Modules don't work for shells other than bash(sh) and tcsh.
    • Workaround: (this appears to work but not tested extensively)
      • source $LMOD_PKG/init/zsh
      • source $LMOD_PKG/init/ksh