Hi Magnus,

We had the same challenge some time ago. A long description of solutions is in my Wiki page at https://wiki.fysik.dtu.dk/Niflheim_system/Slurm_configuration/#temporary-job-directories

The issue may have been solved in https://bugs.schedmd.com/show_bug.cgi?id=12567 which will be in Slurm 23.02.

At this time, the auto_tmpdir SPANK plugin seems to be the best solution.

IHTH,
Ole

On 1/12/23 08:49, Hagdorn, Magnus Karl Moritz wrote:
Hi there,
we excitedly found the job_container/tmpfs plugin which neatly allows
us to provide local scratch space and a way of ensuring that /dev/shm
gets cleaned up after a job finishes. Unfortunately we found that it
does not play nicely with autofs which we use to provide networked
project and scratch directories. We found that this is a known issue
[1]. I was wondering if that has been solved? I think it would be
really useful to have a warning about this issue in the documentation
for the job_container/tmpfs plugin.
Regards
magnus

[1]
https://cernvm-forum.cern.ch/t/intermittent-client-failures-too-many-levels-of-symbolic-links/156/4

--
Ole Holm Nielsen
PhD, Senior HPC Officer
Department of Physics, Technical University of Denmark

Reply via email to