Hi folks,

We're just about to let users back onto our systems after RHEL 6.6
upgrades and moving from Slurm 2.6.x to 14.03.10.

However, running NAMD with Open-MPI 1.6.x and mpirun leads to this
error at the end of the output (which appears totally cosmetic).

[...]
The last velocity output (seq=-2) takes 0.029 seconds, 980.234 MB of memory in 
use
====================================================

WallClock: 117.003998  CPUTime: 117.003998  Memory: 980.234375 MB
End of program
slurmstepd: _slurm_cgroup_destroy: problem deleting step cgroup path 
/cgroup/freezer/slurm/uid_500/job_2497190/step_batch: Device or resource busy


Now I've checked the cgroup release agent config and it's all set
up correctly looking at:

http://slurm.schedmd.com/cgroups.html#cleanup

Anyone got any ideas?

PS: No I can't use srun directly as we get poor scaling, the next
thing in the list (after SC14) is to migrate to Open-MPI 1.8.4 which
is due out shortly which should address this.

cheers,
Chris
-- 
 Christopher Samuel        Senior Systems Administrator
 VLSCI - Victorian Life Sciences Computation Initiative
 Email: [email protected] Phone: +61 (0)3 903 55545
 http://www.vlsci.org.au/      http://twitter.com/vlsci

Reply via email to