I have a user (in the same sense that I have an ingrown toenail) who runs a job like this...
She starts with 'sbatch -p whatever -n 145 scriptname' The script contains something like: srun foo & srun bar What's happening is that each core is doubly allocated e.g. 16 procs running on an 8 core node. Why isn't the second srun constrained by the resources consumed by the 1st? This is running under (the very excellent) Slurm 2.2.3. Jeff Katcher FHCRC Cluster Monkey
