For reference, slurm.conf: http://pastebin.com/XT6TvQhh
I've been tasked with setting up a small cluster for a research group where I work, despite knowing relatively little about HPC or clusters in general. I've installed slurm on the eight compute nodes and the login node, but, I'm having two issues currently: 1. I cannot specify a memory requirement other than --mem=1 Sample submission output with --mem=2: http://pastebin.com/5PY9N6n4 2. I cannot get nodes to execute more than one job at a time. The 9th job is always queued with reason Resources. I think this is related to the lines scontrol: Consumable Resources (CR) Node Selection plugin loaded with argument 17 scontrol: Serial Job Resource Selection plugin loaded with argument 17 scontrol: Linear node selection plugin loaded with argument 17 because it seems like slurm is only allocating whole nodes at a time. Sorry if this is basic setup, but I've tried googling to no end. -- Travis DePrato Computer Science & Engineering Math and Music Minors Student at University of Michigan Computer Consultant at EECS DCO
