Hi all, I am a bit out of my depth here and apologies if this is a very trivial problem. Slurm rejects jobs due to insufficient resources (sbatch: error: Batch job submission failed: Requested node configuration is not available), when the partition should definitely accept the following job
#!/bin/bash --login #SBATCH -n 8 #SBATCH -N 1 #SBATCH -o d66a.out #SBATCH -e d66a.err #SBATCH -p GTX #SBATCH --gres=gpu:1 #SBATCH --time 8:00:00 # Switch to current working directory scontrol show partition GTX gives: PartitionName=GTX AllowGroups=ALL AllowAccounts=ALL AllowQos=ALL AllocNodes=ALL Default=NO DefaultTime=NONE DisableRootJobs=NO GraceTime=0 Hidden=NO MaxNodes=UNLIMITED MaxTime=2-00:00:00 MinNodes=1 LLN=NO MaxCPUsPerNode=UNLIMITED Nodes=node0[10-13] Priority=1 RootOnly=NO ReqResv=NO Shared=NO PreemptMode=OFF State=UP TotalCPUs=128 TotalNodes=4 SelectTypeParameters=N/A DefMemPerNode=UNLIMITED MaxMemPerNode=UNLIMITED I have a similar partition configured in the same way, where the job is happily running. Am I missing something essential in the way the partition is configured? Any thoughts? And thanks for any help. Best, Antonia -- Dr Antonia Mey University of Edinburgh Department of Chemistry Joseph Black Building Edinburgh EH9 3FJ Tel: +44 1316507748 Email: [email protected]
signature.asc
Description: Message signed with OpenPGP using GPGMail
