[slurm-dev] jobs rejected for no reason

Antonia Mey Thu, 18 Aug 2016 07:56:24 -0700

Hi all,

I am a bit out of my depth here and apologies if this is a very trivial 
problem. Slurm rejects jobs due to insufficient resources (sbatch: error: Batch 
job submission failed: Requested node configuration is not available), when the 
partition should definitely accept the following job


#!/bin/bash --login
#SBATCH -n 8
#SBATCH -N 1
#SBATCH -o d66a.out
#SBATCH -e d66a.err
#SBATCH -p GTX
#SBATCH --gres=gpu:1
#SBATCH --time 8:00:00
# Switch to current working directory

scontrol show partition GTX gives:
PartitionName=GTX
   AllowGroups=ALL AllowAccounts=ALL AllowQos=ALL
   AllocNodes=ALL Default=NO
   DefaultTime=NONE DisableRootJobs=NO GraceTime=0 Hidden=NO
   MaxNodes=UNLIMITED MaxTime=2-00:00:00 MinNodes=1 LLN=NO 
MaxCPUsPerNode=UNLIMITED
   Nodes=node0[10-13]
   Priority=1 RootOnly=NO ReqResv=NO Shared=NO PreemptMode=OFF
   State=UP TotalCPUs=128 TotalNodes=4 SelectTypeParameters=N/A
   DefMemPerNode=UNLIMITED MaxMemPerNode=UNLIMITED

I have a similar partition configured in the same way, where the job is happily 
running. Am I missing something essential in the way the partition is 
configured?
Any thoughts? And thanks for any help.

Best,
Antonia

--
Dr Antonia Mey
University of Edinburgh
Department of Chemistry
Joseph Black Building
Edinburgh
EH9 3FJ

Tel: +44 1316507748
Email: [email protected]

signature.asc
Description: Message signed with OpenPGP using GPGMail

[slurm-dev] jobs rejected for no reason

Reply via email to