Hello all,
Every time I use sbatch, the job stays in pending status and is never
executed.
I searched the mail archive for a solution but did not find a match.
To keep debugging simple, I run slurmctld and slurmd on one machine.
The following is the output of the squeue command:
pro@head-node:/mirror/source$ squeue
JOBID PARTITION   NAME USER ST  TIME NODES NODELIST(REASON)
   70    comeon MatMul  pro PD  0:00     1 (Resources)
   71    comeon MatMul  pro PD  0:00     1 (Resources)
   72    comeon MatMul  pro PD  0:00     1 (Resources)
Here is the control machine and compute node configuration in slurm.conf:
ControlMachine=head-node
ControlAddr=head-node
#BackupController=
#BackupAddr=
...
...
...
# COMPUTE NODES
NodeName=DEFAULT CPUs=8 RealMemory=5949 TmpDisk=281483 State=UNKNOWN
NodeName=head-node NodeAddr=head-node SocketsPerBoard=1 CoresPerSocket=4 ThreadsPerCore=2
PartitionName=DEFAULT State=UP
PartitionName=comeon Nodes=head-node MaxTime=30 MaxNodes=2 Default=YES
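As a sanity check on the node definition above, here is a minimal sketch that compares the declared CPUs value with the product SocketsPerBoard x CoresPerSocket x ThreadsPerCore. It assumes a single board (no Boards= value is set in the config), and the variable names are only illustrative:

```shell
#!/bin/bash
# Values copied from the NodeName lines above; variable names are illustrative.
declared_cpus=8
sockets_per_board=1
cores_per_socket=4
threads_per_core=2

# Assumption: one board, because the config does not set Boards=.
expected_cpus=$(( sockets_per_board * cores_per_socket * threads_per_core ))

# Compare the declared CPU count with the count implied by the topology.
if [ "$declared_cpus" -eq "$expected_cpus" ]; then
    result="consistent"
else
    result="mismatch"
fi
echo "CPUs=$declared_cpus, topology product=$expected_cpus: $result"
```

With these values the two counts agree, so the CPU count itself does not look like the source of the (Resources) reason.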
And here is my sbatch script:
#!/bin/bash
#SBATCH -J MatMul
#SBATCH -o myMM.%j.out
#SBATCH -A pro
#SBATCH -N 1
#SBATCH -n 2
#SBATCH --time=00:30:00
#SBATCH [email protected]
#SBATCH --mail-type=begin
#SBATCH --mail-type=end
salloc mpiexec ./mm.o
Could anyone please tell me how to solve this?
Is something misconfigured?
Thank you in advance
Regards,
Husen