Hi ,
I have installed slurm on a 2 node cluster.
On the master node when I run sinfo command I get below output.
sinfo
PARTITION AVAIL TIMELIMIT NODES STATE NODELIST
debug* up infinite 2 idle punehpcdl[01-02]
But on compute node:Slurmd daemon is also running but it gives the
Hi,
I am using slurm15.08.2 version.
And I am using mysql as database,where accounting storage user is with password.
Accountingstorageuser=root and Storagepass=root@123
But after configuring slurm.conf and slurmdbd.conf and starting slurm services
I am only able to start slurmdbd and slurmd
Hi
I am submitting job using sbatch command.
And my script is as follows:
#!/bin/sh
source /etc/profile
source $HOME/.bashrc
#SBATCH --job-name=222e
#SBATCH --output=slurm-%j.out
#SBATCH --error=slurm-%j.err
#SBATCH --ntasks=1
#SBATCH --mem-per-cpu=50
#SBATCH --account=dhvani
#SBATCH
Hi,
I have installed slurm 14.11.5
Can someone help,I am getting error on running slurmctld -Dv as follows:
slurmctld: debug3: Trying to load plugin
/opt/slurm/lib/slurm/accounting_storage_mysql.so
slurmctld: debug3: Couldn't find sym 'acct_storage_p_reconfig' in the plugin
slurmctld:
Hi
Can please someone help me in knowing that why slurmctld is getting killed in
very few seconds.
And the error for squeue and sinfo is as follows
slurm_load_partitions: Unable to contact slurm controller (connect failure) And
also on running :slurmctld -Dvvv
I get the following line:
CentOS-6.5
slurm 14.03.0
Installed from source
-Original Message-
From: Uwe Sauter [mailto:uwe.sauter...@gmail.com]
Sent: Wednesday, March 25, 2015 5:43 PM
To: slurm-dev
Subject: [slurm-dev] Re: SLURMCTLD ERROR
Please provide more information:
Which OS? Which Slurm version? Installed
CentOS-6.5
slurm 14.03.0
Installed from source
-Original Message-
From: Uwe Sauter [mailto:uwe.sauter...@gmail.com]
Sent: Wednesday, March 25, 2015 5:43 PM
To: slurm-dev
Subject: [slurm-dev] Re: SLURMCTLD ERROR
Please provide more information:
Which OS? Which Slurm version? Installed
What is the solution for this not responding reason?
sinfo -R
REASON USER TIMESTAMP NODELIST
Not responding root 2015-03-10T15:43:59 democlient1
Regards
Suprita
-Original Message-
From: Uwe Sauter [mailto:uwe.sauter...@gmail.com]
Sent: Tuesday,
I had 1 core on each node.
Changed the conf file and restarted slurm
-Original Message-
From: Uwe Sauter [mailto:uwe.sauter...@gmail.com]
Sent: Tuesday, March 10, 2015 3:34 PM
To: slurm-dev
Subject: [slurm-dev] Re: node getting again and again to drain or down state
In your slurmconf:
The o/p of sinfo-R is as follows:
REASON USER TIMESTAMP NODELIST
Not responding root 2015-03-10T14:21:11 democlient1
Low socket*core*thre root 2015-03-10T14:37:51 demomaster1
And I am attaching configuration file too.
Kindly see to it.
-Original
Hi
Please help me if anyone can.
I am running command
Scontrol update NodeName=xyz state=idle
After running this command ny node gets idle state but after sometime again
gets back to drain or down state
I have cheked my iptables and ip6tables status also its turned off
What might be the
11 matches
Mail list logo