[slurm-dev] RE: slurm_load_partitions: Unable to contact slurm controller (connect failure)

2016-10-25 Thread suprita.bothra
Hi , I have installed slurm on a 2 node cluster. On the master node when I run sinfo command I get below output. sinfo PARTITION AVAIL TIMELIMIT NODES STATE NODELIST debug* up infinite 2 idle punehpcdl[01-02] But on compute node:Slurmd daemon is also running but it gives the

[slurm-dev] Slurmctld daemon not starting.

2015-11-17 Thread suprita.bothra
Hi, I am using slurm15.08.2 version. And I am using mysql as database,where accounting storage user is with password. Accountingstorageuser=root and Storagepass=root@123 But after configuring slurm.conf and slurmdbd.conf and starting slurm services I am only able to start slurmdbd and slurmd

[slurm-dev] Requested configuration not available

2015-09-14 Thread suprita.bothra
Hi I am submitting job using sbatch command. And my script is as follows: #!/bin/sh source /etc/profile source $HOME/.bashrc #SBATCH --job-name=222e #SBATCH --output=slurm-%j.out #SBATCH --error=slurm-%j.err #SBATCH --ntasks=1 #SBATCH --mem-per-cpu=50 #SBATCH --account=dhvani #SBATCH

[slurm-dev] slurmctld -Dvvv

2015-03-29 Thread suprita.bothra
Hi, I have installed slurm 14.11.5 Can someone help,I am getting error on running slurmctld -Dv as follows: slurmctld: debug3: Trying to load plugin /opt/slurm/lib/slurm/accounting_storage_mysql.so slurmctld: debug3: Couldn't find sym 'acct_storage_p_reconfig' in the plugin slurmctld:

[slurm-dev] SLURMCTLD ERROR

2015-03-25 Thread suprita.bothra
Hi Can please someone help me in knowing that why slurmctld is getting killed in very few seconds. And the error for squeue and sinfo is as follows slurm_load_partitions: Unable to contact slurm controller (connect failure) And also on running :slurmctld -Dvvv I get the following line:

[slurm-dev] Re: SLURMCTLD ERROR

2015-03-25 Thread suprita.bothra
CentOS-6.5 slurm 14.03.0 Installed from source -Original Message- From: Uwe Sauter [mailto:uwe.sauter...@gmail.com] Sent: Wednesday, March 25, 2015 5:43 PM To: slurm-dev Subject: [slurm-dev] Re: SLURMCTLD ERROR Please provide more information: Which OS? Which Slurm version? Installed

[slurm-dev] Re: SLURMCTLD ERROR

2015-03-25 Thread suprita.bothra
CentOS-6.5 slurm 14.03.0 Installed from source -Original Message- From: Uwe Sauter [mailto:uwe.sauter...@gmail.com] Sent: Wednesday, March 25, 2015 5:43 PM To: slurm-dev Subject: [slurm-dev] Re: SLURMCTLD ERROR Please provide more information: Which OS? Which Slurm version? Installed

[slurm-dev] Re: node getting again and again to drain or down state

2015-03-10 Thread suprita.bothra
What is the solution for this not responding reason? sinfo -R REASON USER TIMESTAMP NODELIST Not responding root 2015-03-10T15:43:59 democlient1 Regards Suprita -Original Message- From: Uwe Sauter [mailto:uwe.sauter...@gmail.com] Sent: Tuesday,

[slurm-dev] Re: node getting again and again to drain or down state

2015-03-10 Thread suprita.bothra
I had 1 core on each node. Changed the conf file and restarted slurm -Original Message- From: Uwe Sauter [mailto:uwe.sauter...@gmail.com] Sent: Tuesday, March 10, 2015 3:34 PM To: slurm-dev Subject: [slurm-dev] Re: node getting again and again to drain or down state In your slurmconf:

[slurm-dev] Re: node getting again and again to drain or down state

2015-03-10 Thread suprita.bothra
The o/p of sinfo-R is as follows: REASON USER TIMESTAMP NODELIST Not responding root 2015-03-10T14:21:11 democlient1 Low socket*core*thre root 2015-03-10T14:37:51 demomaster1 And I am attaching configuration file too. Kindly see to it. -Original

[slurm-dev] node getting again and again to drain or down state

2015-03-10 Thread suprita.bothra
Hi Please help me if anyone can. I am running command Scontrol update NodeName=xyz state=idle After running this command ny node gets idle state but after sometime again gets back to drain or down state I have cheked my iptables and ip6tables status also its turned off What might be the