What is the solution for this not responding reason? sinfo -R REASON USER TIMESTAMP NODELIST Not responding root 2015-03-10T15:43:59 democlient1
Regards Suprita -----Original Message----- From: Uwe Sauter [mailto:uwe.sauter...@gmail.com] Sent: Tuesday, March 10, 2015 3:34 PM To: slurm-dev Subject: [slurm-dev] Re: node getting again and again to drain or down state In your slurmconf: Procs=2 From the output: Low socket*core*thre How many cores / CPUs / sockets do your nodes have? Am 10.03.2015 um 10:48 schrieb suprita.bot...@wipro.com: > The o/p of sinfo-R is as follows: > REASON USER TIMESTAMP NODELIST > Not responding root 2015-03-10T14:21:11 democlient1 > Low socket*core*thre root 2015-03-10T14:37:51 demomaster1 > And I am attaching configuration file too. > Kindly see to it. > > -----Original Message----- > From: Mehdi Denou [mailto:mehdi.de...@bull.net] > Sent: Tuesday, March 10, 2015 2:48 PM > To: slurm-dev > Subject: [slurm-dev] Re: node getting again and again to drain or down > state > > > What is the output of "sinfo -R" for this node ? > > Le 10/03/2015 10:08, Uwe Sauter a écrit : >> Check that your node resources in slurm.conf represent your actual >> configuration, e.g. that the amount of memory in your node is configured as >> equal or less in slurm.conf. >> >> >> Am 10.03.2015 um 10:05 schrieb suprita.bot...@wipro.com: >>> >>> >>> >>> >>> Hi >>> >>> Please help me if anyone can. >>> >>> I am running command >>> >>> Scontrol update NodeName=xyz state=idle >>> >>> After running this command ny node gets idle state but after >>> sometime again gets back to drain or down state >>> >>> I have cheked my iptables and ip6tables status also its turned off >>> >>> What might be the reason? >>> >>> Kindly help. >>> >>> >>> >>> Regards >>> >>> Suprita >>> >>> The information contained in this electronic message and any >>> attachments to this message are intended for the exclusive use of >>> the >>> addressee(s) and may contain proprietary, confidential or privileged >>> information. If you are not the intended recipient, you should not >>> disseminate, distribute or copy this e-mail. Please notify the >>> sender immediately and destroy all copies of this message and any >>> attachments. WARNING: Computer viruses can be transmitted via email. >>> The recipient should check this email and any attachments for the >>> presence of viruses. The company accepts no liability for any damage >>> caused by any virus transmitted by this email. www.wipro.com > > -- > --- > Mehdi Denou > International HPC support > +336 45 57 66 56 > The information contained in this electronic message and any > attachments to this message are intended for the exclusive use of the > addressee(s) and may contain proprietary, confidential or privileged > information. If you are not the intended recipient, you should not > disseminate, distribute or copy this e-mail. Please notify the sender > immediately and destroy all copies of this message and any > attachments. WARNING: Computer viruses can be transmitted via email. > The recipient should check this email and any attachments for the > presence of viruses. The company accepts no liability for any damage > caused by any virus transmitted by this email. www.wipro.com > The information contained in this electronic message and any attachments to this message are intended for the exclusive use of the addressee(s) and may contain proprietary, confidential or privileged information. If you are not the intended recipient, you should not disseminate, distribute or copy this e-mail. Please notify the sender immediately and destroy all copies of this message and any attachments. WARNING: Computer viruses can be transmitted via email. The recipient should check this email and any attachments for the presence of viruses. The company accepts no liability for any damage caused by any virus transmitted by this email. www.wipro.com