[slurm-dev] Re: Coman equivalent slurm LSF

2014-10-21 Thread lfelipe
Dear all: Thanks to all for your comments and advice. Luis Felipe Ruiz Nieto On 20/10/14 at #4, Morris Jette wrote: Try srun --noalloc as root On October 20, 2014 4:43:49 AM PDT, lfelipe luis.r...@uam.es wrote: Thanks for your reply. The problem is that if the machine is

[slurm-dev] Re: Partition for unused resources until needed by any other partition

2014-10-21 Thread Mikael Johansson
Thanks! That looks like something that could indeed be useful. For the moment, though, we are stuck with version 2.2.7, and if I understood the docs correctly, most of the partition-based parameters were introduced in later versions. We might upgrade at some point in the future, though. It also seems like

[slurm-dev] Odd (ReqNodeNotAvail) and (PartitionNodeLimit) with multiple partitions

2014-10-21 Thread Mikael Johansson
Hello All, I had a problem with jobs being stuck in the queue and not being scheduled even with unused cores on the cluster. The system has four partitions: three different high-priority ones and one lower-priority backfill partition. A concise description of the setup in slurm.conf,

[slurm-dev] Unable to start slurmdbd after system crash

2014-10-21 Thread Trey Dockendorf
Yesterday the VMs running slurmctld, slurmdbd and MySQL all crashed due to issues with storage networks. I am now unable to start slurmdbd. The error I get is below. The part about "is marked as crashed and should be repaired", is that coming from MySQL or from SLURM? Any advice on how to

[slurm-dev] Re: Unable to start slurmdbd after system crash

2014-10-21 Thread Trey Dockendorf
Sorry for the noise. I had never seen this error before, so I figured it was SLURM-specific, but it's not. It's been a long day and I should have googled it first :) mysql mysql -e "REPAIR TABLE proc" resolved the issue. - Trey = Trey Dockendorf Systems Analyst I Texas A&M

[slurm-dev] Re: Prune database before migration to 14.11 ?

2014-10-21 Thread Christopher Samuel
On 16/10/14 16:02, Christopher Samuel wrote: No worries, we're going to test out ours in a sandbox as well, so we'll be able to compare it to our (pretty beefy) DB servers. It took around 2 minutes to add all the indexes in our sandbox; that's with a total of about 6 million jobs across 5