Greetings,
We are upgrading from 18.x to 19.05.4, but the build process for us appears a
bit different now.
1) There doesn't appear to be a 19.x OSU mvapich2 patch as there were for
previous slurms. Should we use the previous patch or not patch?
2) The acct_gather_profile_hdf5 plugin appears
Okay ... obviously an auto-complete error that I failed to check: Please
ignore and accept my apologies.
> On Dec 16, 2019, at 7:03 AM, Wiegand, Paul wrote:
>
> unlock stokes-arcc
> get stokes-arcc
>
We use TACC's lmod system. It is pretty straightforward to setup and
reasonably well documented:
https://www.tacc.utexas.edu/research-development/tacc-projects/lmod
Paul.
> On Nov 22, 2019, at 12:37 PM, Mariano.Maluf wrote:
>
> Hi all
>
> I am setting up for the first time a cluster with
No, there's more. In terms of the slurm.conf ... you will need to set the
storage host so slurm knows where to go look for slurmdbd. If you are
enforcing any limits, you will need to set those. I also set the job gather
type.
AccountingStorageEnforce
AccountingStorageHost
JobAcctGatherType
I have run scontrol reconfigure while jobs are running many times, and there's
never been an effect on running jobs as far as I have been able to ascertain.
Our cluster stays active 24/7, and we do a variety of things that require a
reconfigure from time-to-time (e.g., add new nodes). If we
So there is a patch?
-- Original message--
From: Fulcomer, Samuel
Date: Wed, May 2, 2018 11:14
To: Slurm User Community List;
Cc:
Subject:Re: [slurm-users] GPU / cgroup challenges
This came up around 12/17, I think, and as I recall the fixes were added to the
src repo then; however,