[slurm-users] Two 19.05.4 build questions

2019-12-17 Thread Wiegand, Paul
Greetings, We are upgrading from 18.x to 19.05.4, but the build process for us appears a bit different now. 1) There doesn't appear to be a 19.x OSU mvapich2 patch as there were for previous slurms. Should we use the previous patch or not patch? 2) The acct_gather_profile_hdf5 plugin appears

Re: [slurm-users] get

2019-12-16 Thread Wiegand, Paul
Okay ... obviously an auto-complete error that I failed to check: Please ignore and accept my apologies. > On Dec 16, 2019, at 7:03 AM, Wiegand, Paul wrote: > > unlock stokes-arcc > get stokes-arcc >

Re: [slurm-users] Environment modules

2019-11-22 Thread Wiegand, Paul
We use TACC's lmod system. It is pretty straightforward to setup and reasonably well documented: https://www.tacc.utexas.edu/research-development/tacc-projects/lmod Paul. > On Nov 22, 2019, at 12:37 PM, Mariano.Maluf wrote: > > Hi all > > I am setting up for the first time a cluster with

Re: [slurm-users] Enable SLURM Accounting

2018-05-28 Thread Wiegand, Paul
No, there's more. In terms of the slurm.conf ... you will need to set the storage host so slurm knows where to go look for slurmdbd. If you are enforcing any limits, you will need to set those. I also set the job gather type. AccountingStorageEnforce AccountingStorageHost JobAcctGatherType

Re: [slurm-users] Running 'scontrol reconfigure" while jobs are running

2018-05-28 Thread Wiegand, Paul
I have run scontrol reconfigure while jobs are running many times, and there's never been an effect on running jobs as far as I have been able to ascertain. Our cluster stays active 24/7, and we do a variety of things that require a reconfigure from time-to-time (e.g., add new nodes). If we

Re: [slurm-users] GPU / cgroup challenges

2018-05-02 Thread Wiegand, Paul
So there is a patch? -- Original message-- From: Fulcomer, Samuel Date: Wed, May 2, 2018 11:14 To: Slurm User Community List; Cc: Subject:Re: [slurm-users] GPU / cgroup challenges This came up around 12/17, I think, and as I recall the fixes were added to the src repo then; however,