Re: [slurm-users] Enforcing GPU-CPU ratios

2023-03-14 Thread Kilian Cavalotti
On Tue, Jun 23, 2020 at 7:37 AM Bas van der Vlies wrote: > > Which version of slurm do you use? as slurm 19.05: > * DefCpuPerGPU Sorry for necroposting and undigging this old thread, but the DefCpuPerGpu configuration option is actually just a default, which will happily get overridden by job

[slurm-users] Additional Container Runtime (`scrun`) Commands

2023-03-14 Thread Jesse Farebrother
Hi, I just finished testing out the new container support in 23.02. It’s really exciting the progress that’s been made so far! Sadly, most tooling doesn’t respect the OCI runtime specification and instead we live in a reality where most people define the spec in terms of the major

Re: [slurm-users] Partition Hold/Release

2023-03-14 Thread Nicolas Sonoda
Hi Marcus, Thank you very much for the response. I set the PriorityTier for my partitions and also set PreemptType=preempt/partition_prio and PreemptMode=SUSPEND,GANG. But the job in the low priority partition does not change it state to SUSPEND. Have any idea? Following are some information:

[slurm-users] cgroups issue

2023-03-14 Thread Boris Yazlovitsky
I sent this a while ago - don't know if it got to the mailing list: I'm running slurm 23.02.0 on ubuntu 14.04 when a batch job is submitted, getting this message in the error file: slurmstepd: error: common_file_write_content: unable to write 1 bytes to cgroup

Re: [slurm-users] changing the operational network in slurm setup

2023-03-14 Thread Paul Edmon
We do this for our Infiniband set up.  What we do is that we populate /etc/hosts with the hostname mapped to the IP we want Slurm to use.  This way you get IP traffic traversing the address you want between nodes while not having to mess with DNS. -Paul Edmon- On 3/14/2023 12:19 AM, Purvesh

Re: [slurm-users] Regarding Multi-Cluster Accounting Information

2023-03-14 Thread Shaghuf Rahman
Hi, I tried adding the 2 individual account in cluster A and ClusterB respectively and 1 account which is added to both the cluster # sacctmgr show user cluster=alpha User Def Acct Admin -- -- - user1 alpha_grp None user2 test None

Re: [slurm-users] sbatch does not work with Debian image

2023-03-14 Thread Shunran Zhang
The error message says that slurm cannot find slurm config file. Do you have a local copy of /etc/slurm/* or share the /etc/slurm across NFS, or using DNS & configless slurm? Sincerely, S. Zhang > Sorin Draga 於2023/03/14 18:49寫道: > >  > Hello everyone, > > I'm trying to run the new Debian

Re: [slurm-users] Partition Hold/Release

2023-03-14 Thread Wagner, Marcus
Hi Nicolas, you could use the prioritytier for partitions:    PriorityTier   Jobs submitted to a partition with a higher PriorityTier value will be evaluated by the scheduler before pending jobs in a partition with a lower PriorityTier value. They  will   also 

Re: [slurm-users] sbatch does not work with Debian image

2023-03-14 Thread olivier sallou
On Tue, 2023-03-14 at 05:19 -0400, Sorin Draga wrote: > Hello everyone, >   > I'm trying to run the new Debian image with SLURM, namely debian-11- > bullseye-v20230206 but sbatch/sinfo do not seem to work. I installed > slurm-client via: > sudo apt-get install slurm-client > > but it did not

[slurm-users] sbatch does not work with Debian image

2023-03-14 Thread Sorin Draga
Hello everyone, I'm trying to run the new Debian image with SLURM, namely debian-11-bullseye-v20230206 but sbatch/sinfo do not seem to work. I installed slurm-client via: sudo apt-get install slurm-client but it did not solve the problem: sinfo: error: resolve_ctls_from_dns_srv: res_nsearch

[slurm-users] Regarding Multi-Cluster Accounting Information

2023-03-14 Thread Shaghuf Rahman
Hi, I am new to slurm. I am setting up a multi cluster environment. I have 1 small doubt with respect to the user accounting. My setup will look like below: Cluster name A: *Alpha* (Slurmctld) Cluster name B: *Beta* (Slurmctld) Both controllers are pointing to the same database server. My slurm