I tried changing the CPU flag int eh compute node section of the conf file to 36, but it didnt make a difference, still limited to 18. Also tried removing the flag and letting slurm calculate it from the other info, e.g.; Sockets=1 CoresPerSocket=18 ThreadsPerCore=2
also no change. Could it be a non configuration issue, e.g. a slurm bug related to the processor type? I only say that because I am normally a torque user, but there is an open bug with Adaptive that seems to be related to some of the newer intel processsors/glibc/elision locking.... On Tue, Sep 6, 2016 at 7:30 PM, andrealphus <andrealp...@gmail.com> wrote: > > ahhhh......I'll give that a try. Thanks Lachlan, feel better! > > On Tue, Sep 6, 2016 at 6:49 PM, Lachlan Musicman <data...@gmail.com> wrote: >> No, sorry, I meant that your config file line needs to change: >> >> >> NodeName=localhost CPUs=36 RealMemory=120000 Sockets=1 CoresPerSocket=18 >> ThreadsPerCore=2 State=UNKNOWN >> >> ------ >> The most dangerous phrase in the language is, "We've always done it this >> way." >> >> - Grace Hopper >> >> On 7 September 2016 at 11:34, andrealphus <andrealp...@gmail.com> wrote: >>> >>> >>> Yup, thats what I expect too! Since Im brand new to slurm, not sure if >>> there is some other config option or srun flag to enable >>> multithreading >>> >>> On Tue, Sep 6, 2016 at 5:42 PM, Lachlan Musicman <data...@gmail.com> >>> wrote: >>> > Oh, I'm not 100% sure on this (home sick actually), but I think: >>> > >>> > NodeName=localhost CPUs=1 RealMemory=120000 Sockets=1 CoresPerSocket=18 >>> > ThreadsPerCore=2 State=UNKNOWN >>> > >>> > >>> > should have CPUs=36 (ie, ThreadsperCore*CoresPerSocket*Sockets) >>> > >>> > cheers >>> > L, >>> > >>> > ------ >>> > The most dangerous phrase in the language is, "We've always done it this >>> > way." >>> > >>> > - Grace Hopper >>> > >>> > On 7 September 2016 at 10:39, andrealphus <andrealp...@gmail.com> wrote: >>> >> >>> >> >>> >> Thanks Lachman, took threads-per-core and out same behavior, still >>> >> limited to 18. >>> >> >>> >> On Tue, Sep 6, 2016 at 5:33 PM, Lachlan Musicman <data...@gmail.com> >>> >> wrote: >>> >> > You don't need --threads-per-core. >>> >> > >>> >> > It's sufficient to have >>> >> > >>> >> > SelectType=select/cons_res >>> >> > SelectTypeParameters=CR_CPU >>> >> > >>> >> > then you should be able to get to all 36. >>> >> > >>> >> > cheers >>> >> > L. >>> >> > >>> >> > ------ >>> >> > The most dangerous phrase in the language is, "We've always done it >>> >> > this >>> >> > way." >>> >> > >>> >> > - Grace Hopper >>> >> > >>> >> > On 7 September 2016 at 10:22, andrealphus <andrealp...@gmail.com> >>> >> > wrote: >>> >> >> >>> >> >> >>> >> >> one more follow up.... >>> >> >> >>> >> >> This seems to limited to the number of cores. Anyway to change it so >>> >> >> that I can run up to the thread limit (18x2) concurrently? >>> >> >> >>> >> >> Thanks! >>> >> >> >>> >> >> On Tue, Sep 6, 2016 at 3:21 PM, andrealphus <andrealp...@gmail.com> >>> >> >> wrote: >>> >> >> > >>> >> >> > spoke too soon, so for posterity.... >>> >> >> > >>> >> >> > need to set, in the conf; >>> >> >> > SelectType=select/con_res >>> >> >> > SelectTypeParameters=CR_CPU >>> >> >> > >>> >> >> > and in the script; >>> >> >> > #SBATCH --threads-per-core=1 >>> >> >> > >>> >> >> > and DefMemPerCPU, did not matter... >>> >> >> > >>> >> >> > >>> >> >> > >>> >> >> > On Tue, Sep 6, 2016 at 3:08 PM, andrealphus >>> >> >> > <andrealp...@gmail.com> >>> >> >> > wrote: >>> >> >> >> >>> >> >> >> Hi all, >>> >> >> >> >>> >> >> >> Long time Torque user, first time SLURM user. I'm running version >>> >> >> >> 15.08 from APT on Ubuntu Xenial. (running on an 18 core CPU >>> >> >> >> E5-2697 >>> >> >> >> v4) >>> >> >> >> >>> >> >> >> I'm trying to figure out the proper slurm.conf configuration, and >>> >> >> >> script parameters to run a job array on a single node/server >>> >> >> >> workstation, with more than one concurrent task of the job >>> >> >> >> running >>> >> >> >> at >>> >> >> >> the time. >>> >> >> >> >>> >> >> >> e.g. >>> >> >> >> >>> >> >> >> #!/bin/bash >>> >> >> >> #SBATCH -o slurm_example-%A_%a.out >>> >> >> >> #SBATCH --array=1-21%3 >>> >> >> >> #SBATCH --mem-per-cpu=2000 f >>> >> >> >> >>> >> >> >> srun sleep 15 >>> >> >> >> >>> >> >> >> and submitting with $sbatch exmaple.sh, should run 21 total >>> >> >> >> instances >>> >> >> >> of sleep, 3 at a time, correct? >>> >> >> >> >>> >> >> >> I can never get more than 1 concurrent process going.... >>> >> >> >> >>> >> >> >> My slurm.conf file looks like; >>> >> >> >> >>> >> >> >> >>> >> >> >> ControlMachine=localhost >>> >> >> >> AuthType=auth/munge >>> >> >> >> CacheGroups=0 >>> >> >> >> CryptoType=crypto/munge >>> >> >> >> MaxTasksPerNode=32 >>> >> >> >> MpiDefault=none >>> >> >> >> ProctrackType=proctrack/pgid >>> >> >> >> ReturnToService=1 >>> >> >> >> SlurmctldPidFile=/var/run/slurmctld.pid >>> >> >> >> SlurmctldPort=6817 >>> >> >> >> SlurmdPidFile=/var/run/slurmd.pid >>> >> >> >> SlurmdPort=6818 >>> >> >> >> SlurmdSpoolDir=/var/spool/slurmd >>> >> >> >> SlurmUser=root >>> >> >> >> StateSaveLocation=/var/spool >>> >> >> >> SwitchType=switch/none >>> >> >> >> TaskPlugin=task/none >>> >> >> >> InactiveLimit=0 >>> >> >> >> KillWait=30 >>> >> >> >> MinJobAge=300 >>> >> >> >> SlurmctldTimeout=120 >>> >> >> >> SlurmdTimeout=300 >>> >> >> >> Waittime=0 >>> >> >> >> FastSchedule=1 >>> >> >> >> SchedulerType=sched/backfill >>> >> >> >> SchedulerPort=7321 >>> >> >> >> SelectType=select/cons_res >>> >> >> >> SelectTypeParameters=CR_CPU >>> >> >> >> AccountingStorageType=accounting_storage/none >>> >> >> >> AccountingStoreJobComment=YES >>> >> >> >> ClusterName=cluster >>> >> >> >> JobCompType=jobcomp/none >>> >> >> >> JobAcctGatherFrequency=30 >>> >> >> >> JobAcctGatherType=jobacct_gather/none >>> >> >> >> SlurmctldDebug=3 >>> >> >> >> SlurmdDebug=3 >>> >> >> >> >>> >> >> >> # COMPUTE NODES >>> >> >> >> NodeName=localhost CPUs=1 RealMemory=120000 Sockets=1 >>> >> >> >> CoresPerSocket=18 ThreadsPerCore=2 State=UNKNOWN >>> >> >> >> PartitionName=debug Nodes=localhost Shared=YES DefMemPerCPU=3000 >>> >> >> >> Default=YES MaxTime=INFINITE State=UP >>> >> >> >> >>> >> >> >> >>> >> >> >> I've tried both; >>> >> >> >> >>> >> >> >> SelectType=select/cons_res >>> >> >> >> SelectTypeParameters=CR_CPU >>> >> >> >> and >>> >> >> >> SelectType=select/linear >>> >> >> >> >>> >> >> >> but both return; >>> >> >> >> sinfo -o %C >>> >> >> >> CPUS(A/I/O/T) >>> >> >> >> 0/0/1/1 >>> >> >> >> >>> >> >> >> which didnt seem right, because I thought if I sent >>> >> >> >> SelectType=select/cons_res & SelectTypeParameters=CR_CPU, the >>> >> >> >> threads >>> >> >> >> should be seen as the CPUS >>> >> >> >> >>> >> >> >> >>> >> >> >> I've tried to piece it together with the slurm and ubuntu mailing >>> >> >> >> list, but two days later am ready to hide in a corner.... >>> >> >> >> >>> >> >> >> any info appreciated! >>> >> >> >> >>> >> >> >> ashton >>> >> > >>> >> > >>> > >>> > >> >>