[Wien] .machines for several nodes

Christian Søndergaard Pedersen Mon, 12 Oct 2020 04:07:51 -0700

Hello everybody


I am new to WIEN2k, and am struggling with parallellizing calculations on our 
HPC cluster beyond what can be achieved using OMP. In particular, I want to 
execute run_lapw and/or runsp_lapw running on four identical nodes (16 cores 
each), parallellizing over k points (unless there's a more efficient scheme). 
To achieve this, I try to mimic the example from the User Guide (without the 
extra Alpha node), but my .machines-file does not work the way I intended. This 
is what I have:


#

dstart:g008:4 g021:4 g025:4 g028:4

lapw0:g008:4 g021:4 g025:4 g028:4

lapw1:g008:4 g021:4 g025:4 g028:4

lapw2:g008:4 g021:4 g025:4 g028:4

granularity:1

extrafine:1


The node names gxxx are read from SLURM_JOB_NODELIST in the submit script, and 
a couple of regular expressions generate the above lines. Afterwards, my job 
script does the following:


srun hostname -s > slurm.hosts
run_lapw -p

which results in a job that idles for the entire walltime and finishes with a 
CPU efficiency of 0.00%. I would appreciate any help in figuring out where I've 
gone wrong.


Best regards
Christian

_______________________________________________
Wien mailing list
Wien@zeus.theochem.tuwien.ac.at
http://zeus.theochem.tuwien.ac.at/mailman/listinfo/wien
SEARCH the MAILING-LIST at:  
http://www.mail-archive.com/wien@zeus.theochem.tuwien.ac.at/index.html

[Wien] .machines for several nodes

Reply via email to