Dear Lukasz,

 The difference in computation time between lapw1 and lapwso
calculations is expected in band calculations. The lapwso step
involves the calculation of spin-orbit coupling, which can be
computationally more demanding compared to the lapw1 step that
calculates the bands without spin-orbit coupling.

 The lapwso calculation includes additional interactions between the
spin of the electron and its orbital motion, which requires more
computational resources and time. Therefore, it is normal to observe a
longer runtime for lapwso compared to lapw1.

 In your case, the lapwso process is utilizing a significant portion
of the CPU resources, as indicated by the high CPU usage percentages
(%CPU) in the top output. The memory usage (%MEM) is also relatively
high for the lapwso processes.

 It appears that you have allocated sufficient resources (OMP=2) for
the lapwso step, and your system has ample memory available.
Therefore, the longer runtime can be attributed to the inherent
complexity of the spin-orbit coupling calculations rather than a
resource limitation.

 If you need to optimize the performance further, you may consider
adjusting the OMP settings or exploring parallelization options with
k-points or MPI to distribute the workload across multiple cores or
processors. However, it's important to note that the total runtime for
the lapwso step will inherently be longer due to the nature of the
calculations involved.

Suggestions for trying:
omp_global:4
#omp_lapw1:2
#omp_lapw2:2
#omp_lapwso:2

In WIEN2k, k-points parallelization can be more efficient. You can use
the testpara_lapw command to assess if increasing the number of
"1:localhost" lines in your .machines file is necessary. testpara_lapw
is a utility program in the WIEN2k package that helps determine the
optimal number of lines (k-points) needed for accurate calculations.

Compiling the code with appropriate optimization flags can
significantly improve the performance and speed of calculations. Here
are some additional suggestions related to code compilation:
Experiment with different optimization levels. Most compilers provide
different optimization levels, such as -O1, -O2, -O3. Higher
optimization levels generally provide better performance but may
increase compilation time. Use the suggestion of the siteconfig_lapw
for the right balance of the code.
Ensure that you are using the latest version of the code, i.e.,
WIEN2k_23.2. The respectful Developers have recently released
significant updates and bug fixes that can improve performance, super
thanks to Peter Blaha and all the developers.

Best regards,
Saeid


Quoting pluto via Wien <wien@zeus.theochem.tuwien.ac.at>:

Dear All,

When calculating bands for a large slab I have following sequence:

Sun May 14 12:33:03 PM CEST 2023> (x) lapw1 -band -up -p
Sun May 14 02:25:26 PM CEST 2023> (x) lapw1 -band -dn -p
Sun May 14 04:17:22 PM CEST 2023> (x) lapwso -up -p
Mon May 15 01:30:05 AM CEST 2023> (x) qtl -up -p -band -so
Mon May 15 01:30:05 AM CEST 2023> (x) lapw2 -p -fermi -so -up
Mon May 15 01:51:51 AM CEST 2023> (x) qtl -dn -p -band -so
Mon May 15 01:51:51 AM CEST 2023> (x) lapw2 -p -fermi -so -dn

As you can see lapwso takes much longer than lapw1 (approx. 9h vs
2h). Is this normal for band calculations?

I have 128 GB of RAM in this computer, so this is not a RAM issue.
Here is what top shows for the lapwso calculation (I have 4 parallel
localhost processes in .machines, OMP=2 and no mpi):

Tasks: 505 total,   2 running, 503 sleeping,   0 stopped,   0 zombie
%Cpu(s): 24.0 us,  0.2 sy,  0.0 ni, 75.7 id,  0.0 wa,  0.1 hi,  0.0
si,  0.0 st
MiB Mem : 128047.1 total,   1845.8 free,  16809.0 used, 111158.6 buff/cache
MiB Swap:  32088.0 total,  31471.5 free,    616.5 used. 111238.1 avail Mem

   PID USER      PR  NI    VIRT    RES    SHR S  %CPU  %MEM     TIME+ COMMAND
1336417 lplucin   20   0 6417856   4.8g  15840 R 199.3   3.8   1294:13 lapwso
1336392 lplucin   20   0 2848204   2.3g  15880 S 146.8   1.9   1295:30 lapwso
1336391 lplucin   20   0 2848188   2.4g  15916 S 130.6   1.9   1304:23 lapwso
1336396 lplucin   20   0 2848060   2.3g  15816 S  99.7   1.9   1288:06 lapwso

.machines file:

omp_global:8
omp_lapw1:2
omp_lapw2:2
omp_lapwso:2
1:localhost
1:localhost
1:localhost
1:localhost
granularity:1

Best,
Lukasz
_______________________________________________
Wien mailing list
Wien@zeus.theochem.tuwien.ac.at
http://zeus.theochem.tuwien.ac.at/mailman/listinfo/wienSEARCH the
MAILING-LIST at: 
http://www.mail-archive.com/wien@zeus.theochem.tuwien.ac.at/index.html
_______________________________________________
Wien mailing list
Wien@zeus.theochem.tuwien.ac.at
http://zeus.theochem.tuwien.ac.at/mailman/listinfo/wien
SEARCH the MAILING-LIST at:  
http://www.mail-archive.com/wien@zeus.theochem.tuwien.ac.at/index.html

Reply via email to