Hello experts,
I have compiled QE for GPU (MPI + OpenMP). I've noticed that when using more than one core (mpirun -np 8) the calculation becomes very slow, but it is much faster when I run "mpirun -np 1". Is there a reason for that?
I have only 1 GPU, and I have added "export OMP_NUM_THREADS=1" to my .bashrc.
Thank you.
