Dear Xiongyi,
could you share a bit more about how you compile and run the calculation?
It will be even better if you can you also share the problem (including
input case, how you built and how you run) on the QE GitLab to track the
bug properly. https://gitlab.com/QEF/q-e/-/issues
Thank you
Dear QE developers
Recently, I run the pw.x with GPU acceleration of QE7.0. And I run the pw.x
with Serial version, my gpu is NVIDIA A100.
I found that Total Wall time is much larger than Total CPU time as follows:
PWSCF: 4d 2h16m CPU 5d 9h56m WALL
In addition, I found that