My make.inc is attached for some details on how I compiled QE.

pw.x runs fine for a while, then stops with no visible warning or error in
stdout. I suspect the cause is ScaLAPACK, the cluster environment, or a
combination of both. I haven't noticed any pattern in whether or when pw.x
stops this way.

I've only seen this happen when running large-ish jobs of several hundred
atoms or more on multiple compute nodes with northo larger than 1
(usually I set northo to 16, 25, 36, 64, etc., depending on the number of
procs and nodes).
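
For illustration, the northo values listed above are all perfect squares. A
minimal sketch of picking such a value from a process count follows. It
assumes the diagonalization group is laid out as a square process grid that
cannot be larger than the number of MPI processes available; the helper name
largest_square_grid is hypothetical and not part of QE.

```python
import math


def largest_square_grid(nprocs: int) -> tuple[int, int]:
    """Return (side, side * side) for the largest square grid within nprocs.

    Assumption: the linear-algebra group is a side x side process grid, so
    the candidate northo value is the largest perfect square <= nprocs.
    """
    if nprocs < 1:
        raise ValueError("nprocs must be a positive integer")
    side = math.isqrt(nprocs)  # integer square root, rounded down
    return side, side * side


if __name__ == "__main__":
    # Example process counts; these are illustrative, not from a real job.
    for nprocs in (16, 40, 64, 100):
        side, northo = largest_square_grid(nprocs)
        print(f"{nprocs} procs -> {side} x {side} grid, northo = {northo}")
```

The second returned value is only an upper bound on what could be passed as
northo; a smaller square may still be the better choice for a given job.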

I'm really only guessing, but it might be some kind of communication issue
between nodes that causes ScaLAPACK to error out and halt pw.x.

Has anyone else come across this behavior? I've noticed it across multiple
versions of QE and of the Intel MKL and compilers.

Andrew Supka
PhD Student
Central Michigan University

Attachment: make.inc
Description: Binary data
