Dear list,

One of our users faces problems running his application (large CP2K cases)

Cluster:
OpenMPI 1.4.2, SLES 9, gcc 4.1.2, OFED 1.4 on Intel Nehalem (5350)

The message is:

[[45776,1],214][btl_openib_component.c:2951:handle_wc] from node140 to:
node400 error polling LP CQ with status LOCAL QP OPERATION ERROR status
number 2 for wr_id 250502144 opcode 1  vendor error 103 qp_idx 0

OpenMPI has been compiled using the following flags:

./configure --prefix=/som/prefix/dir --enable-branch-probabilities
--enable-mem-debug --enable-mem-profile --enable-picky --enable-peruse
--enable-per-user-config-files --enable-cxx-exceptions
--enable-mpi-threads --enable-openib-ibcm --enable-openib-rdmacm --with-sge

Any idea why and/or if something is wrong in the configuration ? Any fix ?

Thanks in advance

Best regards
Vince

-- 
---------------------------------------------------
Dr. Vincent KELLER

Universität Zürich
           http://www.hpcn.uzh.ch
ADDRESS:   Winterthurstrasse 190
           CH - 8057 Zürich
           Switzerland
PHONE  :   + 41 (0) 44/635'40'37
FAX    :   + 41 (0) 44/635'45'05

Reply via email to