Thanks for the tips. I got past the errors by using the I_MPI_RDMA_RNDV_WRITE 
setting, but now I get the following error:


unexpected DAPL event 4008 from 111:n29
.
.
.

Any ideas?

I am using Cisco HCAs:
MT25208 Tavor Compat, DLGL revision A0
FW v4.7.6 build 3.2.0.118

Mahmoud Hanafi
Sr. System Administrator
CSC HPC COE
Bld. 676
2435 Fifth Street
WPAFB, Ohio 45433
(937) 255-1536






Arlin Davis <[EMAIL PROTECTED]> 
Sent by: [EMAIL PROTECTED]
02/01/2008 12:58 PM

To
Mahmoud Hanafi/DEF/[EMAIL PROTECTED]
cc
[EMAIL PROTECTED], [email protected]
Subject
Re: [ofa-general] ofed1.2.5rc2 and intel mpi error






Mahmoud Hanafi wrote:
> 
> here is my mpirun command
> 
> mpirun -np 128 -env I_MPI_DEVICE rdma:OpenIB-cma -env I_MPI_DEBUG 2 
> /home/hanafim/HPL/xhpl
> 

What IB adapter are you using and what F/W version (ibstat)?

This might be related to following bug with large RDMA reads: 
https://bugs.openfabrics.org//show_bug.cgi?id=736

Try the following:

Set "-env I_MPI_RDMA_MAX_MSG_SIZE 4194304" to work around the issue. If 
that does not help, set I_MPI_RDMA_RNDV_WRITE to "enable" so that the RDMA 
Write rendezvous protocol is used instead of the default RDMA Read.
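
For reference, here is a rough sketch of how the two settings might be added to the mpirun line quoted above. It only prints the command lines (a dry run) and does not launch anything. The NP and APP variables and the build_cmd helper are made up for illustration; the -env options and values are the ones mentioned in this thread.

```shell
#!/bin/sh
# Dry-run sketch: prints the mpirun command lines, does not execute them.
# NP, APP and build_cmd are illustrative names, not part of Intel MPI.
NP=128
APP=/home/hanafim/HPL/xhpl

build_cmd() {
    # $1 = "size"  -> cap the RDMA message size (first suggestion)
    # $1 = "write" -> use the RDMA Write rendezvous protocol (second suggestion)
    case "$1" in
        size)  extra="-env I_MPI_RDMA_MAX_MSG_SIZE 4194304" ;;
        write) extra="-env I_MPI_RDMA_RNDV_WRITE enable" ;;
        *)     echo "usage: build_cmd size|write" >&2; return 1 ;;
    esac
    printf '%s\n' "mpirun -np $NP -env I_MPI_DEVICE rdma:OpenIB-cma -env I_MPI_DEBUG 2 $extra $APP"
}

build_cmd size
build_cmd write
```

Try the "size" variant first and fall back to the "write" variant only if the error persists, in the same order as suggested above.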

-arlin

_______________________________________________
general mailing list
[email protected]
http://lists.openfabrics.org/cgi-bin/mailman/listinfo/general

To unsubscribe, please visit 
http://openib.org/mailman/listinfo/openib-general
