Thanks for the tips. I got past the errors using the I_MPI_RDMA_RNDV_WRITE setting, but now I get the following error:
unexpected DAPL event 4008 from 111:n29 . . .

Any ideas? I am using Cisco HCA's MT25208 Tavor Compat, DLGL revision A0 FW v4.7.6 build 3.2.0.118

Mahmoud Hanafi
Sr. System Administrator
CSC HPC COE


Arlin Davis <[EMAIL PROTECTED]>
Sent by: [EMAIL PROTECTED]
02/01/2008 12:58 PM

To: Mahmoud Hanafi/DEF/[EMAIL PROTECTED]
cc: [EMAIL PROTECTED], [email protected]
Subject: Re: [ofa-general] ofed1.2.5rc2 and intel mpi error

Mahmoud Hanafi wrote:
>
> here is my mpirun command
>
> mpirun -np 128 -env I_MPI_DEVICE rdma:OpenIB-cma -env I_MPI_DEBUG 2
> /home/hanafim/HPL/xhpl
>

What IB adapter are you using and what F/W version (ibstat)?

This might be related to following bug with large RDMA reads:

https://bugs.openfabrics.org//show_bug.cgi?id=736

Try the following:

Set "-env I_MPI_RDMA_MAX_MSG_SIZE 4194304" to workaround the issue. If it does not help, set I_MPI_RDMA_RNDV_WRITE to enable to use RDMA Write rendezvous protocol instead of the default RDMA Read.
-arlin

_______________________________________________
general mailing list
[email protected]
http://lists.openfabrics.org/cgi-bin/mailman/listinfo/general

To unsubscribe, please visit http://openib.org/mailman/listinfo/openib-general
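
For reference, the two workarounds suggested in this thread could be folded into the original mpirun command roughly as follows. This is only a sketch: the variable names and the DRY_RUN switch are hypothetical helpers, not Intel MPI features, and setting both options in one run is an assumption (the advice in the thread was to try the message-size limit first and the RDMA Write rendezvous setting only if that did not help, so either `-env` pair can be dropped).

```shell
# Values taken from the mpirun command quoted in the thread.
NP=128
DEVICE="rdma:OpenIB-cma"
BINARY="/home/hanafim/HPL/xhpl"

# Original command plus the two suggested -env settings.
# Assumption: both workarounds are applied together; remove one to test them separately.
CMD="mpirun -np $NP -env I_MPI_DEVICE $DEVICE -env I_MPI_DEBUG 2 -env I_MPI_RDMA_MAX_MSG_SIZE 4194304 -env I_MPI_RDMA_RNDV_WRITE enable $BINARY"

# DRY_RUN is a hypothetical switch: by default the command is only printed.
# Set DRY_RUN=0 to actually launch the job.
if [ "${DRY_RUN:-1}" = "1" ]; then
    echo "$CMD"
else
    # Unquoted on purpose so the shell splits the string into arguments.
    $CMD
fi
```

Printing the command first makes it easy to confirm which settings are in effect before submitting a 128-process run.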
