This means that you have some problem on that node,
and it's probably unrelated to Open MPI.
Bad cable? Bad port? FW/driver in some bad state?
Do other IB performance tests work OK on this node?
Try rebooting the node.

-- YK

On 12-Sep-11 7:52 AM, Ahsan Ali wrote:
> Hello all
> 
> I am getting following error during an application run which causes it to 
> crash.
> 
> *[[36944,1],41][btl_openib_component.c:3227:handle_wc] from 
> compute-01-19.private.dns.zone to: compute-01-04 error polling LP CQ with 
> status RETRY EXCEEDED ERROR status number 12 for wr_id 167703304 opcode 128  
> vendor error 129 qp_idx 3*
> 
> I removed that particular node and then the error was removed.Please suggest 
> me what could be the solution to this. Thanking you in advance.
> 
> -- 
> Syed Ahsan Ali Bokhari
> Electronic Engineer (EE)
> 
> Research & Development Division
> Pakistan Meteorological Department H-8/4, Islamabad.
> Phone # off  +92518358714
> Cell # +923155145014
> 
> 
> 
> _______________________________________________
> users mailing list
> us...@open-mpi.org
> http://www.open-mpi.org/mailman/listinfo.cgi/users

Reply via email to