This means that you have some problem on that node, and it's probably unrelated to Open MPI. Bad cable? Bad port? FW/driver in some bad state? Do other IB performance tests work OK on this node? Try rebooting the node.
-- YK On 12-Sep-11 7:52 AM, Ahsan Ali wrote: > Hello all > > I am getting following error during an application run which causes it to > crash. > > *[[36944,1],41][btl_openib_component.c:3227:handle_wc] from > compute-01-19.private.dns.zone to: compute-01-04 error polling LP CQ with > status RETRY EXCEEDED ERROR status number 12 for wr_id 167703304 opcode 128 > vendor error 129 qp_idx 3* > > I removed that particular node and then the error was removed.Please suggest > me what could be the solution to this. Thanking you in advance. > > -- > Syed Ahsan Ali Bokhari > Electronic Engineer (EE) > > Research & Development Division > Pakistan Meteorological Department H-8/4, Islamabad. > Phone # off +92518358714 > Cell # +923155145014 > > > > _______________________________________________ > users mailing list > us...@open-mpi.org > http://www.open-mpi.org/mailman/listinfo.cgi/users