Sean Hefty wrote: > Sean Hefty wrote: >> An alternative is to send a DREP in response to a DREQ, even if a local >> connection is not found, which is what this patch does.
> If there are no objections, I will commit this patch to svn, and submit for > inclusion upstream. Sean, My understanding is that without this patch the side that sends the DREQ would do few DREQ resends as of the "firsts" DREPs being lost and no DREPs sent once the id at the peer side left the timewait state, correct? Arlin, Can you please share what were the implications with intel MPI running a 64 nodes (128 ranks?) job? was the issue here just making the ***job termination time*** bigger? I don't have an objection for merging it, i just think it can be nice if we understand better what problem this patch comes to solve in terms of this use case that has driven the fix. Or. _______________________________________________ openib-general mailing list [email protected] http://openib.org/mailman/listinfo/openib-general To unsubscribe, please visit http://openib.org/mailman/listinfo/openib-general
