Sean Hefty wrote:
> Sean Hefty wrote:

>> An alternative is to send a DREP in response to a DREQ, even if a local
>> connection is not found, which is what this patch does.

> If there are no objections, I will commit this patch to svn, and submit for 
> inclusion upstream.

Sean,

My understanding is that without this patch the side that sends the DREQ 
would do few DREQ resends as of the "firsts" DREPs being lost and no 
DREPs sent once the id at the peer side left the timewait state, correct?

Arlin,

Can you please share what were the implications with intel MPI running a 
64 nodes (128 ranks?) job? was the issue here just making the ***job 
termination time*** bigger?

I don't have an objection for merging it, i just think it can be nice if 
we understand better what problem this patch comes to solve in terms of 
this use case that has driven the fix.

Or.


_______________________________________________
openib-general mailing list
[email protected]
http://openib.org/mailman/listinfo/openib-general

To unsubscribe, please visit http://openib.org/mailman/listinfo/openib-general

Reply via email to