Re: [OMPI devel] Device failover on ob1

Ralph Castain Tue, 4 Aug 2009 06:50:15 -0400

Rolf/Mouhamed

Could you get together off-list to discuss the different approachesand see if/where there is common ground. It would be nice to see anintegrated solution - personally, I would rather not see twoorthogonal approaches unless they can be cleanly separated. Muchbetter if they could support each other in an intelligent fashion.


On Aug 3, 2009, at 9:49 AM, Pavel Shamis (Pasha) wrote:

I have not, but there should be no difference. The failover codeonly gets triggered when an error happens. Otherwise, there are nodifferences in the code paths while everything is functioningnormally.
Sounds good. I still did not have time to review the code. I willtry to do it during this week.
Pasha
Rolf

On 08/03/09 11:14, Pavel Shamis (Pasha) wrote:
Rolf,
Did you compare latency/bw for failover-enabled code VS trunk ?

Pasha.

Rolf Vandevaart wrote:
Hi folks:
As some of you know, I have also been looking into implementingfailover as well. I took a different approach as I am solvingthe problem within the openib BTL itself. This of course meansthat this only works for failing from one openib BTL to anotherbut that was our area of interest. This also means that we donot need to keep track of fragments as we get them back from thecompletion queue upon failure. We then extract the relevantinformation and repost on the other working endpoint.
My work has been progressing at http://bitbucket.org/rolfv/ompi-failover.
This only currently works for send semantics so you have to runwith -mca btl_openib_flags 1.
Rolf

On 07/31/09 05:49, Mouhamed Gueye wrote:
Hi list,

Here is an update on our work concerning device failover.
As many of you suggested, we reoriented our work on ob1 ratherthan dr and we now have a working prototype on top of ob1. Theapproach is to store btl descriptors sent to peers and deletethem when we receive proof of delivery. So far, we rely oncompletion callback functions, assuming that the message isdelivered when the completion function is called, that is thecase of openib. When a btl module fails, it is removed from theendpoint's btl list and the next one is used to retransmitstored descriptors. No extra-message is transmitted, it onlyconsists in additions to the header. It has been mainly testedwith two IB modules, in both multi-rail (two separate networks)and multi-path (a big unique network).
You can grab and test the patch here (applies on top of thetrunk) :
http://bitbucket.org/gueyem/ob1-failover/
To compile with failover support, just define --enable-device-failover at configure. You can then run a benchmark, disconnecta port and see the failover operate.
A little latency increase (~ 2%) is induced by the failoverlayer when no failover occurs. To accelerate the failoverprocess on openib, you can try to lower thebtl_openib_ib_timeout openib parameter to 15 for example insteadof 20 (default value).
Mouhamed
_______________________________________________
devel mailing list
[email protected]
http://www.open-mpi.org/mailman/listinfo.cgi/devel
_______________________________________________
devel mailing list
[email protected]
http://www.open-mpi.org/mailman/listinfo.cgi/devel

Re: [OMPI devel] Device failover on ob1

Reply via email to