Linux has quite a sophisticated mechanism to maintain, cache, probe, invalidate, and update the network stack's L2 neighbour info.

Path records are not just L2 info. They contain L4, L3, and L2 info together.
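To make that concrete, here is a rough sketch of which path record fields belong to which layer. This is illustrative only; the real definition is struct ib_sa_path_rec in the kernel's <rdma/ib_sa.h>, and the names here only loosely follow it:

```c
#include <stdint.h>
#include <string.h>

/* Illustrative sketch only -- not the real kernel definition. */
struct example_path_rec {
    uint64_t service_id;                  /* upper-layer (service) identity */
    uint8_t  dgid[16], sgid[16];          /* L3: GIDs carried in the GRH */
    uint32_t flow_label;                  /* L3 */
    uint8_t  hop_limit, traffic_class;    /* L3 */
    uint16_t dlid, slid;                  /* L2: LIDs carried in the LRH */
    uint16_t pkey;                        /* L2: partition key */
    uint8_t  sl;                          /* L2: service level */
    uint8_t  mtu, rate, packet_life_time; /* path properties */
};

/* A path is "global" (needs a GRH) roughly when it leaves the local
 * subnet, which the SA signals with a nonzero hop limit. */
static int example_path_needs_grh(const struct example_path_rec *pr)
{
    return pr->hop_limit > 0;
}
```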

For example, in the Voltaire gen1 stack we had an ib arp module which was used by both IPoIB and native IB ULPs (SDP, iSER, Lustre, etc). This module managed a path cache of sorts, where IPoIB always asked for a non-cached path while the other ULPs were willing to accept a cached one.

IMO, using a cached AH is no different than using a cached path. You're simply mapping the PR data into another structure.
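The mapping in question is essentially a field-by-field copy; the kernel does this in ib_init_ah_from_path(). A minimal sketch, with hypothetical stand-in types (the real ones are struct ib_sa_path_rec and struct ib_ah_attr):

```c
#include <stdint.h>
#include <string.h>

/* Hypothetical stand-ins for struct ib_sa_path_rec / struct ib_ah_attr. */
struct ex_path_rec { uint16_t dlid; uint8_t sl; uint8_t rate; };
struct ex_ah_attr  { uint16_t dlid; uint8_t sl; uint8_t static_rate;
                     uint8_t port_num; };

/* Caching the AH vs. caching the PR: the AH is just the subset of PR
 * fields the HCA needs to address the remote port, copied once. */
static void ex_init_ah_from_path(const struct ex_path_rec *pr,
                                 uint8_t port_num,
                                 struct ex_ah_attr *ah)
{
    memset(ah, 0, sizeof(*ah));
    ah->dlid        = pr->dlid;
    ah->sl          = pr->sl;
    ah->static_rate = pr->rate;
    ah->port_num    = port_num;
}
```

Either way, the data originates in one PR query; the only question is which structure you keep it in.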

We're ignoring the real problem here: a centralized SA doesn't scale. MPI stacks have largely sidestepped this by simply not doing path record queries. Path information is often hard-coded, with QPN data exchanged out of band over sockets (often over Ethernet).
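The out-of-band bootstrap amounts to each rank sending its addressing info over an ordinary TCP socket before any IB traffic flows. A hypothetical wire format (all names illustrative, not taken from any particular MPI stack):

```c
#include <arpa/inet.h>  /* htons/ntohs, htonl/ntohl */
#include <stdint.h>
#include <string.h>

/* Hypothetical per-rank bootstrap record exchanged over a socket. */
struct oob_conn_info {
    uint16_t lid;   /* L2 address within the subnet */
    uint32_t qpn;   /* queue pair number */
    uint32_t psn;   /* initial packet sequence number */
};

/* Pack into a fixed 10-byte, network-byte-order buffer for send(). */
static void oob_pack(const struct oob_conn_info *ci, uint8_t buf[10])
{
    uint16_t lid = htons(ci->lid);
    uint32_t qpn = htonl(ci->qpn), psn = htonl(ci->psn);
    memcpy(buf,     &lid, 2);
    memcpy(buf + 2, &qpn, 4);
    memcpy(buf + 6, &psn, 4);
}

/* Unpack the peer's record on the receiving side. */
static void oob_unpack(const uint8_t buf[10], struct oob_conn_info *ci)
{
    uint16_t lid; uint32_t qpn, psn;
    memcpy(&lid, buf,     2);
    memcpy(&qpn, buf + 2, 4);
    memcpy(&psn, buf + 6, 4);
    ci->lid = ntohs(lid);
    ci->qpn = ntohl(qpn);
    ci->psn = ntohl(psn);
}
```

Note what is missing: SL, MTU, rate, and pkey, i.e. exactly the information a real path record query would have provided.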

We've seen problems running large MPI jobs without PR caching. I know that Silverstorm/QLogic did as well. And apparently Voltaire hit the same type of problem, since you added a caching module. (Did Mellanox and Topspin/Cisco create PR caches as well?) At least three companies working on IB came up with the same solution. What is the objection to the current patch set?

- Sean
_______________________________________________
general mailing list
[email protected]
http://lists.openfabrics.org/cgi-bin/mailman/listinfo/general
