On Thu, 25 Sep 2008, Howard Chu wrote:
> Brett @Google wrote:
>> I was wondering if anybody is using syncrepl in the context of a
>> hardware content switch or redundant environment.
> Yes.
>> I am considering the edge case where a connection is redirected to a
>> client, and:
>> a) client has no current data (new node introduced)
>> b) client decides it needs to do a full refresh - perhaps it was down
>> and missed a large number of updates
> Yes, you need to keep all servers identical (as much as practical).
> Seems to me that such a switch really isn't useful here. Also, if you're
> running an LDAP service where the network fabric can actually sustain
> more traffic than your LDAP servers, you've done something very strange.
> Considering that a dual-socket quad-core server running OpenLDAP can
> saturate a gigabit ethernet, I don't see how you can load-balance beyond
> that. The content switch will become the bottleneck.
It's not so much about saturating the wire (although our current switches
do 2Gbps each, I'm sure the next ones will be on the order of 6-8Gbps
each, and we use more than one). It's about service availability -- taking
down a slave and having everything else converge onto the remaining slaves
in well under a second. A load balancer handles this much faster than the
vast majority of clients configured with multiple servers, and there are
no client delays while they vainly retry servers that are down. You also
don't have to worry about software that only allows you to configure a
single server.
> If you're bringing up a brand new replica, just use a separate (virtual,
> if necessary) network interface while it's bootstrapping, and don't
> enable the main interface until it's caught up.
This is essentially what we do. We start with slapadd -q from a recent
LDIF. Then, to catch "late breaking changes," we run slapd -h ldapi:///.
During both of these procedures, there's nothing listening on the network,
so the load balancer marks the node as failed. Once the contextCSNs appear
in sync (discussed at length in the archives), we restart slapd with its
network listeners.
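In sketch form, the sequence above might look like this. Every path, base
DN, and hostname here is a placeholder (not our actual configuration), and
the contextCSN comparison is deliberately crude:

```shell
# Hypothetical bootstrap of a new replica; all paths, DNs, and hostnames
# below are placeholders.

# 1. Bulk-load the database from a recent LDIF dump.
#    -q skips consistency checking for speed.
slapadd -q -f /etc/openldap/slapd.conf -l /backups/recent.ldif

# 2. Start slapd listening only on the local IPC socket. Nothing is
#    listening on the network, so the load balancer marks the node failed.
slapd -h ldapi:/// -f /etc/openldap/slapd.conf

# 3. Wait until this replica's contextCSN matches the provider's.
#    (Crude: with multiple serverIDs you would compare per-SID values.)
csn() { ldapsearch -x -LLL -H "$1" -s base -b "dc=example,dc=com" contextCSN; }
until [ "$(csn ldapi:///)" = "$(csn ldap://master.example.com)" ]; do
    sleep 5
done

# 4. Stop the IPC-only instance and restart with network listeners;
#    the load balancer will then mark the node healthy again.
kill "$(cat /var/run/slapd/slapd.pid)"
slapd -h "ldap:/// ldapi:///" -f /etc/openldap/slapd.conf
```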
Strictly speaking, you could use that contextCSN comparison as a custom
load balancer health check. This might be a bit dangerous, though, since
syncrepl only guarantees eventual convergence: it's theoretically possible
that all your slaves would fail out during a particularly large refresh.
You'll have to decide for yourself whether it's more dangerous to serve
stale data or to serve no data. We don't do this, because we'd rather be
serving stale.
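For the record, such a check could be sketched as below. The hostnames and
base DN are placeholders, and the exact-match test is intentionally strict,
which is precisely the all-slaves-fail-at-once risk just described:

```shell
#!/bin/sh
# Hypothetical external health check for a load balancer: exit 0 (healthy)
# only when this replica's contextCSN values match the provider's exactly.
# Hostnames and base DN are placeholders; with multiple serverIDs this
# compares the full sorted set of per-SID values.
BASE="dc=example,dc=com"

csn() {
    ldapsearch -x -LLL -H "$1" -s base -b "$BASE" contextCSN |
        grep '^contextCSN:' | sort
}

provider=$(csn ldap://master.example.com)
replica=$(csn ldapi:///)

# Fail if either lookup came back empty, or if the CSN sets differ.
# Exact match is strict: during a large refresh every replica could fail
# this check simultaneously, leaving the pool serving no data at all.
[ -n "$provider" ] && [ "$provider" = "$replica" ]
```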