Hi Greg, As for cross datacenters, we have tested zookeeper cross data centers and it works fine. The only thing is that you might have to tweak synclimit and tickTime to a little higher values for Zookeeper.
http://hadoop.apache.org/zookeeper/docs/r3.1.0/zookeeperAdmin.html#sc_config uration provides documentation on these parameters. As for communication protocol within zookeeper servers, we currently use raw tcp sockets to send and receive data. I cannot estimate the time it would take the time for it to use https, but it wont be just a week of work for sure. Also, the client to zookeeper server is raw tcp again. We would certainly like to have security in Zookeeper. Currently, Hadoop Core is also working on getting security in place. We plan to have a similar security model as theirs (I think they are looking at kerberos -- not sure) . mahadev On 2/26/09 10:24 AM, "bebble zap" <beeble...@gmail.com> wrote: > We're thinking about using Zookeeper as our coordination service and > also for doing group membership in the Amazon Cloud. Currently our > applications are deployed in Amazon Cloud on multiple availability > zones (i.e. data centers), so this means that ZK nodes will be talking > across datacenters. I'm assuming that the additional latency from > going across datacenters shouldn't be too big of an issue. Also, we > are paranoid about security in the cloud, so we'd like to use https as > the communications protocol for Zookeeper -- not sure if this is a > trivial thing to do or not. Wondering if anyone's already doing this > today or whether Zookeeper is not the right solution given our > environment currently. > > Thanks > Greg