Re: Multiple ZK clusters or a single, shared cluster?

Jonathan Gray Fri, 17 Jul 2009 14:41:15 -0700

Yup.

I would say that the use of ZK by HBase today is very minimal. Very few writes at all, almost exclusively reads and still not that often. Dedicated resources would not make much of a difference.

For users who are just testing, developing, or running clusters that are not highly loaded, it is not as important to have dedicated nodes _yet_.

In the next version of HBase, 0.21, we are planning to greatly expand what we use ZK for. At that time, we will be writing much more often, and reading constantly. Dedicated nodes/disks would be highly recommended at that point.


Thanks for the explanation.

JG

Benjamin Reed wrote:

you need a dedicated disk for the logDir, but not the dataDir. the reason is that the write to the log is in the critical path: we cannot commit changes until they have been synced to disk, so we want to make sure that we don't contend for the disk. the snapshots in the dataDir are done in an asynchronous thread, so it can be written to a disk used by other applications (usually the OS disk).
the problem with running other processes on the zookeeper server is similar to the disk contention: if zookeeper starts contending with a runaway app for CPU and memory, we can start timing out because of starvation.
by using a dedicated disk for logs and a dedicated machine you can get very high deterministic performance.
make sense?

ben
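
As a rough illustration of the logDir/dataDir split described above: in a standard zoo.cfg, snapshots go to `dataDir` and the transaction log goes to `dataLogDir` (what the message calls logDir); when `dataLogDir` is not set, the log is written under `dataDir`. The sketch below only checks whether a config gives the log its own directory. The helper names and both paths are hypothetical, and a distinct directory does not by itself prove a distinct physical disk.

```python
def parse_zoo_cfg(text):
    """Parse key=value lines from zoo.cfg-style text into a dict."""
    cfg = {}
    for raw in text.splitlines():
        line = raw.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and comments
        key, sep, value = line.partition("=")
        if sep:
            cfg[key.strip()] = value.strip()
    return cfg


def log_has_own_directory(cfg):
    """True if the transaction log is configured apart from the snapshots."""
    data_dir = cfg.get("dataDir")
    # Without dataLogDir, the transaction log lands in dataDir.
    log_dir = cfg.get("dataLogDir", data_dir)
    return log_dir is not None and log_dir != data_dir


# Hypothetical example config; the paths are placeholders, not recommendations.
SAMPLE = """\
tickTime=2000
clientPort=2181
# snapshots: written asynchronously, a shared disk is acceptable
dataDir=/var/lib/zookeeper
# transaction log: on the critical path, ideally its own disk
dataLogDir=/zk-txlog
"""

print(log_has_own_directory(parse_zoo_cfg(SAMPLE)))
```

Whether `/zk-txlog` really sits on a dedicated device still has to be verified on the host itself, for example by looking at the mount table.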

Jonathan Gray wrote:
Thanks for the input.
Honestly, I'm thinking I need to have separate clusters. The version of ZK is one thing; but also for an application like HBase, we have had periods where we needed to patch ZK before it became part of a release. Keeping track of that on a shared cluster will be tricky, if not impossible.
And with a small development team and a very fast dev cycle, I'm a little concerned about a runaway application hosing all the other dependencies on ZK...
What are the actual reasons for wanting a separate disk for ZK? Strictly reliability purposes? Should that disk be dedicated to the logDir but not the dataDir, or both?
If I don't give it a dedicated disk or node, but it has 1GB of memory and a core, what are the downsides? Are they just about reliability? If I could run 5 or 7 zk nodes, but co-hosted with my HBase cluster, is that really less reliable than 3 separate nodes, as long as the jvm has sufficient resources? Or are there performance or usability concerns as well?
Sorry for all the questions, just trying to get the story straight so that we don't spread misinformation to HBase users. Most users start out on very small clusters, so dedicated ZK nodes are not a realistic assumption... How big of a deal is that?
JG

Benjamin Reed wrote:
we designed zk to have high performance so that it can be shared by multiple applications. the main thing is that you use dedicated zk machines (with a dedicated disk for logging). once you have that in place, watch the load on your cluster; as long as you aren't saturating the cluster you should share.
as you point out running multiple clusters is a hardware investment, plus you miss out on opportunities to improve reliability. for example, if you have three applications that have a cluster of 3 zk servers each, one failure will result in an outage. if instead of using the 9 servers you have the same three applications use a zk cluster with 7 servers you can tolerate three failures without an outage.
the key of course is to make sure that you don't oversubscribe the server.
ben
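
The ensemble-size comparison above rests on majority-quorum arithmetic, which can be sketched in a few lines. This assumes the usual rule that an ensemble of n servers stays available as long as a strict majority of them is up; the function names are illustrative only.

```python
def quorum_size(ensemble_size):
    """Smallest number of servers that forms a strict majority."""
    return ensemble_size // 2 + 1


def tolerated_failures(ensemble_size):
    """Servers that can fail while a strict majority still remains."""
    return ensemble_size - quorum_size(ensemble_size)


for n in (3, 5, 7):
    print(n, "servers: quorum", quorum_size(n),
          "tolerated failures", tolerated_failures(n))
```

Even ensemble sizes add no extra tolerance under this rule (4 servers tolerate the same single failure as 3), which is why odd sizes are the usual choice.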

Jonathan Gray wrote:
Hey guys,
Been using ZK indirectly for a few months now in the HBase and Katta realms. Both of these applications make it really easy so you don't have to be involved much with managing your ZK cluster to support it.
I'm now using ZK for a bunch of things internally, so now I'm manually configuring, starting, and managing a cluster.
What advice is there about whether I should be sharing a single cluster between all my applications, or running separate ones for each use?
I've been told that it's strongly recommended to run your ZK nodes separately from the application using them (this is actually what we're telling new users over in HBase, though a majority of installations will likely co-host them with DataNodes and RegionServers).
I don't have the resources to maintain a separate 3+ node ZK cluster for each of my applications, so this is not really an option. I'm trying to decide if I should have HBase running/managing its own ZK cluster that is co-located with some of the regionservers (there will be ample memory, but ZK will not have a dedicated disk), or if I should be pointing it to a dedicated 3 node ZK cluster.
I would then also have Katta pointing at this same shared cluster (or a separate cluster would be co-located with katta nodes). Same for my application; it could share nodes with the app servers or point at a single ZK cluster.
Trade-offs I should be aware of?  Current best practices?

Any help would be much appreciated.  Thanks.

Jonathan Gray
