Hi Qian,
I can give you an example of one of our systems that uses ZooKeeper
(crawling for our Yahoo! search engine). It has on the order of 4K clients,
expected to grow to 8K to 12K. Their write load is around 100 writes/sec
(this is pretty low) and 4K reads/sec (max reads) with an ensemble
Thanks Mahadev, that's a simple and elegant solution. I feel pretty
dumb for not thinking of it myself! :( It should be very straightforward
to implement too. We were using a database to store blobs and to
generate ids. I replaced the blob storage with Hadoop HDFS and the ids
with