Re: Powerset + Hadoop @ Rapleaf

stack Sun, 30 Dec 2007 22:22:41 -0800

"There are also core design flaws. For example, they use threadedIO...This just won’t scale."

FYI, Kevin, hbase puts up non-blocking server sockets to field clientand intra-server communications (It uses Hadoop RPC). Client's ofHadoop's DFS -- e.g. mapreduce jobs, hbase, etc. -- use blockingthread-per-socket for swapping big data blocks. Reportedly, the latterhas been sufficient substrate supporting clusters of thousands of computers.

My guess is that when synchronous socket I/O becomes a bottleneck or agood case -- rather than a "gut feeling" -- can be made that this modelis overly consumptive, changing the HDFS servers to use async I/O willbecome a priority.


St.Ack



Kevin Burton wrote:

With all the activity over the holidays I forgot to post this to the list...


http://feedblog.org/2007/12/18/powerset-hadoop-rapleaf/

Re: Powerset + Hadoop @ Rapleaf

Reply via email to