hank williams wrote:
I have been meaning to ask a similar question, but perhaps a bit more
broadly about the status and anticipated timeline for hbase. I am
curious if the effort is purely individual or if there is any
corporate push (for example from powerset) or if it is just a personal
project.
Regarding the state of HBase, I just made updates to the HBase home page
news section and edited the tail of the architecture page where it talks
of current status. JIRA is -- or should be -- the best source for what
is currently being worked on.
Broadly, from my point of view:
+ I am able to start up a cluster of 4 nodes and run simple sustained
loads without the cluster falling over.
+ I'm currently working on loading it with a bunch of web pages with
columns to hold original content, parsed content, etc. It looks like
there is sufficient functionality currently implemented for me to do
this (we'll see).
+ It's slow (see the PerformanceEvaluation page in the wiki).
+ It's fragile (e.g., HADOOP-1516).
Regarding the timeline, Powerset, where both Jim Kellerman and I work,
would love to have a working HBase as soon as possible (smile).
Jim and I are both working on HBase near full-time. Others are also
making good contributions.
But, IMO, HBase is a big job and it's going to take a load of work by a
bunch of folks to make it robust at scale.
Also, as I understand it from the wiki, hbase works well on a single
machine but scaling is being worked out. Is there any sense of an
anticipated (or desired) target date for a scalable version?
Adding nodes to an HBase cluster currently lets the cluster carry more
load and handle more requests. In a sense, with caveats
-- some of which have been mentioned above while others are yet to be
discovered -- HBase scales now.
James Kennedy wrote:
"We need easy interfaces to MapReduce jobs, so they can scan tables.
We have been contacted by Vuk Ercegovac <vercego AT SPAMFREE us DOT
ibm DOT com> of IBM Almaden Research who expressed an interest in
working on an HBase interface to Hadoop map/reduce."
Does anyone know what the status of this is?
I do not know of any work that has been done on the above. I made an
issue for it (HADOOP-1519).
St.Ack
Regards
Hank
On 6/21/07, James Kennedy <[EMAIL PROTECTED]> wrote:
From Wiki:
"We need easy interfaces to MapReduce jobs, so they can scan tables. We
have been contacted by Vuk Ercegovac
<vercego AT SPAMFREE us DOT ibm DOT com> of IBM Almaden Research who
expressed an interest in working on an HBase interface to Hadoop
map/reduce."
Does anyone know what the status of this is?