I tried using streaming to dump into a hbase table. As things are
currently written, it unfortunately won't work. Streaming would seem to
presume keys and values of type Text whereas the TableOutputFormat takes
a key of type Text but the value is expected to be MapWritable (where
the keys are column names). Even if you could do types other than Text
in Streaming, an MW type is awkward for php/python, etc., to conpose.
Regards your question as to how php might access hbase, at the moment
your options are few:
+ There is the Edward Yoon patch that you've already tripped over,
hadoop-2171. It puts up an IPC server that fields HQL strings. The
server does the HQL parse and forwards the interpreted request to the
hbase cluster. Included is a first cut at php code that is capable of
making the basic method call against the remote java IPC server.
+ If traffic is light and your requests are read-only, there is the HQL
page in the master's webui.
If hbase had a REST interface, hadoop-2068, would that work for you?
St.Ack
Billy wrote:
Can you show me an example on how that would be down with the command line?
"Michael Stack" <[EMAIL PROTECTED]> wrote in
message news:[EMAIL PROTECTED]
Billy wrote:
..
What I am looking to do is get and store the input and output from/in
hbase.
I haven't tried it but it looks like you can specify input and output
classes for streaming with -inputformat and -outputformat options.
Try setting these to TableInputFormat [1] and TableOutputFormat [2]
respectively.
Usual caveats apply: These hbase classes need to be either bundled into
your job jar -- awkward in this case since you are using the streaming job
jar -- or they need to be on the cluster CLASSPATH (Add the
hadoop*hbase.jar to lib directory across the cluster is probably easiest
thing to do).
St.Ack
1.
http://lucene.zones.apache.org:8080/hudson/job/Hadoop-Nightly/javadoc/org/apache/hadoop/hbase/mapred/TableInputFormat.html
2.
http://lucene.zones.apache.org:8080/hudson/job/Hadoop-Nightly/javadoc/org/apache/hadoop/hbase/mapred/TableOutputFormat.html