Billy wrote:
..
What I am looking to do is get and store the input and output from/in hbase.
I haven't tried it, but it looks like you can specify the input and
output format classes for streaming with the -inputformat and
-outputformat options. Try setting these to TableInputFormat [1] and
TableOutputFormat [2] respectively.
Usual caveats apply: these hbase classes need to be either bundled into
your job jar (awkward in this case, since you are using the streaming
job jar) or available on the cluster CLASSPATH. Adding the
hadoop*hbase.jar to the lib directory across the cluster is probably the
easiest thing to do.
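Untested like the rest of this, but as a rough sketch the invocation
might be assembled as below. The Python helper only builds the command
line and does not run it. The fully qualified class names are taken from
the javadoc paths in [1] and [2]; the helper name, the jar name, and the
input, output, mapper and reducer values are hypothetical placeholders.
How the table and columns are actually communicated to these classes is
not covered here, so check the javadoc first.

```python
# Sketch only: builds (does not run) a Hadoop streaming command line.
# Class names come from the javadoc paths in [1] and [2].
HBASE_INPUT_FORMAT = "org.apache.hadoop.hbase.mapred.TableInputFormat"
HBASE_OUTPUT_FORMAT = "org.apache.hadoop.hbase.mapred.TableOutputFormat"


def build_streaming_command(streaming_jar, input_path, output_path,
                            mapper, reducer):
    """Return the argv list for a streaming job using the hbase formats.

    Hypothetical helper: every argument is a placeholder supplied by
    the caller, not a value known to work on a given cluster.
    """
    return [
        "hadoop", "jar", streaming_jar,
        "-input", input_path,
        "-output", output_path,
        "-mapper", mapper,
        "-reducer", reducer,
        "-inputformat", HBASE_INPUT_FORMAT,
        "-outputformat", HBASE_OUTPUT_FORMAT,
    ]


if __name__ == "__main__":
    # Placeholder values, assumed for illustration only.
    cmd = build_streaming_command(
        "hadoop-streaming.jar", "IN", "OUT", "mapper.py", "reducer.py"
    )
    print(" ".join(cmd))
```

The resulting list can be passed to subprocess.run once the hbase jar
is on the cluster CLASSPATH as described above.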
St.Ack
[1] http://lucene.zones.apache.org:8080/hudson/job/Hadoop-Nightly/javadoc/org/apache/hadoop/hbase/mapred/TableInputFormat.html
[2] http://lucene.zones.apache.org:8080/hudson/job/Hadoop-Nightly/javadoc/org/apache/hadoop/hbase/mapred/TableOutputFormat.html