Billy wrote:
..

What I am looking to do is get and store the input and output from/in hbase.
I haven't tried it, but it looks like you can specify input and output format classes for streaming with the -inputformat and -outputformat options.

Try setting these to TableInputFormat [1] and TableOutputFormat [2] respectively.

The usual caveats apply: these hbase classes need to be either bundled into your job jar (awkward in this case, since you are using the streaming job jar) or be on the cluster CLASSPATH. Adding the hadoop*hbase.jar to the lib directory across the cluster is probably the easiest thing to do.
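
As a rough, untested sketch of what such an invocation might look like, the Python helper below just assembles the streaming command line with the two format classes plugged in. The helper name, the jar name, the paths and the mapper/reducer scripts are all made up for illustration; only the -inputformat/-outputformat options and the two class names come from the above. It does not cover whatever table or column settings the hbase formats may need in the job configuration.

```python
import shlex

# Fully qualified class names, taken from the javadoc links [1] and [2].
INPUT_FORMAT = "org.apache.hadoop.hbase.mapred.TableInputFormat"
OUTPUT_FORMAT = "org.apache.hadoop.hbase.mapred.TableOutputFormat"


def build_streaming_command(streaming_jar, input_path, output_path, mapper, reducer):
    """Return the argv list for a streaming job that uses the hbase formats.

    Hypothetical helper: it only builds the command, it does not run it.
    """
    return [
        "hadoop", "jar", streaming_jar,
        "-input", input_path,
        "-output", output_path,
        "-mapper", mapper,
        "-reducer", reducer,
        "-inputformat", INPUT_FORMAT,
        "-outputformat", OUTPUT_FORMAT,
    ]


if __name__ == "__main__":
    # All argument values here are placeholders, not real paths.
    cmd = build_streaming_command(
        "hadoop-streaming.jar", "input_table", "output_table",
        "mapper.py", "reducer.py",
    )
    # Print a shell-quoted line that could be pasted into a terminal.
    print(" ".join(shlex.quote(arg) for arg in cmd))
```

Whether the hbase formats hand streaming something it can turn into text lines for the mapper is exactly the part I haven't tried, so treat this as a starting point only.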

St.Ack

1. http://lucene.zones.apache.org:8080/hudson/job/Hadoop-Nightly/javadoc/org/apache/hadoop/hbase/mapred/TableInputFormat.html
2. http://lucene.zones.apache.org:8080/hudson/job/Hadoop-Nightly/javadoc/org/apache/hadoop/hbase/mapred/TableOutputFormat.html
