This sounds an awful lot like a map-only MR job... With Hadoop Streaming, you should be able to achieve your goal of piping to an arbitrary process.
On Tue, Aug 19, 2014 at 4:26 PM, Demai Ni <[email protected]> wrote: > Dear experts , > > I understand that I can do a simple command like: > > echo "scan 'table1'"| hbase she'll > myoutput > > This scenario i am thinking is to: > 1) output to local file system(like Linux ) instead of hdfs > 2) each regserver only output its only data to it's node's file system > > To elaborate the 2) a bit. Basically, this will be like export Hbase data > to local file system without going through network. And on each node, one > file will be created. > > Is there a way to achieve it? Actually the receiving side of 1) doesn't > have to be a file system , it can be another process to process the data. > But let's use file system to simplify the scenario for now. > > Thanks > > Demai on the run
