Nick, 

Thanks for the quick response. I will definitely look into Hadoop 
Streaming. 
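
A minimal sketch of the mapper side of that idea, assuming Python is available on every node and that the streaming job delivers one record per line on stdin. The function name, the output directory /tmp/hbase_dump, and the one-file-per-hostname layout are all made up for illustration; nothing here comes from HBase or Hadoop itself.

```python
import os
import socket


def dump_to_local(lines, out_dir="/tmp/hbase_dump", host=None):
    """Append each input record to one local file per host; return its path.

    out_dir and the <hostname>.txt naming are hypothetical choices.
    """
    host = host or socket.gethostname()
    os.makedirs(out_dir, exist_ok=True)
    path = os.path.join(out_dir, "%s.txt" % host)
    # Append mode, so several tasks on the same node add to the same file.
    with open(path, "a") as out:
        for line in lines:
            # Normalize so every record ends with exactly one newline.
            out.write(line if line.endswith("\n") else line + "\n")
    return path
```

As the mapper of a map-only job, the script would just call dump_to_local(sys.stdin). Two caveats: a map task usually, but not always, runs on the node hosting its region, so some rows may still cross the network; and concurrent tasks appending to the same file may interleave their records.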

What do you think about AggregationClient? It is carried out at the 
region/region server level. Maybe, instead of doing a count/min/avg, a method 
could be used to write the data out to the local file system? 

Demai on the run

On Aug 19, 2014, at 5:04 PM, Nick Dimiduk <[email protected]> wrote:

> This sounds an awful lot like a map-only MR job... With Hadoop Streaming,
> you should be able to achieve your goal of piping to an arbitrary process.
> 
> 
> On Tue, Aug 19, 2014 at 4:26 PM, Demai Ni <[email protected]> wrote:
> 
>> Dear experts,
>> 
>> I understand that I can do a simple command like:
>> 
>> echo "scan 'table1'" | hbase shell > myoutput
>> 
>> The scenario I am thinking of is to:
>> 1) output to the local file system (like Linux) instead of HDFS
>> 2) have each region server output only its own data to its node's file system
>> 
>> To elaborate on 2) a bit: basically, this would be like exporting HBase data
>> to the local file system without going through the network. And on each node,
>> one file would be created.
>> 
>> Is there a way to achieve this? Actually, the receiving side of 1) doesn't
>> have to be a file system; it can be another process that consumes the data.
>> But let's use a file system to simplify the scenario for now.
>> 
>> Thanks
>> 
>> Demai on the run
