I think you should take a look at filters in hbase. You can apply a filter such 
that the input to mapp-reduce would be data from only one industry. 

On 02-Jan-2012, at 12:38, Karthik Pandian <[email protected]> wrote:

> 
> I am crawling different industry data and storing the data into single hbase
> table. For example I am crawling Electronics and Computer industries and
> stored in a table called 'industry_tbl'. Now I want to run a map reduce on
> the sets of data namely for Electronics and computer industries and produce
> the reducer output with the different sets of data collected but currently
> hbase is taking the entire data of both the industries and giving me the
> reduced results which I cant differentiate by Industries.
> 
> Any Help or idea on how to solve this?
> 
> -- 
> View this message in context: 
> http://old.nabble.com/Hbase-Map-reduce-and-Index-tp33064563p33064563.html
> Sent from the HBase User mailing list archive at Nabble.com.
> 

Reply via email to