I think you should take a look at filters in hbase. You can apply a filter such that the input to mapp-reduce would be data from only one industry.
On 02-Jan-2012, at 12:38, Karthik Pandian <[email protected]> wrote: > > I am crawling different industry data and storing the data into single hbase > table. For example I am crawling Electronics and Computer industries and > stored in a table called 'industry_tbl'. Now I want to run a map reduce on > the sets of data namely for Electronics and computer industries and produce > the reducer output with the different sets of data collected but currently > hbase is taking the entire data of both the industries and giving me the > reduced results which I cant differentiate by Industries. > > Any Help or idea on how to solve this? > > -- > View this message in context: > http://old.nabble.com/Hbase-Map-reduce-and-Index-tp33064563p33064563.html > Sent from the HBase User mailing list archive at Nabble.com. >
