I have one more doubt. In the examples given at http://wiki.apache.org/hadoop/Hbase/MapReduce, some of the code is written with only Map classes and no Reduce classes. What I understood is that a map task is generated for every region server and that it operates on the data present on that region server. Is this idea right?

On Wed, Jul 22, 2009 at 9:58 PM, Erik Holstad <[email protected]> wrote:

> Hi Bharath!
> One of the main benefits of using HBase is that it gives you random access
> to your data. The main goal is not to use it for big batch processing jobs
> going through all or a lot of your data, even though hooks into MapReduce
> jobs give you that option.
>
> So whenever you fetch data using get and scan, that data is brought to the
> client for you to process it there. When using HBase as the source or sink
> in an MR job, this is not the case.
>
> What access patterns do you have to your data? Are you doing a lot of
> random reads or mostly batch processing of data?
>
> Regards Erik
