I have one more question.
In the examples given on the site

http://wiki.apache.org/hadoop/Hbase/MapReduce

some of the jobs are written so that they have only a Map class
and no Reduce class.
What I understood is that a map task is generated for every regionserver, and it
operates on the data present on that regionserver.
Is this idea right?


On Wed, Jul 22, 2009 at 9:58 PM, Erik Holstad <[email protected]> wrote:

> Hi Bharath!
> One of the main benefits of using HBase is that it gives you random access
> to your data. The main goal is not to
> use it for big batch processing jobs going through all or a lot of your
> data, even though the hooks into MapReduce jobs
> give you that option.
>
> So whenever you fetch data using get and scan, that data is brought to the
> client for you to process there. When using
> HBase as the source or sink in an MR job, this is not the case.
>
> What access patterns do you have for your data? Are you doing a lot of
> random reads or mostly batch processing of
> data?
>
> Regards, Erik
>
