Re: Reagrding HBase Hadoop multiple scan objects issue

2013-01-19 Thread Ted Yu
Have you subscribed to user mailing list ? Please do not mix email for user@ and subscription. Some email system would regard messages from amazon.com as unverifiable and put them in Spam folder. What HBase version are you using ? bq. it's inefficient to have one scan object to scan everything

Reagrding HBase Hadoop multiple scan objects issue

2013-01-18 Thread Xu, Leon
Hi HBase users, I am currently trying to set up a denormalization map-reduce job for my HBase Table. Since our table contains large volume of data, it's inefficient to have one scan object to scan everything. We are only need to process those records that have changes. I am planning to have

Re: Reagrding HBase Hadoop multiple scan objects issue

2013-01-18 Thread Doug Meil
Hi there- You probably want to review this section of the RegGuide: http://hbase.apache.org/book.html#mapreduce re: it's inefficient to have one scan object to scan everything. It is. But in the MapReduce case, there is a Map-task for each input split (see the RefGuide for details), and