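I guess your hbase.hregion.max.filesize is quite high. If possible, lower its value so that you have smaller, and therefore more, regions. A MapReduce job over an HBase table normally gets one map task per region, so with only a few large regions most of your CPUs sit idle during the count.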
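
To put rough numbers on this, here is a small Python sketch of the arithmetic. The row count, runtime, and cluster size come from your mail; everything else is an assumption on my side: one map task per region, one map slot per CPU (80 slots in total), and region counts of 20 and 200 that are purely hypothetical, since I don't know how many regions your table has.

```python
import math


def rows_per_second(total_rows, hours):
    """Average scan throughput of the whole job."""
    return total_rows / (hours * 3600)


def slot_utilization(num_regions, map_slots):
    """Fraction of map slots that can be busy at once,
    assuming one map task per region."""
    return min(num_regions, map_slots) / map_slots


def map_waves(num_regions, map_slots):
    """Number of sequential rounds of map tasks needed to cover all regions."""
    return math.ceil(num_regions / map_slots)


if __name__ == "__main__":
    total_rows = 500_000_000   # from the original mail
    map_slots = 10 * 8         # assumption: one map slot per CPU on 10 nodes

    for hours in (4, 5):
        rate = rows_per_second(total_rows, hours)
        print(f"{hours} h -> about {rate:,.0f} rows/s for the whole cluster")

    # Hypothetical region counts, only to illustrate the effect.
    for regions in (20, 200):
        used = slot_utilization(regions, map_slots)
        waves = map_waves(regions, map_slots)
        print(f"{regions} regions: {used:.0%} of slots busy, {waves} wave(s)")
```

If the table turns out to have fewer regions than the cluster has map slots, that alone would explain a good part of the long runtime. Check the real region count in the master web UI before changing anything.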

On Sun, Oct 9, 2011 at 7:50 AM, Rita <[email protected]> wrote:
> Hi,
>
> I have been doing a rowcount via mapreduce and its taking about 4-5 hours to
> count a 500million rows in a table. I was wondering if there are any map
> reduce tunings I can do so it will go much faster.
>
> I have 10 node cluster, each node with 8CPUs with 64GB of memory. Any tuning
> advice would be much appreciated.
>
> --
> --- Get your facts first, then you can distort them as you please.--
