Re: NotServingRegionException - Map/Reduce process fails

Dru Jensen Thu, 23 Oct 2008 15:44:55 -0700

Stack,

Sorry for the confusion, I am not using the old implementation of TableReduce. The new 0.19.0 changed this to an interface. The reduce process is performing calculations. It's not just writing to the table and requires the sort.

I will change the region size back and see if that helps. If I find that I need a larger region, should I change the flush size by the same multiple?


thanks,
Dru

On Oct 23, 2008, at 2:18 PM, stack wrote:

Any reason you need to use TableReduce? If you delay the insert into hbase till reduce-time, it means 1) the MR framework has spent a bunch of resources shuffling and sorting your data, a sort that is going to happen on hbase insert anyway, and 2) your inserts are going into hbase in order, so you pound one region rather than insert across all. You might try inserting into hbase at the tail of your map task and output nothing (or something small to keep up the job counters).
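To illustrate point 2, here is a minimal sketch, not HBase code: it models regions as key ranges and compares how long a streak of consecutive writes stays on one region when keys arrive sorted (as after a reduce-side sort) versus in arbitrary order (as from map tasks). The split keys, the row-key format, and both helper functions are hypothetical.

```python
import bisect
import random

def region_for(row_key, split_keys):
    """Return the index of the region whose key range contains row_key.

    split_keys is the sorted list of region boundary keys (hypothetical).
    """
    return bisect.bisect_right(split_keys, row_key)

def longest_run_on_one_region(keys, split_keys):
    """Length of the longest streak of consecutive writes hitting one region."""
    longest = 0
    run = 0
    previous = None
    for key in keys:
        region = region_for(key, split_keys)
        run = run + 1 if region == previous else 1
        longest = max(longest, run)
        previous = region
    return longest

# Hypothetical table with four regions and 1000 row keys.
split_keys = ["row250", "row500", "row750"]
keys = [f"row{i:03d}" for i in range(1000)]

# Sorted order: every write for a region arrives back to back.
sorted_run = longest_run_on_one_region(sorted(keys), split_keys)

# Arbitrary order: writes are spread across regions.
shuffled = keys[:]
random.Random(0).shuffle(shuffled)
shuffled_run = longest_run_on_one_region(shuffled, split_keys)

print("longest single-region streak, sorted keys:  ", sorted_run)
print("longest single-region streak, shuffled keys:", shuffled_run)
```

In the sorted case a whole region's worth of writes lands on one region before the next one sees any traffic, which is the "pound one region" effect described above.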
Are your rows > 256MB? At the moment at least, there needs to be a bit of balance maintained between flushing, compacting and splitting. The defaults do that. I'm not sure what happens when you double the max filesize but not correspondingly the flush size. You might try restoring the default (hbase will not try to split a row if it's > configured max filesize).
St.Ack
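
On the question of changing the flush size "by the same multiple": the arithmetic might be sketched as below. This is only an illustration of keeping the default ratio, not a tested tuning recommendation. The 256 MB max filesize is the default implied above; the 64 MB flush size is an assumption to check against the hbase-default.xml of your version, and the function name is hypothetical.

```python
MB = 1024 * 1024

# Default region split threshold implied by the thread (256 MB).
DEFAULT_MAX_FILESIZE = 256 * MB
# Assumed default flush threshold; verify against your hbase-default.xml.
DEFAULT_FLUSH_SIZE = 64 * MB

def scaled_flush_size(new_max_filesize,
                      base_max=DEFAULT_MAX_FILESIZE,
                      base_flush=DEFAULT_FLUSH_SIZE):
    """Return a flush size that keeps the base flush-to-filesize ratio."""
    if new_max_filesize <= 0:
        raise ValueError("max filesize must be positive")
    return new_max_filesize * base_flush // base_max

# Doubling the max filesize doubles the flush size under this rule.
doubled = scaled_flush_size(512 * MB)
print(doubled // MB, "MB")
```

Under these assumptions, a 512 MB max filesize would pair with a 128 MB flush size, preserving the 4:1 ratio of the defaults.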
