Dru Jensen wrote:
Stack,
Sorry for the confusion; I am not using the old implementation of
TableReduce. The new 0.19.0 changed this to an interface. The reduce
process is performing calculations; it's not just writing to the
table, and it requires the sort.
Or try running with even more reducers so loading is spread more evenly?
I will change the region size back and see if that helps. If I find
that I need a larger region, should I change the flush by the same
multiple?
Yes.
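For example, doubling the region max filesize from the 256MB default to
512MB would mean doubling the flush size from 64MB to 128MB as well. A
sketch of the hbase-site.xml overrides; the property names below are
from the 0.19-era hbase-default.xml, so verify them against your
release:

```xml
<!-- Sketch only: double the default region max filesize (256MB -> 512MB)
     and scale the flush size by the same multiple (64MB -> 128MB). -->
<property>
  <name>hbase.hregion.max.filesize</name>
  <value>536870912</value>
</property>
<property>
  <name>hbase.hregion.memcache.flush.size</name>
  <value>134217728</value>
</property>
```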
St.Ack
thanks,
Dru
On Oct 23, 2008, at 2:18 PM, stack wrote:
Any reason you need to use TableReduce? If you delay the insert into
hbase till reduce-time, it means 1) the MR framework has spent a
bunch of resources shuffling and sorting your data, a sort that is
going to happen on hbase insert anyway, and 2) your inserts are
going into hbase in row order, so you pound one region rather than
inserting across all of them. You might try inserting into hbase at
the tail of your map task and outputting nothing (or something small
to keep up the job counters).
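As a rough illustration of that pattern (a sketch only, against the
0.19-era HTable/BatchUpdate client API; the table name, column, and
row-key extraction are placeholders, and it needs the hadoop and hbase
jars on the classpath):

```java
import java.io.IOException;

import org.apache.hadoop.hbase.HBaseConfiguration;
import org.apache.hadoop.hbase.client.HTable;
import org.apache.hadoop.hbase.io.BatchUpdate;
import org.apache.hadoop.io.LongWritable;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapred.JobConf;
import org.apache.hadoop.mapred.MapReduceBase;
import org.apache.hadoop.mapred.Mapper;
import org.apache.hadoop.mapred.OutputCollector;
import org.apache.hadoop.mapred.Reporter;

// Sketch: do the work and the hbase insert inside the map task and emit
// nothing, so the MR shuffle/sort is skipped and writes hit regions in
// input order rather than sorted row-key order.
public class MapSideUpload extends MapReduceBase
    implements Mapper<LongWritable, Text, Text, Text> {

  private HTable table;

  public void configure(JobConf job) {
    try {
      // "mytable" is a placeholder table name.
      table = new HTable(new HBaseConfiguration(), "mytable");
    } catch (IOException e) {
      throw new RuntimeException(e);
    }
  }

  public void map(LongWritable key, Text value,
      OutputCollector<Text, Text> collector, Reporter reporter)
      throws IOException {
    // Hypothetical: first tab-separated field of the line is the row key.
    String row = value.toString().split("\t", 2)[0];
    BatchUpdate update = new BatchUpdate(row);
    update.put("contents:line", value.toString().getBytes()); // placeholder column
    table.commit(update);
    // Emit nothing; a reporter counter can track progress instead.
  }
}
```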
Are your rows > 256MB? At the moment at least, there needs to be a
bit of balance maintained between flushing, compacting and
splitting. The defaults do that. I'm not sure what happens when you
double the max filesize but not correspondingly the flushsize. You
might try restoring the defaults (hbase will not try to split a
row if it's > the configured max filesize).
St.Ack