Re: Bulkload Problem

John Sun, 20 Oct 2013 09:45:34 -0700

thanks for the answers!

I'm not sure if the if the table is pre-splitted, but I don't think so.
Here is the java code: http://pastebin.com/6V5CzasL .


So I think the splitting could be the reason why the region is busy, but
how can  I prevent this problem? Is there any configuration value in hbase
to wait longer? Maybe increase the repeat number from 10 to 10000 or
something like that? Which value is it?

@Ted: I'm not sure which region is causing the problem, there are 7 nodes
and  1 master, so i couldn't paste a spefic log.

kind regards




2013/10/20 Ted Yu <[email protected]>

> John:
> If you can pastebin region server log around 'Sun Oct 20 04:17:52', that
> would help too.
>
> Cheers
>
>
> On Sun, Oct 20, 2013 at 4:02 AM, Jean-Marc Spaggiari <
> [email protected]> wrote:
>
> > Hi John,
> >
> > Is your table pre-splitted?
> >
> > for me, sound like your RS is too busy doing other jobs to reply back to
> > the client.
> >
> > Multiple options.
> > 1) It's due to a long garbage collection. Can you monitor it on your
> > servers?
> > 2) It's because the table is not pre-split and the server is working on
> > that and taking time.
> >
> > How many servers to you have for this test?
> >
> > JM
> >
> >
> > 2013/10/20 John <[email protected]>
> >
> > > Hi,
> > >
> > > I try to load a big amount of data into a hbase cluster. I've imported
> > > successfully up to 3000 Millionen Datasets (KV Pairs). But if I try to
> > > import 6000 Millionen I got this error after 60-95% of the import:
> > > http://pastebin.com/CCp6kS3m ...
> > >
> > > The System is not crashing or anything like this, All nodes are still
> up.
> > > It seems to me that one node is temporarily not available. Maybe is it
> > > possibel to increase the repeat-number? (I think its default 10). What
> > > value do I have to change for that?
> > >
> > >
> > > I'm using Cloudera 4.4.0-1 and the Hbase version 0.94.6-cdh4.4.0
> > >
> > > regards,
> > >
> > > john
> > >
> >
>

Re: Bulkload Problem

Reply via email to