Re: Bulk import tools for HBase

Sean Bigdatafun Mon, 11 Oct 2010 14:34:00 -0700

Another potential "problem" with the incremental bulk loader is that the
number of reducers (for the bulk loading process) needs to equal the number
of existing regions. This seems infeasible for a very large table, say one
with 2000 regions.
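
To make the concern concrete, here is a rough Python sketch of the idea as I
understand it. It assumes the loader partitions row keys by the table's
current region start keys, so that each reducer writes output for exactly one
region. The function name, start keys, and rows below are all made up for
illustration; this is not the HBase implementation.

```python
from bisect import bisect_right


def region_index(row_key, region_start_keys):
    """Return the index of the region whose key range contains row_key.

    region_start_keys must be sorted, and the first region starts at b"".
    A region's start key is inclusive, so a row equal to a start key
    belongs to the region that begins there.
    """
    return bisect_right(region_start_keys, row_key) - 1


# Hypothetical table with four regions, identified by their start keys.
region_start_keys = [b"", b"g", b"n", b"t"]

# Hypothetical row keys to be bulk loaded.
rows = [b"apple", b"grape", b"melon", b"nectarine", b"zucchini"]

# Group rows by target region; each group stands in for one reducer's input.
partitions = {}
for row in rows:
    partitions.setdefault(region_index(row, region_start_keys), []).append(row)

# Under the one-reducer-per-region assumption, the job needs this many reducers.
num_reducers = len(region_start_keys)
```

With 2000 regions, `region_start_keys` would have 2000 entries and the job
would need 2000 reducers under this assumption.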


Any comment on this? Thanks.

Sean

On Fri, Oct 8, 2010 at 9:03 PM, Todd Lipcon <[email protected]> wrote:

> What version are you building from? These tools are new as of this past
> June.
>
> -Todd
>
> On Fri, Oct 8, 2010 at 4:52 PM, Leo Alekseyev <[email protected]> wrote:
>
> > We want to investigate HBase bulk imports, as described on
> > http://hbase.apache.org/docs/r0.89.20100726/bulk-loads.html and/or
> > JIRA HBASE-48.  I can't seem to run either the importtsv tool or the
> > completebulkload tool using the hadoop jar /path/to/hbase-VERSION.jar
> > command.  In fact, the ImportTsv class is not part of that jar file.
> > Am I looking in the wrong place for this class, or do I need to
> > somehow customize the build process to include it?  Our HBase was
> > built from source using the default procedure.
> >
> > Thanks for any insight,
> > --Leo
> >
>
>
>
> --
> Todd Lipcon
> Software Engineer, Cloudera
>
