Hi,

we are trying to create an HBase table from scratch using MapReduce and HFileOutputFormat. However, we haven't really found any examples or tutorials on how to do this, and some aspects are still unclear to us. We are using HBase 0.20.x.

First, what is the correct way to use HFileOutputFormat to create HFiles? We are simply using a map function that outputs <ImmutableBytesWritable (key), Put (value)> pairs, an identity reducer, and a job configured with HFileOutputFormat as the output format class. However, we have seen that HBase 0.89.x does this in a more involved way, with sorting (KeyValueSortReducer or PutSortReducer) and a partitioner (TotalOrderPartitioner); there, HFileOutputFormat provides a convenience method, configureIncrementalLoad, to configure the Hadoop job automatically. Is this method needed in our case, or is it only necessary when the table already exists (incremental bulk load)? Do we have to reimplement it for 0.20.x?
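For reference, here is roughly what our current job setup looks like (a sketch against the Hadoop 0.20 MapReduce API; MyMapper stands in for our own mapper class, paths and error handling omitted):

```java
// Sketch of our bulk-create job: mapper emits <ImmutableBytesWritable, Put>,
// identity reducer (the default), HFileOutputFormat writes the HFiles.
// MyMapper and the input/output paths are placeholders for our own code.
import org.apache.hadoop.fs.Path;
import org.apache.hadoop.hbase.HBaseConfiguration;
import org.apache.hadoop.hbase.client.Put;
import org.apache.hadoop.hbase.io.ImmutableBytesWritable;
import org.apache.hadoop.hbase.mapreduce.HFileOutputFormat;
import org.apache.hadoop.mapreduce.Job;
import org.apache.hadoop.mapreduce.lib.input.FileInputFormat;
import org.apache.hadoop.mapreduce.lib.output.FileOutputFormat;

public class BulkCreateJob {
  public static void main(String[] args) throws Exception {
    Job job = new Job(new HBaseConfiguration(), "bulk-create");
    job.setJarByClass(BulkCreateJob.class);
    job.setMapperClass(MyMapper.class);  // our mapper, not shown here
    job.setMapOutputKeyClass(ImmutableBytesWritable.class);
    job.setMapOutputValueClass(Put.class);
    // No explicit reducer: the identity reducer is used.
    job.setOutputFormatClass(HFileOutputFormat.class);
    FileInputFormat.addInputPath(job, new Path(args[0]));
    FileOutputFormat.setOutputPath(job, new Path(args[1]));
    System.exit(job.waitForCompletion(true) ? 0 : 1);
  }
}
```

(This needs a running Hadoop/HBase installation and the corresponding jars on the classpath, so it is only a sketch of the configuration, not something we can show running standalone.)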

Then, once the table creation job has completed successfully, how do we import the HFiles into HBase? Is it done with the HBase CLI import command?
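Our current guess (untested) is the loadtable.rb script shipped with the 0.20.x distribution, pointed at the table name and the job's output directory; the table name and path below are placeholders:

```shell
# Untested guess: load the HFiles produced by the job into a new table.
# "mytable" and "/hfile-output" are placeholders for our own values.
${HBASE_HOME}/bin/hbase org.jruby.Main ${HBASE_HOME}/bin/loadtable.rb mytable /hfile-output
```

Is that the intended tool for 0.20.x, or is there something equivalent to the completebulkload utility we saw mentioned for 0.89.x?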

Thanks in advance for your answers,
Regards
--
Renaud Delbru
