HI,

i'd like work with key range support to bulkload to avoid too many reducers
mentioned in with these issues (HBASE-9556,HBASE-4063)

Description and high level design for  proposed solution

Currently while we loading bulk data in to Hbase through Mapredue in form
of TableInputFormatBase the number of splits matches the number of regions
in a table
so Here i am going to change the process TableInputFormatBase deceides
range for key splits
 For example if input data going to load data in 50 regions(Actullay RS has
400 Regions)

   - List<InputSplit> getSplits(JobContext context) will find  50 exact
   list of splits (Currently it returns 400 )


Am i understand correctly? please let me know if Am I on the wrong track
.Any one is willing to mentor me because i am new to ASF

Thanks
Beeshma

Reply via email to