Speeding up LoadIncrementalHFiles?

Adam Phelps Wed, 30 Mar 2011 18:33:25 -0700

Does anyone have any suggestions for speeding up LoadIncrementalHFiles?

We have M/R jobs that directly generate HFiles and are then loaded intoHBase via LoadIncrementalHFiles. We're attempting to maintain a backupof our production HBase on a backup Hadoop cluster by copying the HFilesthere and then loading them there.

The problem we're running into is that we want the backup cluster to usea good number fewer nodes than the primary cluster, however despitehaving a pretty low load (CPU, disk IO, etc) it isn't keeping up well.We'd rather not dedicate more nodes from the overall pool to thispurpose if at all possible. Are there any settings that can be adjustedto improve the performance of the bulk load?

Alternate suggestions for maintaining an HBase backup would also be ofinterest.


- Adam

Speeding up LoadIncrementalHFiles?

Reply via email to