Daniel Leffel wrote:
The Reduce phase is a simple reduce using the TableReduceOutputFormat.
Try upping the number of reducers -- mapred.reduce.tasks in your
hadoop-site.xml. Default is 1. Try with 4/8/16?
Is your hbase cluster all running on the same node too?
In this configuration, it writes 50-100 or so inserts per second (which
doesn't strike me as terrible), but given the low load factor of 0.20 on the
machine, I don't understand what's keeping it from performing better.
Yeah... doesn't java just idling consume a load of 0.2 (smile). Is this
a multicore box?
St.Ack