Why not just a single LOAD? It's also worth checking why the scanners are timing out. Are the regions splitting underneath you while you scan? Do you have the HBase balancer turned on?
On Sep 12, 2011, at 7:51 AM, Norbert Burger <[email protected]> wrote:

> Folks -- we have a timeseries-based table we recently converted to a salted
> key schema [1] in order to avoid region hotspotting. The rowkey format is:
>
> salt-timestamp-sessionid-eventtype, where:
>
> salt has the form 00..13, and the timestamp is a Unix timestamp (epoch
> based).
>
> With the version 0.10.0 HBaseStorage, what's the recommended way to LOAD a
> salted schema from Pig? Initially, I thought we'd just fire off multiple
> LOADs, one for each region (in our case, up to 14), but we're hitting
> frequently ScannerTimeoutExceptions with this approach, even on a sample
> script that does nothing but LOADs.
>
> Is there a better way?
>
> Thanks,
> Norbert
>
> [1]
> http://ofps.oreilly.com/titles/9781449396107/advanced.html#ch09_id2336987
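
For reference, with the rowkey layout described above, a time-bounded read of a salted table amounts to one row range per salt bucket. Below is a rough Python sketch of how those bounds could be computed. It assumes a two-digit zero-padded salt (00..13), "-" as the field separator, and decimal epoch-second timestamps of equal width; the function name `salted_scan_ranges` is made up for illustration and is not part of Pig or HBase.

```python
def salted_scan_ranges(start_ts, stop_ts, buckets=14):
    """Return one (start_row, stop_row) pair per salt bucket.

    Each pair bounds the rows whose timestamp falls in [start_ts, stop_ts)
    within that bucket. Assumes keys look like "salt-timestamp-..." and that
    both timestamps have the same number of digits, so that string order
    matches numeric order.
    """
    if len(str(start_ts)) != len(str(stop_ts)):
        raise ValueError("timestamps must have the same number of digits")

    ranges = []
    for salt in range(buckets):
        prefix = f"{salt:02d}"  # 0 -> "00", 13 -> "13"
        ranges.append((f"{prefix}-{start_ts}", f"{prefix}-{stop_ts}"))
    return ranges


if __name__ == "__main__":
    # One hour of data, split into 14 per-bucket row ranges.
    for start_row, stop_row in salted_scan_ranges(1315800000, 1315803600):
        print(start_row, stop_row)
```

Each pair could then serve as the lower and upper row bound for one bucket's scan, whether the scans are issued as separate loads or combined afterwards.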
