Re: Bulk loading (and/or major compaction) causing OOM

2012-12-08 Thread Bryan Beaudreault
Thanks for the responses guys. Responses inline When you are doing the bulk load, are you pre-split your regions? What OS are you using and what version of Java? Yes, regions are pre-split. We calculated them using M/R before attempting to bulk load the data. We've done this before with

Re: Bulk loading (and/or major compaction) causing OOM

2012-12-08 Thread Marcos Ortiz
On 12/08/2012 11:50 AM, Bryan Beaudreault wrote: Thanks for the responses guys. Responses inline When you are doing the bulk load, are you pre-split your regions? What OS are you using and what version of Java? Yes, regions are pre-split. We calculated them using M/R before attempting to

Re: Bulk loading (and/or major compaction) causing OOM

2012-12-08 Thread Bryan Beaudreault
Merging is not an option for us, because we cannot afford to bring our cluster down. Also, we are not yet convinced that our cluster can handle such large regions due to all the OOM issues we are seeing when trying to bring new, bigger regions online. On Sat, Dec 8, 2012 at 3:42 PM, Marcos

Bulk loading (and/or major compaction) causing OOM

2012-12-07 Thread Bryan Beaudreault
We have a couple tables that had thousands of regions due to the size of the day in them. We recently changed them to have larger regions (nearly 4GB). We are trying to bulk load these in now, but every time we do our servers die with OOM. The logs seem to show that there is always a major

Re: Bulk loading (and/or major compaction) causing OOM

2012-12-07 Thread Marcos Ortiz
On 12/07/2012 04:01 PM, Bryan Beaudreault wrote: We have a couple tables that had thousands of regions due to the size of the day in them. We recently changed them to have larger regions (nearly 4GB). We are trying to bulk load these in now, but every time we do our servers die with OOM.

Re: Bulk loading (and/or major compaction) causing OOM

2012-12-07 Thread Stack
On Fri, Dec 7, 2012 at 1:01 PM, Bryan Beaudreault bbeaudrea...@hubspot.comwrote: We have a couple tables that had thousands of regions due to the size of the day in them. We recently changed them to have larger regions (nearly 4GB). We are trying to bulk load these in now, but every time we