You shouldn't have to create a new BatchWriter -- have you tried reducing the amount of memory the BatchWriter will use? It keeps an internal cache of Mutations so it can amortize the cost of sending them to a given tabletserver.

To limit this memory, use the BatchWriterConfig#setMaxMemory(long) method. By default, maxMemory is set to 50MB. Reducing it means the client holds less data in its buffer, which should give you some more headroom.
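
Something like this (an untested sketch; "mytable", the Connector, and the 10MB cap are placeholders for your own table, client object, and limit):

    import org.apache.accumulo.core.client.BatchWriter;
    import org.apache.accumulo.core.client.BatchWriterConfig;
    import org.apache.accumulo.core.client.Connector;
    import org.apache.accumulo.core.client.MutationsRejectedException;
    import org.apache.accumulo.core.client.TableNotFoundException;
    import org.apache.accumulo.core.data.Mutation;
    import org.apache.accumulo.core.data.Value;
    import org.apache.hadoop.io.Text;

    void writeWithSmallBuffer(Connector conn)
        throws TableNotFoundException, MutationsRejectedException {
      BatchWriterConfig config = new BatchWriterConfig();
      // Cap the client-side buffer at ~10MB instead of the 50MB default.
      config.setMaxMemory(10 * 1024 * 1024L);

      BatchWriter writer = conn.createBatchWriter("mytable", config);
      try {
        Mutation m = new Mutation(new Text("row1"));
        m.put(new Text("cf"), new Text("cq"), new Value("val".getBytes()));
        writer.addMutation(m);
      } finally {
        writer.close(); // flushes anything still buffered
      }
    }

The one BatchWriter can then be reused for the whole load; there's no need to tear it down and recreate it every N rows.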

Alternatively, you could give your client JVM some more heap (e.g., by raising -Xmx) :)

Geoffry Roberts wrote:
I am trying to pump some data into Accumulo, but I keep encountering:

Exception in thread "Thrift Connection Pool Checker" java.lang.OutOfMemoryError: Java heap space
        at java.util.HashMap.newValueIterator(HashMap.java:971)
        at java.util.HashMap$Values.iterator(HashMap.java:1038)
        at org.apache.accumulo.core.client.impl.ThriftTransportPool$Closer.closeConnections(ThriftTransportPool.java:103)
        at org.apache.accumulo.core.client.impl.ThriftTransportPool$Closer.run(ThriftTransportPool.java:147)
        at java.lang.Thread.run(Thread.java:745)


As a workaround, I tried creating a new BatchWriter and closing the
old one every ten thousand rows, but to no avail: data gets written up
to about the 200,000th row, and then the error appears.

I have a table of 8M rows in an RDB that I am pumping into Accumulo via a
Groovy script. The rows are narrow: a short text field and four floats.

I googled, of course, but nothing was helpful. What can be done?

Thanks so much.

--
There are ways and there are ways,

Geoffry Roberts
