Jonas Borgström created CASSANDRA-15006:
-------------------------------------------

             Summary: Possible java.nio.DirectByteBuffer leak
                 Key: CASSANDRA-15006
                 URL: https://issues.apache.org/jira/browse/CASSANDRA-15006
             Project: Cassandra
          Issue Type: Bug
         Environment: cassandra: 3.11.3
jre: openjdk version "1.8.0_181"
heap size: 2GB
memory limit: 3GB (cgroup)

I started one of the nodes with "-Djdk.nio.maxCachedBufferSize=262144" but that 
did not seem to make any difference.
            Reporter: Jonas Borgström
         Attachments: Screenshot_2019-02-04 Grafana - Cassandra.png

While testing a 3 node 3.11.3 cluster I noticed that the nodes were suddenly 
killed by the Linux OOM killer after running without issues for 4-5 weeks.


After enabling more metrics and leaving the nodes running for 12 days it sure 
looks like the
"java.nio:type=BufferPool,name=direct" Mbean shows a very linear growth (approx 
15MiB/24h, see attached screenshot). Is this expected to keep growing linearly 
after 12 days with a constant load?

 

In my setup the growth/leak is about 15MiB/day so I guess in most setups it 
would take quite a few days until it becomes noticeable. I'm able to see the 
same type of slow growth in other production clusters even though the graph 
data is more noisy.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to