Thanks for the detailed answer, Alexander.
We'll look into your suggestions; they're definitely helpful. We have plans
to reduce tombstones and remove the table with the big partitions,
hopefully after we've done that the cluster will be stable again.
Thanks again.
On Tue, Nov 22, 2016, at 09
Hi Vincent,
Here are a few pointers for disabling swap:
- https://docs.datastax.com/en/cassandra/2.0/cassandra/install/installRecommendSettings.html
- http://stackoverflow.com/questions/22988824/why-swap-needs-to-be-turned-off-in-datastax-cassandra
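In practice, disabling swap on a Linux node usually comes down to something like the following (a sketch using standard Linux tools; the fstab pattern and the sysctl file name are assumptions, check them against your distro):

```shell
# Turn off all active swap devices immediately
sudo swapoff --all

# Comment out swap entries in /etc/fstab so swap stays off after reboot
sudo sed -i '/^[^#].*\sswap\s/ s/^/#/' /etc/fstab

# As a belt-and-braces fallback, tell the kernel to avoid swapping
echo 'vm.swappiness = 1' | sudo tee /etc/sysctl.d/60-cassandra.conf
sudo sysctl -p /etc/sysctl.d/60-cassandra.conf
```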
Tombstones are definitely the kind of object th
Thanks for your answer Alexander.
We write to the table constantly; we estimate something like 1.5k to 2k
writes per second. Some of these requests update a bunch of fields, and
some update fields and append something to a set.
We don't read constantly from it but when we do it's a lot of re
Vincent,
only the 2.68 GB partition is out of bounds here; all the others (<256 MB)
shouldn't be much of a problem.
It could put pressure on your heap if it is often read and/or compacted.
But to answer your question about the 1% harming the cluster, a few big
partitions can definitely be a big prob
@Vladimir
We tried with 12 GB and 16 GB; the problem eventually appeared as well.
In this particular cluster we have 143 tables across 2 keyspaces.
@Alexander
We have one table with a max partition of 2.68 GB, one of 256 MB, and a
bunch ranging from roughly 10 MB to 100 MB. Then there's the
Hi Vincent,
one of the usual causes of OOMs is very large partitions.
Could you check your nodetool cfstats output for large partitions? If you
find one (or more), run nodetool cfhistograms on those tables to get a view
of the partition size distribution.
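Assuming nodetool is on the path and can reach the node, the check might look like this (my_keyspace and my_table are placeholders for your own names):

```shell
# Per-table stats; "Compacted partition maximum bytes" flags oversized partitions
nodetool cfstats my_keyspace | grep -E 'Table:|Compacted partition maximum bytes'

# Distribution of partition sizes (and cell counts) for one suspect table
nodetool cfhistograms my_keyspace my_table
```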
Thanks
On Mon, Nov 21, 2016 a
Did you try any value in the 8-20 GB range (e.g. 60-70% of physical memory)?
Also, how many tables do you have across all keyspaces? Each table can consume
a minimum of 1 MB of Java heap.
Best regards, Vladimir Yudovin,
Winguzone - Hosted Cloud Cassandra
Launch your cluster in minutes.
On Mon
Hello,
we have an 8-node Cassandra 2.1.15 cluster at work which has been giving
us a lot of trouble lately.
The problem is simple: nodes regularly die, either from an out-of-memory
exception or because the Linux OOM killer decides to kill the process.
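The two failure modes leave different traces, so we can tell them apart per incident (the log path is the usual Debian/RHEL package default and may differ on your hosts):

```shell
# A kernel OOM kill leaves a record in the kernel log
dmesg -T | grep -i 'killed process'

# A JVM OutOfMemoryError shows up in Cassandra's own log
grep -i 'OutOfMemoryError' /var/log/cassandra/system.log
```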
A couple of weeks ago we increased the heap to 20 GB, hop