Re: C* 2.1.2 invokes oom-killer

Michał Łowicki Mon, 23 Feb 2015 02:16:51 -0800

After couple of days it's still behaving fine. Case closed.

On Thu, Feb 19, 2015 at 11:15 PM, Michał Łowicki <mlowi...@gmail.com> wrote:


> Upgrade to 2.1.3 seems to help so far. After ~12 hours total memory
> consumption grew from 10GB to 10.5GB.
>
> On Thu, Feb 19, 2015 at 2:02 PM, Carlos Rolo <r...@pythian.com> wrote:
>
>> Then you are probably hitting a bug... Trying to find out in Jira. The
>> bad news is the fix is only to be released on 2.1.4. Once I find it out I
>> will post it here.
>>
>> Regards,
>>
>> Carlos Juzarte Rolo
>> Cassandra Consultant
>>
>> Pythian - Love your data
>>
>> rolo@pythian | Twitter: cjrolo | Linkedin: *linkedin.com/in/carlosjuzarterolo
>> <http://linkedin.com/in/carlosjuzarterolo>*
>> Tel: 1649
>> www.pythian.com
>>
>> On Thu, Feb 19, 2015 at 12:16 PM, Michał Łowicki <mlowi...@gmail.com>
>> wrote:
>>
>>> |trickle_fsync| has been enabled for long time in our settings (just
>>> noticed):
>>>
>>> trickle_fsync: true
>>>
>>> trickle_fsync_interval_in_kb: 10240
>>>
>>> On Thu, Feb 19, 2015 at 12:12 PM, Michał Łowicki <mlowi...@gmail.com>
>>> wrote:
>>>
>>>>
>>>>
>>>> On Thu, Feb 19, 2015 at 11:02 AM, Carlos Rolo <r...@pythian.com> wrote:
>>>>
>>>>> Do you have trickle_fsync enabled? Try to enable that and see if it
>>>>> solves your problem, since you are getting out of non-heap memory.
>>>>>
>>>>> Another question, is always the same nodes that die? Or is 2 out of 4
>>>>> that die?
>>>>>
>>>>
>>>> Always the same nodes. Upgraded to 2.1.3 two hours ago so we'll monitor
>>>> if maybe issue has been fixed there. If not will try to enable
>>>> |tricke_fsync|
>>>>
>>>>
>>>>>
>>>>> Regards,
>>>>>
>>>>> Carlos Juzarte Rolo
>>>>> Cassandra Consultant
>>>>>
>>>>> Pythian - Love your data
>>>>>
>>>>> rolo@pythian | Twitter: cjrolo | Linkedin: 
>>>>> *linkedin.com/in/carlosjuzarterolo
>>>>> <http://linkedin.com/in/carlosjuzarterolo>*
>>>>> Tel: 1649
>>>>> www.pythian.com
>>>>>
>>>>> On Thu, Feb 19, 2015 at 10:49 AM, Michał Łowicki <mlowi...@gmail.com>
>>>>> wrote:
>>>>>
>>>>>>
>>>>>>
>>>>>> On Thu, Feb 19, 2015 at 10:41 AM, Carlos Rolo <r...@pythian.com>
>>>>>> wrote:
>>>>>>
>>>>>>> So compaction doesn't seem to be your problem (You can check with
>>>>>>> nodetool compactionstats just to be sure).
>>>>>>>
>>>>>>
>>>>>> pending tasks: 0
>>>>>>
>>>>>>
>>>>>>>
>>>>>>> How much is your write latency on your column families? I had OOM
>>>>>>> related to this before, and there was a tipping point around 70ms.
>>>>>>>
>>>>>>
>>>>>> Write request latency is below 0.05 ms/op (avg). Checked with
>>>>>> OpsCenter.
>>>>>>
>>>>>>
>>>>>>>
>>>>>>> --
>>>>>>>
>>>>>>>
>>>>>>>
>>>>>>>
>>>>>>
>>>>>>
>>>>>> --
>>>>>> BR,
>>>>>> Michał Łowicki
>>>>>>
>>>>>
>>>>>
>>>>> --
>>>>>
>>>>>
>>>>>
>>>>>
>>>>
>>>>
>>>> --
>>>> BR,
>>>> Michał Łowicki
>>>>
>>>
>>>
>>>
>>> --
>>> BR,
>>> Michał Łowicki
>>>
>>
>>
>> --
>>
>>
>>
>>
>
>
> --
> BR,
> Michał Łowicki
>



-- 
BR,
Michał Łowicki

Re: C* 2.1.2 invokes oom-killer

Reply via email to