Re: [prometheus-users] Prometheus with remote_write VM dies with OOM: remote_write settings problem?

Olga Chukanova Tue, 19 Jan 2021 05:08:04 -0800

Firstly - thank you for your answer and advice!
I've increased capacity to 20000, but it didn't help. Anyway, I had to
do it, I misread the documentation, thank you.
To calculate required memory, I used this calc:
https://www.robustperception.io/how-much-ram-does-prometheus-2-x-need-for-cardinality-and-ingestion
With my max_over_time(prometheus_tsdb_head_series[1d]) = 769255 and unique
label pairs (count(kube_pod_labels{app=~".*"})) 1154, Combined Memory =
3,868MiB
Plus 30% for remote_write and we have ~5.1MiB
My pod has 6Gb memory in limits. May it be not enough 1Gb in reserve? I
know if I increase the memory limit to 8Gb it can solve my problem, I just
want to be sure this is only one way.


пт, 15 янв. 2021 г. в 13:52, Aliaksandr Valialkin <[email protected]>:

> Try increasing `capacity` to 3x max_samples_per_send, i.e. to 20000 for
> your case according to https://prometheus.io/docs/practices/remote_write/
> .
>
> Prometheus may require up to 30% more memory after enabling remote_write
> according to production measurements. Make sure that your Prometheus
> instance runs on a host with at least 30% of free memory before enabling
> remote_write on it.
>
> On Fri, Jan 15, 2021 at 11:18 AM Olga Chukanova <[email protected]>
> wrote:
>
>> Hello!
>> I have prometheus like monitoring system in kubernetes, and I trying to
>> set up remote_write to victoria metrics. But I have one tragic problem - my
>> prometheus dies by OOM.
>>  I’ve tested two versions of Prometheus (v2.11.0 and v.2.23.0) and had
>> same problem on both.
>> My average value of rate(prometheus_remote_storage_samples_in_total [5m])
>> is ~75k, prometheus pod limits is cpu ‘4’ and memory 6144M and average
>> metric prometheus_remote_storage_shards = 1.
>> Settings in remote_write are:
>>         queue_config:
>>           capacity: 100
>>           max_samples_per_send: 10000
>>           max_shards: 10
>>           min_shards: 1
>> Global scrape setting:
>>     global:
>>       scrape_interval: 10s
>>       scrape_timeout: 10s
>>       evaluation_interval: 10s
>> In logs (with debug mode) I didn’t found anything, what can explayn the
>> problem.
>> I think, I’m doing something wrong in remote_write setting, but I don’t
>> understand what, and based on wich metrics I should configure that.
>> Thank you for any help!
>>
>> --
>> You received this message because you are subscribed to the Google Groups
>> "Prometheus Users" group.
>> To unsubscribe from this group and stop receiving emails from it, send an
>> email to [email protected].
>> To view this discussion on the web visit
>> https://groups.google.com/d/msgid/prometheus-users/830a40c0-dac9-43ca-880d-5898a2d70f50n%40googlegroups.com
>> <https://groups.google.com/d/msgid/prometheus-users/830a40c0-dac9-43ca-880d-5898a2d70f50n%40googlegroups.com?utm_medium=email&utm_source=footer>
>> .
>>
>
>
> --
> Best Regards,
>
> Aliaksandr Valialkin, CTO VictoriaMetrics
>


-- 
С уважением, Ольга.

+7 (968) 760-89-95

-- 
You received this message because you are subscribed to the Google Groups 
"Prometheus Users" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
To view this discussion on the web visit 
https://groups.google.com/d/msgid/prometheus-users/CAH81ULF1XnsojRaABO8L9L8tv9hsC4_nO%2BSdeCh9VBS3-3QPrA%40mail.gmail.com.

Re: [prometheus-users] Prometheus with remote_write VM dies with OOM: remote_write settings problem?

Reply via email to