Hi All,

Need help here...

we are facing OOM issue , once after upgrading prometheus to 2.18.0.

All of sudden , we are seeing spike in memory and reaching more than limit 
whatever we have specified.

Not seeing any error in the logs , other than restart logs.

What could be the reason or how to debug this issue? Please help us here.

EKS version -- 1.14
Prometheus version -- 2.18.0

Below are the logs :

level=info ts=2020-10-27T06:31:17.396Z caller=main.go:337 msg="Starting 
Prometheus" version="(version=2.18.0, branch=HEAD, 
revision=a12e96299dcd159ea09b260f1a21e7e4b86e011d)"
level=info ts=2020-10-27T06:31:17.396Z caller=main.go:338 
build_context="(go=go1.14.2, user=root@7fbcff55abdb, 
date=20200505-14:26:04)"
level=info ts=2020-10-27T06:31:17.396Z caller=main.go:339 
host_details="(Linux 4.14.181-140.257.amzn2.x86_64 #1 
<https://github.com/prometheus/prometheus/pull/1> SMP Wed May 27 02:17:36 
UTC 2020 x86_64 prometheus-prod-prometheus-server-5688bd7769-xrsrt (none))"
level=info ts=2020-10-27T06:31:17.397Z caller=main.go:340 
fd_limits="(soft=1048576, hard=1048576)"
level=info ts=2020-10-27T06:31:17.397Z caller=main.go:341 
vm_limits="(soft=unlimited, hard=unlimited)"
level=info ts=2020-10-27T06:31:17.398Z caller=query_logger.go:79 
component=activeQueryTracker
level=info ts=2020-10-27T06:31:17.399Z caller=main.go:677 msg="Starting 
TSDB ..."
level=info ts=2020-10-27T06:31:17.399Z caller=web.go:523 component=web 
msg="Start listening for connections" address=0.0.0.0:9090
level=info ts=2020-10-27T06:31:17.402Z caller=repair.go:59 component=tsdb 
msg="Found healthy block" mint=1588053600000 maxt=1588636800000 
ulid=01E7H5KPWT66H8PK549SPHDMJ7
level=info ts=2020-10-27T06:31:17.403Z caller=repair.go:59 component=tsdb 
msg="Found healthy block" mint=1588636800000 maxt=1589220000000 
ulid=01E82HS5EJ9WW34XGN83WPEQRX
level=info ts=2020-10-27T06:31:17.404Z caller=repair.go:59 component=tsdb 
msg="Found healthy block" mint=1589220000000 maxt=1589803200000 
ulid=01E8KXZBSMVA7D592S051QNQH6
level=info ts=2020-10-27T06:31:17.406Z caller=repair.go:59 component=tsdb 
msg="Found healthy block" mint=1589803200000 maxt=1590386400000 
ulid=01E95A5E1Q8KCQKPG0ZKB1QRP6
level=info ts=2020-10-27T06:31:17.407Z caller=repair.go:59 component=tsdb 
msg="Found healthy block" mint=1590386400000 maxt=1590969600000 
ulid=01E9PPBGNYR73Z97V1V38B7SV5
level=info ts=2020-10-27T06:31:17.408Z caller=repair.go:59 component=tsdb 
msg="Found healthy block" mint=1590969600000 maxt=1591552800000 
ulid=01EA82H5042JTYAB8Z24RY2G9E
level=info ts=2020-10-27T06:31:17.409Z caller=repair.go:59 component=tsdb 
msg="Found healthy block" mint=1591552800000 maxt=1592136000000 
ulid=01EASEQ60C4J84ZEY9R1QSASKS
level=info ts=2020-10-27T06:31:17.410Z caller=repair.go:59 component=tsdb 
msg="Found healthy block" mint=1592136000000 maxt=1592719200000 
ulid=01EBATWN0KEX63VMFH12XGV83M
level=info ts=2020-10-27T06:31:17.411Z caller=repair.go:59 component=tsdb 
msg="Found healthy block" mint=1592719200000 maxt=1593302400000 
ulid=01EBWE35327YZVVB1KGTJ9NPJX
level=info ts=2020-10-27T06:31:17.412Z caller=repair.go:59 component=tsdb 
msg="Found healthy block" mint=1593302400000 maxt=1593885600000 
ulid=01ECDT970HG6WBZTQVKJFKPRAE
level=info ts=2020-10-27T06:31:17.414Z caller=repair.go:59 component=tsdb 
msg="Found healthy block" mint=1593885600000 maxt=1594468800000 
ulid=01ECZ6GREX8ZPWJB00K1BJ74BX
level=info ts=2020-10-27T06:31:17.415Z caller=repair.go:59 component=tsdb 
msg="Found healthy block" mint=1594468800000 maxt=1595052000000 
ulid=01EDGJSABWS48E4GWZ5WBVHAQM
level=info ts=2020-10-27T06:31:17.416Z caller=repair.go:59 component=tsdb 
msg="Found healthy block" mint=159505

-- 
You received this message because you are subscribed to the Google Groups 
"Prometheus Users" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
To view this discussion on the web visit 
https://groups.google.com/d/msgid/prometheus-users/cd49223d-8fd2-416f-a382-f06d0a7beff4n%40googlegroups.com.

Reply via email to