Hi,
We have a Prometheus server which is our primary Prometheus server. we had 
a Prometheus federation which used to gather all the metrics from the main 
Prometheus every 90 seconds. we now decided to stop using federation as the 
backup solution and move the Prometheus configuration to the ansible so 
both servers can have the same configuration at the same time, the main 
Prometheus and the federation's version was 2.15.2. after changing the 
federation server's configuration and putting the same configuration as the 
primary Prometheus on it, I decided to update the binary file to 2.19.2. 
At first, I got service unavailable alert for about 10 minutes, after an 
hour, we start to get lots of alerts claiming exporter has no data. 
The problem got fixed after 30 minutes  but again we start to get lots of 
alerts about it after few hours.
when I checked the logs, I saw all jobs have "contecxt deadline exceeded" 
alert (during the problem I couldn't connect to the web interface as well) 
whereas I don't get any alert from the primary Prometheus and everything 
works fine there.
here is my systemd configuration for the secondary Prometheus:

<pre style='color:#000000;background:#ffffff;'><span style='color:#808030; 
'>[</span><span style='color:#0000e6; '>Unit</span><span 
style='color:#808030; '>]</span>
<span style='color:#797997; '>Description</span><span style='color:#808030; 
'>=</span>Prometheus
<span style='color:#797997; '>After</span><span style='color:#808030; 
'>=</span>network-online<span style='color:#800000; font-weight:bold; 
'>.</span>target

<span style='color:#808030; '>[</span><span style='color:#0000e6; 
'>Service</span><span style='color:#808030; '>]</span>
<span style='color:#797997; '>Type</span><span style='color:#808030; 
'>=</span>simple
<span style='color:#797997; '>Environment</span><span style='color:#808030; 
'>=</span><span style='color:#0000e6; '>"</span><span style='color:#797997; 
'>GOMAXPROCS</span><span style='color:#808030; '>=</span><span 
style='color:#0000e6; '>8"</span>
<span style='color:#797997; '>User</span><span style='color:#808030; 
'>=</span>prometheus
<span style='color:#797997; '>Group</span><span style='color:#808030; 
'>=</span>prometheus
<span style='color:#797997; '>ExecReload</span><span style='color:#808030; 
'>=</span><span style='color:#40015a; '>/bin/kill</span> -HUP <span 
style='color:#797997; '>$MAINPID</span>
<span style='color:#797997; '>ExecStart</span><span style='color:#808030; 
'>=</span><span style='color:#40015a; '>/usr/local/sbin/prometheus</span> 
<span style='color:#0f69ff; '>\</span>
  --<span style='color:#797997; '>config.file</span><span 
style='color:#808030; '>=</span><span style='color:#40015a; 
'>/etc/prometheus/prometheus.yml</span> <span style='color:#0f69ff; 
'>\</span>
  --<span style='color:#797997; '>storage.tsdb.path</span><span 
style='color:#808030; '>=</span><span style='color:#40015a; 
'>/var/lib/prometheus</span> <span style='color:#0f69ff; '>\</span>
  --<span style='color:#797997; '>storage.tsdb.retention.time</span><span 
style='color:#808030; '>=</span>30d <span style='color:#0f69ff; '>\</span>
  --<span style='color:#797997; '>storage.tsdb.retention.size</span><span 
style='color:#808030; '>=</span>275GB <span style='color:#0f69ff; '>\</span>
  --<span style='color:#797997; '>web.console.libraries</span><span 
style='color:#808030; '>=</span><span style='color:#40015a; 
'>/etc/prometheus/console_libraries</span> <span style='color:#0f69ff; 
'>\</span>
  --<span style='color:#797997; '>web.console.templates</span><span 
style='color:#808030; '>=</span><span style='color:#40015a; 
'>/etc/prometheus/consoles</span> <span style='color:#0f69ff; '>\</span>
  --web<span style='color:#800000; font-weight:bold; '>.</span><span 
style='color:#bb7977; font-weight:bold; '>enable</span>-admin-api <span 
style='color:#0f69ff; '>\</span>
  --<span style='color:#797997; '>web.listen-address</span><span 
style='color:#808030; '>=</span><span style='color:#008c00; '>0</span><span 
style='color:#800000; font-weight:bold; '>.</span><span 
style='color:#008c00; '>0</span><span style='color:#800000; 
font-weight:bold; '>.</span><span style='color:#008c00; '>0</span><span 
style='color:#800000; font-weight:bold; '>.</span><span 
style='color:#008c00; '>0</span><span style='color:#808030; '>:</span><span 
style='color:#008c00; '>9090</span> 

<span style='color:#797997; '>CapabilityBoundingSet</span><span 
style='color:#808030; '>=</span>CAP_SET_UID
<span style='color:#797997; '>LimitNOFILE</span><span style='color:#808030; 
'>=</span><span style='color:#008c00; '>65000</span>
<span style='color:#797997; '>LockPersonality</span><span 
style='color:#808030; '>=</span><span style='color:#44aadd; '>true</span>
<span style='color:#797997; '>NoNewPrivileges</span><span 
style='color:#808030; '>=</span><span style='color:#44aadd; '>true</span>
<span style='color:#797997; '>MemoryDenyWriteExecute</span><span 
style='color:#808030; '>=</span><span style='color:#44aadd; '>true</span>
<span style='color:#797997; '>PrivateDevices</span><span 
style='color:#808030; '>=</span><span style='color:#44aadd; '>true</span>
<span style='color:#797997; '>PrivateTmp</span><span style='color:#808030; 
'>=</span><span style='color:#44aadd; '>true</span>
<span style='color:#797997; '>ProtectHome</span><span style='color:#808030; 
'>=</span><span style='color:#44aadd; '>true</span>
<span style='color:#797997; '>RemoveIPC</span><span style='color:#808030; 
'>=</span><span style='color:#44aadd; '>true</span>
<span style='color:#797997; '>RestrictSUIDSGID</span><span 
style='color:#808030; '>=</span><span style='color:#44aadd; '>true</span>
<span style='color:#797997; '>CPUAccounting</span><span 
style='color:#808030; '>=</span>yes
<span style='color:#797997; '>MemoryAccounting</span><span 
style='color:#808030; '>=</span>yes
<span style='color:#696969; '>#SystemCallFilter=@signal @timer</span>

<span style='color:#797997; '>ReadWritePaths</span><span 
style='color:#808030; '>=</span><span style='color:#40015a; 
'>/var/lib/prometheus</span>

<span style='color:#797997; '>PrivateUsers</span><span 
style='color:#808030; '>=</span><span style='color:#44aadd; '>true</span>
<span style='color:#797997; '>ProtectControlGroups</span><span 
style='color:#808030; '>=</span><span style='color:#44aadd; '>true</span>
<span style='color:#797997; '>ProtectKernelModules</span><span 
style='color:#808030; '>=</span><span style='color:#44aadd; '>true</span>
<span style='color:#797997; '>ProtectKernelTunables</span><span 
style='color:#808030; '>=</span><span style='color:#44aadd; '>true</span>
<span style='color:#797997; '>ProtectSystem</span><span 
style='color:#808030; '>=</span>strict


<span style='color:#797997; '>SyslogIdentifier</span><span 
style='color:#808030; '>=</span>prometheus
<span style='color:#797997; '>Restart</span><span style='color:#808030; 
'>=</span>always

<span style='color:#808030; '>[</span><span style='color:#0000e6; 
'>Install</span><span style='color:#808030; '>]</span>
<span style='color:#797997; '>WantedBy</span><span style='color:#808030; 
'>=</span>multi-user<span style='color:#800000; font-weight:bold; 
'>.</span>target
</pre>
<!--Created using ToHtml.com on 2020-06-30 16:52:20 UTC -->

-- 
You received this message because you are subscribed to the Google Groups 
"Prometheus Users" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
To view this discussion on the web visit 
https://groups.google.com/d/msgid/prometheus-users/5362956f-268d-477e-9d9b-0146e73f2fden%40googlegroups.com.

Reply via email to