dcausse created this task. dcausse added a project: Wikidata-Query-Service. Restricted Application added a subscriber: Aklapper.
TASK DESCRIPTION RdfStreamingUpdaterHighConsumerUpdateLag is an alert that should be fired when WDQS or WCQS machine has a lag greater than 10 minutes. Looking over [[ https://thanos.wikimedia.org/graph?g0.expr=wdqs_streaming_updater_kafka_stream_consumer_lag_Value%20%2F%201000%20%3E%20600&g0.tab=0&g0.stacked=0&g0.range_input=8w&g0.max_source_resolution=0s&g0.deduplicate=1&g0.partial_response=0&g0.store_matches=%5B%5D&g0.end_input=2022-09-01%2017%3A16%3A22&g0.moment_input=2022-09-01%2017%3A16%3A22 | historical data ]] it should have been fired on 2022-07-27 and 2022-08-08 but it looks like they were not fired: | | - [[ alert history | https://logstash.wikimedia.org/app/dashboards#/view/8b1907c0-2062-11ec-85b7-9d1831ce7631?_g=(filters:!(),refreshInterval:(pause:!t,value:0),time:(from:now-9M,to:now))&_a=(description:'Investigate%20alert%20trends%20from%20Icinga%20and%20Alertmanager.',filters:!(),fullScreenMode:!f,options:(hidePanelTitles:!f,useMargins:!t),query:(language:kuery,query:RdfStreamingUpdaterHighConsumerUpdateLag),timeRestore:!t,title:'Alerts%20overview',viewMode:view) ]] - SAL for 20220808 <https://wm-bot.wmflabs.org/libera_logs/%23wikimedia-operations/20220808.txt> - SAL for 20220727 <https://wm-bot.wmflabs.org/libera_logs/%23wikimedia-operations/20220727.txt> AC: - this alert should be fired when one WDQS or WCQS has a lag greater than 10minutes. TASK DETAIL https://phabricator.wikimedia.org/T316882 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: dcausse Cc: dcausse, Aklapper, AWesterinen, MPhamWMF, CBogen, Namenlos314, Gq86, Lucas_Werkmeister_WMDE, EBjune, merbst, Jonas, Xmlizer, jkroll, Wikidata-bugs, Jdouglas, aude, Tobias1984, Manybubbles
_______________________________________________ Wikidata-bugs mailing list -- [email protected] To unsubscribe send an email to [email protected]
