dcausse created this task.
dcausse added projects: Wikidata-Query-Service, Wikidata, Discovery-Search 
(Current work).

TASK DESCRIPTION
  As a maintainer of a Flink session cluster, I want to be alerted when the 
number of taskmanagers does not match what the deployment expects, so that I 
can react quickly.
  
  k8s may prefer to restart containers on a broken k8s node rather than 
reschedule the pod onto another node (see parent ticket). To k8s the 
deployment may appear to be working properly, but Flink does not get the 
resources it expects, and the job it is supposed to run remains in the 
SCHEDULED state.
  
  AC:
  
  - alert when the number of task managers is below a certain threshold
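
One possible shape for such a check, as a minimal sketch rather than the actual alerting setup: it assumes the JobManager REST API is reachable and that its `/overview` response carries a `taskmanagers` count. The base URL, the expected count and the function names are hypothetical and would come from the deployment.

```python
import json
from urllib.request import urlopen


def missing_taskmanagers(overview: dict, expected: int) -> int:
    """Return how many taskmanagers are missing compared to `expected`.

    `overview` is the parsed JSON body of the JobManager overview endpoint;
    the "taskmanagers" key is an assumption about that response. A missing
    key is treated as zero registered taskmanagers.
    """
    registered = int(overview.get("taskmanagers", 0))
    return max(expected - registered, 0)


def fetch_overview(base_url: str, timeout: float = 5.0) -> dict:
    """Fetch the cluster overview from a (hypothetical) JobManager base URL."""
    with urlopen(f"{base_url.rstrip('/')}/overview", timeout=timeout) as resp:
        return json.load(resp)


def should_alert(overview: dict, expected: int) -> bool:
    """Alert when fewer taskmanagers are registered than the deployment expects."""
    return missing_taskmanagers(overview, expected) > 0
```

For example, `should_alert(fetch_overview("http://jobmanager.example:8081"), expected=6)` would be true while only 4 taskmanagers are registered. Keeping the comparison separate from the HTTP call lets the threshold logic be tested without a running cluster.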

TASK DETAIL
  https://phabricator.wikimedia.org/T305068

To: dcausse
Cc: Aklapper, Michael, JMeybohm, Addshore, toan, bking, RKemper, Gehel, 
akosiaris, elukey, dcausse, Astuthiodit_1, karapayneWMDE, Invadibot, MPhamWMF, 
maantietaja, CBogen, ItamarWMDE, Akuckartz, Nandana, Namenlos314, Lahi, Gq86, 
Lucas_Werkmeister_WMDE, GoranSMilovanovic, QZanden, EBjune, merbst, 
LawExplorer, _jensen, rosalieper, Scott_WUaS, Jonas, Xmlizer, jkroll, 
Wikidata-bugs, Jdouglas, aude, Tobias1984, Manybubbles, Mbch331