jens-scheffler-bosch commented on issue #32586: URL: https://github.com/apache/airflow/issues/32586#issuecomment-1634841750
Hi @BShraman, thanks for filing this issue. I marked it as an improvement because it is something that enhances the current situation. I understand that users expect a very reliable platform.

To better understand your needs, can you describe your expectations in a bit more detail? Would you "just log" internal warnings? Should the logs be structured and machine-readable? Should they be integrated into some kind of (existing?) monitoring solution, or is it enough to just expose the data? Do you also expect some kind of alerting? If yes, how? Can you describe what kinds of disruptions and impact you have noticed, or what users are afraid of?

Do you have dedicated support for the platform? Are you looking for (commercial) support that delivers an SLA? Or do you expect data engineers to fix problems themselves? Do you have an operations team? Are you using Kubernetes and the standard Helm chart, which includes liveness probes for standard self-healing?

--
This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
