potiuk commented on issue #31200: URL: https://github.com/apache/airflow/issues/31200#issuecomment-1546700775
I would love to get to the bottom of it. Can anyone having the problem (with the Airflow Community chart) run the following on their frequently failing scheduler: 1) exec into the container of scheduler while it is running (`/entrypoint bash` should be the right command 2) Run this command: * `date` * then `AIRFLOW__LOGGING__LOGGING_LEVEL=DEBUG airflow jobs check --job-type SchedulerJob --local` several times while airflow is running and save somewhere the output * if you notice that it switches from finding 1 job alive into `No alive jobs found.` do as follows *`date` * run `airflow db shell` * run 'SELECT * from job;' * copy the output * repeat few times and see if it changes (last time seen) * When your scheduler gets killed because of liveness checks -> find the logs of scheduler from before that faiure and dump them * create some way (gists?) to share all the dumped information (please annotate and comment where the log is from) And ping me. @ptran32 @rcheatham-q @arjunanan6 - can I count on your help here? -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
