potiuk commented on issue #31200:
URL: https://github.com/apache/airflow/issues/31200#issuecomment-1546700775

   I would love to get to the bottom of it. Can anyone having the problem (with 
the Airflow Community chart) run the following on their frequently failing 
scheduler:
   
   1) exec into the container of scheduler while it is running (`/entrypoint 
bash` should be the right command
   
   2) Run this command:
   
   * `date`
   *  then `AIRFLOW__LOGGING__LOGGING_LEVEL=DEBUG airflow jobs check --job-type 
SchedulerJob --local`  several times while airflow is running and save 
somewhere the output
   * if you notice that it switches from finding 1 job alive into `No alive 
jobs found.` do as follows
   *`date`
   * run `airflow db shell`
   * run 'SELECT * from job;' 
   * copy the output
   * repeat few times and see if it changes (last time seen)
   * When your scheduler gets killed because of liveness checks -> find the 
logs of scheduler from before that faiure and dump them
   * create some way (gists?) to share all the dumped information (please 
annotate and comment where the log is from)
   
   And ping me.
   
   @ptran32 @rcheatham-q @arjunanan6 - can I count on your help here? 
   
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to