[ https://issues.apache.org/jira/browse/AIRFLOW-2167?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16776940#comment-16776940 ]
Kiran Pathak commented on AIRFLOW-2167:
---------------------------------------

I like your solution. This bug is definitely a bit frustrating :).

> Scheduler's clear_nonexistent_import_errors function should be called on first iteration
> ----------------------------------------------------------------------------------------
>
>                 Key: AIRFLOW-2167
>                 URL: https://issues.apache.org/jira/browse/AIRFLOW-2167
>             Project: Apache Airflow
>          Issue Type: Bug
>          Components: scheduler
>    Affects Versions: 1.9.0
>            Reporter: Casey
>            Assignee: Casey
>            Priority: Minor
>         Attachments: Screen Shot 2018-03-02 at 2.08.29 PM.png
>
>
> In `airflow/jobs.py`, the `clear_nonexistent_import_errors` function is not
> called until the number of seconds defined by `dag_dir_list_interval` has
> elapsed. If the scheduler is not alive for the duration of
> `dag_dir_list_interval` (300 seconds), this cleanup never occurs. In some
> environments this could result in error messages being displayed in the UI
> permanently, even if the DAG has been removed from the environment.
> It was previously an Airflow best practice to have the scheduler run N
> times and then terminate. The scheduler would then be started again by an
> auxiliary process like Docker or Supervisor. This situation is what brought
> the bug to my attention.
> My suggested fix is to tweak jobs.py to run the import error cleanup on the
> first iteration and then periodically, as defined by `dag_dir_list_interval`.
> This way, a scheduler set up with a small number of runs will still have old
> errors cleaned up.
>
>

--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
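
The fix suggested in the issue description (run the import error cleanup on the first iteration, then again every `dag_dir_list_interval` seconds) could be sketched roughly as follows. This is only an illustration of the timing logic, not the actual code in `airflow/jobs.py`: `CleanupTimer`, `run_scheduler_loop`, and the `cleanup` callback are hypothetical names standing in for the scheduler loop and for `clear_nonexistent_import_errors`.

```python
import time


class CleanupTimer:
    """Hypothetical helper: decides when the import error cleanup is due."""

    def __init__(self, interval_seconds, clock=time.monotonic):
        self.interval_seconds = interval_seconds  # e.g. dag_dir_list_interval
        self.clock = clock
        self.last_run = None  # None means "never ran", so the first check fires

    def due(self):
        now = self.clock()
        if self.last_run is None or now - self.last_run >= self.interval_seconds:
            self.last_run = now
            return True
        return False


def run_scheduler_loop(num_runs, timer, cleanup):
    """Stand-in for the scheduler loop (assumption, not Airflow's real loop)."""
    for _ in range(num_runs):
        if timer.due():
            # In Airflow this would be the clear_nonexistent_import_errors call.
            cleanup()
        # ... the rest of one scheduler iteration would go here ...
```

Because `last_run` starts as `None`, a scheduler that exits after only a few iterations still performs the cleanup once, which is the case described in the issue.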