potiuk commented on code in PR #28256:
URL: https://github.com/apache/airflow/pull/28256#discussion_r1057903720
##########
airflow/dag_processing/manager.py:
##########
@@ -777,8 +777,9 @@ def clear_nonexistent_import_errors(self, session):
:param session: session for ORM operations
"""
query = session.query(errors.ImportError)
- if self._file_paths:
- query = query.filter(~errors.ImportError.filename.in_(self._file_paths))
+ files = list_py_file_paths(self._dag_directory, include_examples=False, include_zip_paths=True)
Review Comment:
Do we have an idea how to do it better? I have not looked at all the
processing paths involved, but I am not sure how "expensive" this is. It
seems to be done once per loop, and it is something we have to do periodically
anyway (we cannot really rely on an inotify-style API, given the various
filesystems the DAG folder can be on).
Maybe this check should happen independently, on another thread, in parallel
with parsing? Any other ideas, @ephraimbuddy?
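
To make the "another thread" idea concrete, here is a rough sketch of what I have in mind. It is not based on the actual manager code: `DagFileLister`, `refresh`, `snapshot`, and the `interval` parameter are hypothetical names, and the plain recursive glob is only a stand-in for `list_py_file_paths` (it ignores `.airflowignore` and zip files). The point is just that the directory walk runs off the parsing loop and the loop reads a cached snapshot.

```python
import threading
from pathlib import Path


class DagFileLister:
    """Hypothetical helper: keeps a cached set of DAG file paths fresh in the background."""

    def __init__(self, dag_directory, interval=30.0):
        self._dag_directory = Path(dag_directory)
        self._interval = interval  # seconds between directory walks (assumed value)
        self._lock = threading.Lock()
        self._file_paths = frozenset()
        self._stop = threading.Event()
        self._thread = threading.Thread(target=self._run, daemon=True)

    def refresh(self):
        # Stand-in for list_py_file_paths(): a plain recursive glob.
        found = frozenset(str(p) for p in self._dag_directory.rglob("*.py"))
        with self._lock:
            self._file_paths = found

    def _run(self):
        # Walk the directory, then sleep until the next interval or until stopped.
        while not self._stop.is_set():
            self.refresh()
            self._stop.wait(self._interval)

    def start(self):
        self._thread.start()

    def stop(self):
        self._stop.set()
        self._thread.join()

    def snapshot(self):
        # Cheap read for the parsing loop; never touches the filesystem.
        with self._lock:
            return self._file_paths
```

With something like this, `clear_nonexistent_import_errors` could filter on `lister.snapshot()` instead of walking the DAG folder on every loop. The trade-off is that the snapshot can be up to `interval` seconds stale, so a freshly deleted file would keep its import error a little longer.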