mtraynham commented on issue #21265:
URL: https://github.com/apache/airflow/issues/21265#issuecomment-1030250466


   We are seeing similar issues after upgrading from 2.1.4 to 2.2.3. On our 
scheduler, this has been somewhat mitigated by increasing both 
`AIRFLOW__SCHEDULER__DAG_DIR_LIST_INTERVAL` and 
`AIRFLOW__SCHEDULER__MIN_FILE_PROCESS_INTERVAL` from 5 seconds to 10 
minutes.
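
   For reference, both settings are given in seconds, so 10 minutes is `600`. 
A minimal sketch of the resulting environment, assuming the values are passed 
as environment variables to the scheduler; the helper name 
`scheduler_interval_env` is hypothetical and only illustrates the conversion:

```python
from datetime import timedelta
from typing import Dict


def scheduler_interval_env(interval: timedelta) -> Dict[str, str]:
    """Return both scheduler interval settings as env vars (values in seconds).

    Hypothetical helper for illustration; it is not part of Airflow.
    """
    seconds = str(int(interval.total_seconds()))
    return {
        "AIRFLOW__SCHEDULER__DAG_DIR_LIST_INTERVAL": seconds,
        "AIRFLOW__SCHEDULER__MIN_FILE_PROCESS_INTERVAL": seconds,
    }


# 10 minutes, the value described in the comment above.
env = scheduler_interval_env(timedelta(minutes=10))
for name, value in env.items():
    print(f"export {name}={value}")
```

   The printed `export` lines can be pasted into whatever sets up the 
scheduler's environment.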
   
   However, I am still seeing it on our Celery worker, where it effectively 
blocks tasks from running for 80+ seconds before it times out.
   
   I suspect our error is related to `Initializing Providers 
Manager[import_all_hooks]`, which is new in 2.2.x and seems to almost 
always fail with `Exception when importing 
'airflow.providers.docker.hooks.docker.DockerHook' from 
'apache-airflow-providers-docker' package`. Our log is below.
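
   One way to check whether a single hook import is the slow step is to time 
it in isolation on the worker. The sketch below is an assumption about how one 
might do that, not how Airflow's providers manager works internally; 
`timed_import` is a hypothetical helper, and a standard-library class stands 
in for the hook so the snippet runs anywhere:

```python
import importlib
import time
from typing import Tuple


def timed_import(dotted_path: str) -> Tuple[object, float]:
    """Import `package.module.ClassName` and return (object, elapsed seconds)."""
    module_path, _, attr_name = dotted_path.rpartition(".")
    start = time.perf_counter()
    module = importlib.import_module(module_path)
    obj = getattr(module, attr_name)
    return obj, time.perf_counter() - start


# Stand-in target so the sketch is runnable without Airflow installed.
# On a worker, replace it with the path from the warning, e.g.
# "airflow.providers.docker.hooks.docker.DockerHook".
target = "json.JSONDecoder"
obj, elapsed = timed_import(target)
print(f"Imported {target} in {elapsed:.3f}s")
```

   Note that a module already imported in the same process returns almost 
instantly, so the measurement is only meaningful in a fresh interpreter.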
   
   @dcardinha @WattsInABox, can either of you turn on DEBUG logging to see 
whether you get something similar?
   
   ```
   2022-02-04T08:14:30.745502939Z [2022-02-04 08:14:30,745: 
INFO/ForkPoolWorker-15] Filling up the DagBag from 
/home/airflow/.local/lib/python3.8/site-packages/foobar/runner/__init__.py
   2022-02-04T08:14:30.746292606Z [2022-02-04 08:14:30,746: 
DEBUG/ForkPoolWorker-15] Importing 
/home/airflow/.local/lib/python3.8/site-packages/foobar/runner/__init__.py
   2022-02-04T08:14:31.084507410Z [2022-02-04 08:14:31,084: 
DEBUG/ForkPoolWorker-15] Initializing Providers Manager[hooks]
   2022-02-04T08:14:31.084556312Z [2022-02-04 08:14:31,084: 
DEBUG/ForkPoolWorker-15] Initializing Providers Manager[list]
   2022-02-04T08:14:31.095732961Z [2022-02-04 08:14:31,095: 
DEBUG/ForkPoolWorker-15] Loading EntryPoint(name='provider_info', 
value='airflow.providers.grpc.get_provider_info:get_provider_info', 
group='apache_airflow_provider') from package apache-airflow-providers-grpc
   2022-02-04T08:14:31.097942644Z [2022-02-04 08:14:31,097: 
DEBUG/ForkPoolWorker-15] Loading EntryPoint(name='provider_info', 
value='airflow.providers.docker.get_provider_info:get_provider_info', 
group='apache_airflow_provider') from package apache-airflow-providers-docker
   2022-02-04T08:14:31.100267595Z [2022-02-04 08:14:31,100: 
DEBUG/ForkPoolWorker-15] Loading EntryPoint(name='provider_info', 
value='airflow.providers.google.get_provider_info:get_provider_info', 
group='apache_airflow_provider') from package apache-airflow-providers-google
   2022-02-04T08:14:31.104712785Z [2022-02-04 08:14:31,104: 
DEBUG/ForkPoolWorker-15] Loading EntryPoint(name='provider_info', 
value='airflow.providers.cncf.kubernetes.get_provider_info:get_provider_info', 
group='apache_airflow_provider') from package 
apache-airflow-providers-cncf-kubernetes
   2022-02-04T08:14:31.106863937Z [2022-02-04 08:14:31,106: 
DEBUG/ForkPoolWorker-15] Loading EntryPoint(name='provider_info', 
value='airflow.providers.http.get_provider_info:get_provider_info', 
group='apache_airflow_provider') from package apache-airflow-providers-http
   2022-02-04T08:14:31.109014206Z [2022-02-04 08:14:31,108: 
DEBUG/ForkPoolWorker-15] Loading EntryPoint(name='provider_info', 
value='airflow.providers.redis.get_provider_info:get_provider_info', 
group='apache_airflow_provider') from package apache-airflow-providers-redis
   2022-02-04T08:14:31.111201541Z [2022-02-04 08:14:31,110: 
DEBUG/ForkPoolWorker-15] Loading EntryPoint(name='provider_info', 
value='airflow.providers.mysql.get_provider_info:get_provider_info', 
group='apache_airflow_provider') from package apache-airflow-providers-mysql
   2022-02-04T08:14:31.113902110Z [2022-02-04 08:14:31,113: 
DEBUG/ForkPoolWorker-15] Loading EntryPoint(name='provider_info', 
value='airflow.providers.amazon.get_provider_info:get_provider_info', 
group='apache_airflow_provider') from package apache-airflow-providers-amazon
   2022-02-04T08:14:31.116398514Z [2022-02-04 08:14:31,116: 
DEBUG/ForkPoolWorker-15] Loading EntryPoint(name='provider_info', 
value='airflow.providers.elasticsearch.get_provider_info:get_provider_info', 
group='apache_airflow_provider') from package 
apache-airflow-providers-elasticsearch
   2022-02-04T08:14:31.118252974Z [2022-02-04 08:14:31,118: 
DEBUG/ForkPoolWorker-15] Loading EntryPoint(name='provider_info', 
value='airflow.providers.ftp.get_provider_info:get_provider_info', 
group='apache_airflow_provider') from package apache-airflow-providers-ftp
   2022-02-04T08:14:31.119932342Z [2022-02-04 08:14:31,119: 
DEBUG/ForkPoolWorker-15] Loading EntryPoint(name='provider_info', 
value='airflow.providers.sftp.get_provider_info:get_provider_info', 
group='apache_airflow_provider') from package apache-airflow-providers-sftp
   2022-02-04T08:14:31.121772776Z [2022-02-04 08:14:31,121: 
DEBUG/ForkPoolWorker-15] Loading EntryPoint(name='provider_info', 
value='airflow.providers.ssh.get_provider_info:get_provider_info', 
group='apache_airflow_provider') from package apache-airflow-providers-ssh
   2022-02-04T08:14:31.123516831Z [2022-02-04 08:14:31,123: 
DEBUG/ForkPoolWorker-15] Loading EntryPoint(name='provider_info', 
value='airflow.providers.sendgrid.get_provider_info:get_provider_info', 
group='apache_airflow_provider') from package apache-airflow-providers-sendgrid
   2022-02-04T08:14:31.125036267Z [2022-02-04 08:14:31,124: 
DEBUG/ForkPoolWorker-15] Loading EntryPoint(name='provider_info', 
value='airflow.providers.hashicorp.get_provider_info:get_provider_info', 
group='apache_airflow_provider') from package apache-airflow-providers-hashicorp
   2022-02-04T08:14:31.127102702Z [2022-02-04 08:14:31,126: 
DEBUG/ForkPoolWorker-15] Loading EntryPoint(name='provider_info', 
value='airflow.providers.microsoft.azure.get_provider_info:get_provider_info', 
group='apache_airflow_provider') from package 
apache-airflow-providers-microsoft-azure
   2022-02-04T08:14:31.131129199Z [2022-02-04 08:14:31,130: 
DEBUG/ForkPoolWorker-15] Loading EntryPoint(name='provider_info', 
value='airflow.providers.slack.get_provider_info:get_provider_info', 
group='apache_airflow_provider') from package apache-airflow-providers-slack
   2022-02-04T08:14:31.135302807Z [2022-02-04 08:14:31,135: 
DEBUG/ForkPoolWorker-15] Loading EntryPoint(name='provider_info', 
value='airflow.providers.celery.get_provider_info:get_provider_info', 
group='apache_airflow_provider') from package apache-airflow-providers-celery
   2022-02-04T08:14:31.136855733Z [2022-02-04 08:14:31,136: 
DEBUG/ForkPoolWorker-15] Loading EntryPoint(name='provider_info', 
value='airflow.providers.imap.get_provider_info:get_provider_info', 
group='apache_airflow_provider') from package apache-airflow-providers-imap
   2022-02-04T08:14:31.138380123Z [2022-02-04 08:14:31,138: 
DEBUG/ForkPoolWorker-15] Loading EntryPoint(name='provider_info', 
value='airflow.providers.sqlite.get_provider_info:get_provider_info', 
group='apache_airflow_provider') from package apache-airflow-providers-sqlite
   2022-02-04T08:14:31.140693159Z [2022-02-04 08:14:31,140: 
DEBUG/ForkPoolWorker-15] Loading EntryPoint(name='provider_info', 
value='airflow.providers.odbc.get_provider_info:get_provider_info', 
group='apache_airflow_provider') from package apache-airflow-providers-odbc
   2022-02-04T08:14:31.142356555Z [2022-02-04 08:14:31,142: 
DEBUG/ForkPoolWorker-15] Loading EntryPoint(name='provider_info', 
value='airflow.providers.postgres.get_provider_info:get_provider_info', 
group='apache_airflow_provider') from package apache-airflow-providers-postgres
   2022-02-04T08:14:31.146554304Z [2022-02-04 08:14:31,146: 
DEBUG/ForkPoolWorker-15] Initialization of Providers Manager[list] took 0.06 
seconds
   2022-02-04T08:14:31.146788306Z [2022-02-04 08:14:31,146: 
DEBUG/ForkPoolWorker-15] Initialization of Providers Manager[hooks] took 0.06 
seconds
   2022-02-04T08:14:31.146835613Z [2022-02-04 08:14:31,146: 
DEBUG/ForkPoolWorker-15] Initializing Providers Manager[import_all_hooks]
   2022-02-04T08:16:06.682682831Z [2022-02-04 08:16:06,681: 
ERROR/ForkPoolWorker-15] Process timed out, PID: 9579
   2022-02-04T08:16:06.683022858Z [2022-02-04 08:16:06,682: 
WARNING/ForkPoolWorker-15] Exception when importing 
'airflow.providers.docker.hooks.docker.DockerHook' from 
'apache-airflow-providers-docker' package: DagBag import timeout for 
/home/airflow/.local/lib/python3.8/site-packages/foobar/runner/__init__.py 
after 30.0s.
   2022-02-04T08:16:06.683046483Z Please take a look at these docs to improve 
your DAG import time:
   2022-02-04T08:16:06.683052855Z * 
https://airflow.apache.org/docs/apache-airflow/2.2.3/best-practices.html#top-level-python-code
   2022-02-04T08:16:06.683058750Z * 
https://airflow.apache.org/docs/apache-airflow/2.2.3/best-practices.html#reducing-dag-complexity,
 PID: 9579
   2022-02-04T08:16:07.332734255Z [2022-02-04 08:16:07,332: 
DEBUG/ForkPoolWorker-15] Exception when importing 
'airflow.providers.google.leveldb.hooks.leveldb.LevelDBHook' from 
'apache-airflow-providers-google' package: No module named 'plyvel'
   2022-02-04T08:16:07.761798052Z [2022-02-04 08:16:07,761: 
DEBUG/ForkPoolWorker-15] Initialization of Providers Manager[import_all_hooks] 
took 96.61 seconds
   ```
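
   The stall in the log above can be quantified from the bracketed 
timestamps. This is a small sketch that hard-codes three timestamps copied 
from the log; the variable names are illustrative only:

```python
from datetime import datetime

# Format of the bracketed Celery timestamps, e.g. "2022-02-04 08:14:31,146".
LOG_TIME_FORMAT = "%Y-%m-%d %H:%M:%S,%f"


def parse_log_time(stamp: str) -> datetime:
    """Parse one bracketed timestamp from the worker log."""
    return datetime.strptime(stamp, LOG_TIME_FORMAT)


# Timestamps copied from the log above.
hooks_start = parse_log_time("2022-02-04 08:14:31,146")   # import_all_hooks begins
timed_out = parse_log_time("2022-02-04 08:16:06,681")     # "Process timed out"
hooks_done = parse_log_time("2022-02-04 08:16:07,761")    # initialization finishes

stall_seconds = (timed_out - hooks_start).total_seconds()
total_seconds = (hooks_done - hooks_start).total_seconds()

print(f"Silent gap before timeout: {stall_seconds:.3f}s")
print(f"Total import_all_hooks time: {total_seconds:.3f}s")
```

   The total agrees with the `took 96.61 seconds` line, and the silent gap of 
roughly 95.5 seconds is well beyond the `30.0s` quoted in the DagBag import 
timeout message.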


