yansfil opened a new issue #17372:
URL: https://github.com/apache/airflow/issues/17372


   **Apache Airflow version**:
   2.1.0
   
   **Kubernetes version (if you are using kubernetes)** (use `kubectl version`):
   1.20
   
   **Environment**:
   GKE
   
   **What happened**:
   When first initializing an Airflow project on 2.1.0, serializing the DAGs 
is very slow.
   It takes 30 minutes to serialize all DAGs (600 DAGs in my project), so I 
have to wait that long before I can run the DAGs for the first time. 
   DAG parsing in Airflow v1 was faster because it did not write serialized 
DAGs to the database. How can I make serialization faster? 
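
For scale, the figures above work out to a simple average per DAG. This is only a back-of-the-envelope sketch using the two numbers reported here; it assumes the 30 minutes is spent evenly across DAGs, which the report does not confirm, and the variable names are illustrative.

```python
# Figures taken from the report: 30 minutes to serialize 600 DAGs.
total_minutes = 30
dag_count = 600

# Average wall-clock time per DAG, assuming the cost is spread evenly
# (an assumption; individual DAG files may differ widely).
seconds_per_dag = total_minutes * 60 / dag_count

print(f"{seconds_per_dag:.1f} s per DAG on average")
```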
   
   **What you expected to happen**:
   DAG serialization should be faster than it is now.
   
   
   **How to reproduce it**:
   I assigned the scheduler resources of 1g for CPU and 1g for memory, 
   and I configured the airflow.cfg options as below:
   
   AIRFLOW__CORE__MIN_SERIALIZED_DAG_UPDATE_INTERVAL: "5"
   AIRFLOW__CORE__MIN_SERIALIZED_DAG_FETCH_INTERVAL: "5"
   AIRFLOW__SCHEDULER__PARSING_PROCESSES: "8"
   AIRFLOW__SCHEDULER__MIN_FILE_PROCESS_INTERVAL: "0" 
   AIRFLOW__SCHEDULER__DAG_DIR_LIST_INTERVAL: "10"
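
The settings above are written in Airflow's environment-variable form, `AIRFLOW__{SECTION}__{KEY}`, rather than as `airflow.cfg` entries. As a minimal sketch of how they correspond to config-file sections, the snippet below renders them in INI form using only the Python standard library. The helper name `env_to_cfg` is hypothetical and not part of Airflow; the values are copied from the report.

```python
import configparser
import io

# Environment variables exactly as listed in the report.
ENV_SETTINGS = {
    "AIRFLOW__CORE__MIN_SERIALIZED_DAG_UPDATE_INTERVAL": "5",
    "AIRFLOW__CORE__MIN_SERIALIZED_DAG_FETCH_INTERVAL": "5",
    "AIRFLOW__SCHEDULER__PARSING_PROCESSES": "8",
    "AIRFLOW__SCHEDULER__MIN_FILE_PROCESS_INTERVAL": "0",
    "AIRFLOW__SCHEDULER__DAG_DIR_LIST_INTERVAL": "10",
}


def env_to_cfg(env):
    """Render AIRFLOW__{SECTION}__{KEY} variables as INI text (hypothetical helper)."""
    parser = configparser.ConfigParser()
    for name, value in env.items():
        # Split on the double-underscore separator into prefix, section, key.
        parts = name.split("__", 2)
        if len(parts) != 3 or parts[0] != "AIRFLOW":
            continue  # not an Airflow-style variable
        section, key = parts[1].lower(), parts[2].lower()
        if not parser.has_section(section):
            parser.add_section(section)
        parser.set(section, key, value)
    buffer = io.StringIO()
    parser.write(buffer)
    return buffer.getvalue()


cfg_text = env_to_cfg(ENV_SETTINGS)
print(cfg_text)
```

The output groups the two serialization intervals under `[core]` and the three parsing options under `[scheduler]`, which is where the variable names place them.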


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.