jack created AIRFLOW-3145:
-----------------------------

             Summary: reduce airflow dag scheduling latency in production
                 Key: AIRFLOW-3145
                 URL: https://issues.apache.org/jira/browse/AIRFLOW-3145
             Project: Apache Airflow
          Issue Type: Task
    Affects Versions: 1.10.1
            Reporter: jack


The docs say:
{code:java}
max_threads: Scheduler will spawn multiple threads in parallel to schedule 
dags. This is controlled by max_threads with default value of 2. User should 
increase this value to a larger value(e.g numbers of cpus where scheduler runs 
- 1) in production.{code}

[https://airflow.apache.org/faq.html#how-can-my-airflow-dag-run-faster]

 

 

However, in airflow.cfg the comment is:
{code:java}
# The scheduler can run multiple threads in parallel to schedule dags.
# This defines how many threads will run.
max_threads = 2{code}
 

 

May I suggest that the sentence
{code:java}
User should increase this value to a larger value(e.g numbers of cpus where 
scheduler runs - 1) in production.{code}
also be added to the comment in airflow.cfg?

Not everyone reads the docs from top to bottom, so this could easily be missed.
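
For illustration, the FAQ's rule of thumb (number of CPUs on the scheduler host minus one) could be computed roughly as below. This is only a sketch: the helper name {{suggested_max_threads}} is hypothetical and not part of Airflow, and it assumes that Python's {{os.cpu_count()}} on the scheduler machine reflects the CPUs actually available to the scheduler.

```python
import os


def suggested_max_threads(cpu_count=None):
    """Return the FAQ's rule of thumb: number of CPUs minus one, but at least 1.

    Hypothetical helper for illustration only; it is not part of Airflow.
    """
    if cpu_count is None:
        # os.cpu_count() may return None when the count cannot be determined.
        cpu_count = os.cpu_count() or 1
    # Never suggest fewer than one thread, even on a single-CPU host.
    return max(1, cpu_count - 1)


if __name__ == "__main__":
    # Print a line that could be pasted into the [scheduler] section of airflow.cfg.
    print("max_threads = {}".format(suggested_max_threads()))
```

On an 8-CPU host this would suggest 7, which is the kind of value the FAQ recommends for production instead of the default of 2.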



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
