potiuk commented on issue #19192:
URL: https://github.com/apache/airflow/issues/19192#issuecomment-1180221289

   @t4n1o  - (but also others in this thread) a lot of the issues are because 
we have diffuculties with seeing clear reproduction of the problem - and we 
have to rely on users like you to spend their time on trying to analyze the 
settings, configuration, deployment they have, perform enough analysis and get 
enought clues that those who know the code best can make intelligent guesses 
what is wrong even if there is no "full and clear reproduction steps". It's not 
that development time is valuable. Airflow is developed by > 2100 contributors 
- often people like you. The source code is available, anyone can take a look 
and while some people know more about some parts of code, you get the software 
for free, and you cannot "expect" those people to spend a lot of time on trying 
to figure out what's wrong if they have no clear reproduction steps and enough 
clues.
   
   Making analysis and enough evidences to see what you observe is the best you 
can do to pay back for the free software - and possibly give those looking here 
enough clues to fix or direct you how to solve the problem.
   
   So absolutely - if you feel like looking at the code and analysing it is 
something you can offer the community as your "pay back" - this is fantastic.
   
   The scheduler Job is here: 
https://github.com/apache/airflow/blob/main/airflow/jobs/scheduler_job.py. 
   
   But we can give you more than that: 
https://www.youtube.com/watch?v=DYC4-xElccE - this is video from Airlfow Summit 
2021 where Ash explains how scheduler works - i.e. what were the design 
assumptions. And it can guide you in understanding what Scheduler Job does.
   
   Also, before you dive deep, it might well be that your set of DAGs and way 
you structure them is a problem and you can simply follow our guidelines on 
[Fine tuning your scheduler 
performance](https://airflow.apache.org/docs/apache-airflow/stable/concepts/scheduler.html?highlight=fine+tune#fine-tuning-your-scheduler-performance)
   
   
   So if you want to help - absolutely, make more analysis, look at the 
guidelines of ours, if you feel like it, dive deep into how scheduler works and 
look at the code. All that might be great way to get more clues, and evidences, 
and even if you won't be able to fix it in a PR you can give others enough 
clues that that they can find root cause and implement solutions.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to