potiuk edited a comment on issue #19957:
URL: https://github.com/apache/airflow/issues/19957#issuecomment-1011857056


   I actually spent some time a few days ago looking at the mini-scheduler code 
but I could not really find a flaw there. The fact that it did not help you 
indicates that my hypothesis was unfounded, unfortunately, and maybe the cause 
was different (and the fact that it worked for @stablum was mainly a 
coincidence or some side effect of that change).
   
   @dwiajik - it might also be that your case is a bit different. Could you 
please share some of the logs from your deadlocks (maybe create a gist with a 
few examples)? Ideally, send us the logs of the failing scheduler and the 
corresponding logs of the Postgres server from the same time. I believe it 
will be much easier to investigate if we see a few examples, and the server 
logs should tell us exactly which two queries deadlocked, which should help us 
a lot.
   
   What we really need is something from the /var/lib/pgsql/data/pg_log/*.log 
files. There should be entries at the time when the deadlock happens that look 
like this:
   
   ```
   ERROR:  deadlock detected
   DETAIL:  Process 21535 waits for AccessExclusiveLock on relation 342640 of database 41454; blocked by process 21506.
   Process 21506 waits for AccessExclusiveLock on relation 342637 of database 41454; blocked by process 21535.
   HINT:  See server log for query details.
   CONTEXT:  SQL statement "UPDATE ..."
   ```
   
   Ideally we need those entries plus some of the log lines around them, if 
possible.
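
   If it helps with collecting those, here is a minimal sketch (not an Airflow 
or Postgres tool, just an illustration) of pulling each `deadlock detected` 
entry out of the server logs together with a few lines around it. The function 
names, the `*.log` pattern, and the default window sizes are my assumptions. 
The log directory depends on how your Postgres server is configured, so pass 
whatever path applies to you.

```python
from collections import deque
from pathlib import Path
from typing import Iterable, Iterator, List

# Text Postgres writes when it reports a deadlock (see the sample entry above).
DEADLOCK_MARKER = "deadlock detected"


def extract_deadlock_context(
    lines: Iterable[str], before: int = 5, after: int = 10
) -> Iterator[List[str]]:
    """Yield one chunk of lines per deadlock entry, with surrounding context.

    `before` and `after` are hypothetical window sizes; adjust as needed.
    """
    history = deque(maxlen=before)  # most recent lines seen before a hit
    chunk = None                    # lines collected for the current hit
    remaining = 0                   # trailing lines still to collect
    for raw in lines:
        line = raw.rstrip("\n")
        hit = DEADLOCK_MARKER in line
        if chunk is None:
            if not hit:
                history.append(line)
                continue
            chunk = list(history) + [line]
            remaining = after
        else:
            chunk.append(line)
            # A second deadlock inside the window extends the same chunk.
            remaining = after if hit else remaining - 1
        if remaining <= 0:
            yield chunk
            chunk = None
            history.clear()
    if chunk is not None:
        # The log ended before the trailing window was full.
        yield chunk


def scan_log_dir(log_dir: str, pattern: str = "*.log") -> Iterator[List[str]]:
    """Run the extraction over every file in `log_dir` matching `pattern`."""
    for path in sorted(Path(log_dir).glob(pattern)):
        with path.open(errors="replace") as handle:
            yield from extract_deadlock_context(handle)
```

   Something like `for chunk in scan_log_dir("/var/lib/pgsql/data/pg_log"): print("\n".join(chunk))` 
would then print each deadlock report with its context, ready to paste into a 
gist.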
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@airflow.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

