potiuk commented on PR #39484: URL: https://github.com/apache/airflow/pull/39484#issuecomment-2134339192
> There is no change. It seems that the resource leak is caused by the client rather than the server. Is this possible that you have a network proxy (might be transparent, provided by Kubernetes or other networking configuration) that gets slower and slower as more and more connections are opened? Maybe simply your proxy does not properly clean opened connections to rabbitmq? Do you see a growht in other resources used by any of the components of the system? Maybe - for some reason - there is a problem with killing the processes that are forked and they remain as zombies ? Also - do you happen to have some CPU limits set for the Scheduler POD from the deployment point of view (like K8S CPU limits?) I really would love to understand where the slow-down is coming from -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
