potiuk commented on issue #16025: URL: https://github.com/apache/airflow/issues/16025#issuecomment-855243894
@maxcountryman (and possibly @7yl4r ) - it's next to impossible to help without more information. I think what would be extremely helpful to investigate that one is to take a look at the logs of your worker. I think what would give more information is to setup container insights in your ECS Fargate cluster, including Firelens sending logs to CloudWatch. This way you could see more details for your failing instances - metrics, logs etc. https://docs.aws.amazon.com/AmazonCloudWatch/latest/monitoring/deploy-container-insights-ECS.html Then - with more information we could probably help to pin-point the problems. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: [email protected]
