potiuk opened a new issue #14502: URL: https://github.com/apache/airflow/issues/14502
I have seen a number of cases recently when self-hosted runner build failed/was cancelled without providing any logs. Sometimes it is after 1-2 hours of running. It looks like VM running the builds is stuck and hangs and then gets killed without sending any logs. An example of that is here: https://github.com/apache/airflow/runs/1990835677?check_suite_focus=true I know the job works because it succeeded in one of the previous builds. There are no logs to investigate, It would be great if we have some way of reaching out to the logs of the machine it is running on. Currently only @ashb can investigate those problems and it puts enormous responsibility and makes @ashb Single Point of Failure in those cases. They happened 3-4 times for me over last few days so they are frequent. It would be great if several other people have access to the logs to be able to investigate it, also it would be great to get some links printed in the output of the job where we can get directly to the logs of the machine /run that was executed there. ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: [email protected]
