The GitHub Actions job "Tests" on airflow.git has failed.
Run started by GitHub user potiuk (triggered by potiuk).

Head commit for run:
41ae04001c702d4754579434891442995d4be065 / Jarek Potiuk <[email protected]>
Improve debuggability of K8S tests run in CI

The K8S tests when run in parallel for multiple versions require:

a) vast amount of memory to keep all the cluster running
b) extra time to startup for all k8s tests

Failure in CI for K8S jobs produced really long logs that
were next to impossible to analyse any problems (for example
problems involved with lack of memory or with overwhelming K8S
with too many requests when there were multiple clusters and
multiple tests runnning (Connection Reset by Peer).

The change introduces improved resource debugging and increased timeouts
and tries on webserver health so that all k8S have enough time and
resources to run.

The resource dump from K8S is dumped to separate file, otherwise
it makes it impossible to analyse logs in Github UI and clutters
enormously the output of failed k8s tests.

Those resource dumps can be downloaded separately as they are uploaded
as artifacts in case of failure.

Assertion which was thrown in "setUp" of K8S test base did not give
too much information about what the problem was so it has been now
replaced with a much more user-friendly message showing the actual
reason (could not check health of the webserver) and you can see
"connection reset by peer" alongside the error showing the root
of the problem.

Report URL: https://github.com/apache/airflow/actions/runs/3210729410

With regards,
GitHub Actions via GitBox


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to