[
https://issues.apache.org/jira/browse/FLINK-25839?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17487921#comment-17487921
]
Yang Wang commented on FLINK-25839:
-----------------------------------
I have merged a PR that prints the previous logs of the failed pod. This should
be very useful for debugging the root cause of this ticket.
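
For context, the "previous" logs of a pod are the logs of its last terminated
container instance, which kubectl exposes through `kubectl logs --previous`.
The following is only a minimal sketch of how such logs could be collected by
hand. It is not the code from the merged PR; the helper names
`previous_logs_command` and `fetch_previous_logs` are hypothetical, and it
assumes `kubectl` is on the PATH and configured for the test cluster.

```python
import subprocess
from typing import List, Optional


def previous_logs_command(pod: str, namespace: Optional[str] = None) -> List[str]:
    """Build the kubectl command that prints the logs of the previous
    (terminated) container instance of a pod.

    Hypothetical helper for illustration only.
    """
    cmd = ["kubectl", "logs", "--previous", pod]
    if namespace:
        cmd += ["--namespace", namespace]
    return cmd


def fetch_previous_logs(pod: str, namespace: Optional[str] = None) -> str:
    """Run the command and return whatever kubectl printed.

    Assumes kubectl is installed. If the pod has no previous container
    instance, kubectl exits with an error, so stderr is returned instead.
    """
    result = subprocess.run(
        previous_logs_command(pod, namespace),
        capture_output=True,
        text=True,
        check=False,  # a missing previous container should not raise here
    )
    return result.stdout if result.returncode == 0 else result.stderr
```

For the pod named in the log below, this would be called as
`fetch_previous_logs("flink-native-k8s-application-ha-1-d8dc997d5-v8cpz")`.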
> 'Run kubernetes application HA test' failed on azure due to could not get 3
> completed checkpoints in 120 sec
> ------------------------------------------------------------------------------------------------------------
>
> Key: FLINK-25839
> URL: https://issues.apache.org/jira/browse/FLINK-25839
> Project: Flink
> Issue Type: Bug
> Components: Deployment / Kubernetes
> Affects Versions: 1.15.0
> Reporter: Yun Gao
> Priority: Critical
> Labels: pull-request-available, test-stability
>
> {code:java}
> Jan 27 02:07:33 deployment.apps/flink-native-k8s-application-ha-1 condition
> met
> Jan 27 02:07:33 Waiting for job
> (flink-native-k8s-application-ha-1-d8dc997d5-v8cpz) to have at least 3
> completed checkpoints ...
> Jan 27 02:09:45 Could not get 3 completed checkpoints in 120 sec
> Jan 27 02:09:45 Stopping job timeout watchdog (with pid=217858)
> Jan 27 02:09:45 Debugging failed Kubernetes test:
> Jan 27 02:09:45 Currently existing Kubernetes resources
> {code}
> https://dev.azure.com/apache-flink/apache-flink/_build/results?buildId=30261&view=logs&j=af885ea8-6b05-5dc2-4a37-eab9c0d1ab09&t=f779a55a-0ffe-5bbc-8824-3a79333d4559&l=5376