[
https://issues.apache.org/jira/browse/FLINK-19545?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17237796#comment-17237796
]
Yang Wang commented on FLINK-19545:
-----------------------------------
[~ksp0422] Do you mean that you killed the JobManager and the new JobManager
could not recover from the latest successful checkpoint? Please share more
details (for example, the JobManager logs) if you can.
If it turns out to be a valid issue, we can reopen the ticket FLINK-20133.
> Add e2e test for native Kubernetes HA
> -------------------------------------
>
> Key: FLINK-19545
> URL: https://issues.apache.org/jira/browse/FLINK-19545
> Project: Flink
> Issue Type: Sub-task
> Components: Tests
> Reporter: Yang Wang
> Assignee: Yang Wang
> Priority: Major
> Labels: pull-request-available
> Fix For: 1.12.0
>
>
> We could use minikube for the E2E tests. Start a Flink session/application
> cluster on K8s, kill one TaskManager pod or JobManager pod, and wait for the
> job to recover successfully from the latest checkpoint.
> kubectl exec -it {pod_name} -- /bin/sh -c "kill 1"
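
The kill-and-wait procedure described above could be scripted roughly as follows. This is only a sketch, not the actual e2e test: the function names, the KUBECTL and POLL_INTERVAL variables, and the log pattern in the usage note are assumptions made for illustration. The -it flags from the command above are dropped because the script is non-interactive.

```shell
#!/usr/bin/env bash
# Hypothetical helpers for a kill-and-recover check; all names are illustrative.

KUBECTL="${KUBECTL:-kubectl}"          # overridable, e.g. to stub kubectl in a dry run
POLL_INTERVAL="${POLL_INTERVAL:-2}"    # seconds to sleep between log polls

# Kill the main process (PID 1) of the pod's container, as in the command above.
kill_main_process() {
  local pod_name="$1"
  # The exec session may end with a non-zero status when the container stops,
  # so the exit code is deliberately ignored here.
  "$KUBECTL" exec "$pod_name" -- /bin/sh -c "kill 1" || true
}

# Poll the pod logs until a line matching the pattern shows up or the timeout expires.
wait_for_log_line() {
  local pod_name="$1" pattern="$2" timeout_secs="${3:-120}"
  local waited=0
  until "$KUBECTL" logs "$pod_name" 2>/dev/null | grep -q "$pattern"; do
    if [ "$waited" -ge "$timeout_secs" ]; then
      echo "Timed out waiting for '$pattern' in the logs of $pod_name" >&2
      return 1
    fi
    sleep "$POLL_INTERVAL"
    waited=$((waited + POLL_INTERVAL))
  done
}
```

A possible usage, assuming the JobManager pod name is known and that the restarted JobManager logs a line starting with "Restoring job" when it restores from a checkpoint (the exact message should be checked against the Flink version under test):

    kill_main_process "$jm_pod_name"
    wait_for_log_line "$jm_pod_name" "Restoring job" 300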
--
This message was sent by Atlassian Jira
(v8.3.4#803005)