[
https://issues.apache.org/jira/browse/FLINK-19545?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17237779#comment-17237779
]
Kevin Kwon commented on FLINK-19545:
------------------------------------
[~fly_in_gis] Is https://issues.apache.org/jira/browse/FLINK-20113 related?
seems the ticket is closed now. I've also tried some e2e test on my local
machine with Kind cluster. After killing the master node, it seems the
checkpoints are starting off where it left off
I'll share the test result here if I can later. I've got many mock infra
connected as well like S3 mock and a single broker Kafka.
> Add e2e test for native Kubernetes HA
> -------------------------------------
>
> Key: FLINK-19545
> URL: https://issues.apache.org/jira/browse/FLINK-19545
> Project: Flink
> Issue Type: Sub-task
> Components: Tests
> Reporter: Yang Wang
> Assignee: Yang Wang
> Priority: Major
> Labels: pull-request-available
> Fix For: 1.12.0
>
>
> We could use minikube for the E2E tests. Start a Flink session/application
> cluster on K8s, kill one TaskManager pod or JobManager Pod and wait for the
> job recovered from the latest checkpoint successfully.
> {panel}
> {panel}
> |{{kubectl }}{{exec}} {{-it \{pod_name} -- }}{{/bin/sh}} {{-c }}{{"kill 1"}}|
--
This message was sent by Atlassian Jira
(v8.3.4#803005)