tillrohrmann commented on a change in pull request #18808:
URL: https://github.com/apache/flink/pull/18808#discussion_r808859824



##########
File path: 
docs/content/docs/deployment/resource-providers/standalone/kubernetes.md
##########
@@ -246,6 +246,17 @@ To use Reactive Mode on Kubernetes, follow the same steps 
as for [deploying a jo
 
 Once you have deployed the *Application Cluster*, you can scale your job up or 
down by changing the replica count in the `flink-taskmanager` deployment.
 
+### Enabling Local Recovery Across Pod Restarts
+
+In order to speed up recoveries in case of pod failures, you can leverage 
Flink's [working directory]({{< ref 
"docs/deployment/resource-providers/standalone/working_directory" >}}) feature 
together with local recovery.
+If the working directory is configured to reside on a persistent volume that 
gets remounted to a restarted TaskManager pod, then Flink is able to recover 
state locally.
+With the 
[StatefulSet](https://kubernetes.io/docs/concepts/workloads/controllers/statefulset/),
 Kubernetes gives you the exact tool you need to map a pod to a persistent 
volume.
+
+So instead of deploying the TaskManagers as a Deployment, you need to 
configure a StatefulSet for the TaskManagers.
+The StatefulSet allows to configure a volume claim template that you use to 
mount persistent volumes to the TaskManagers.

Review comment:
       We also need a mapping between pod and persistent volume. This is not 
possible with a deployment alone because then every pod would get the same 
claim assigned.




-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


Reply via email to