[ https://issues.apache.org/jira/browse/FLINK-26370?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Gyula Fora updated FLINK-26370: ------------------------------- Description: In the current architecture calls to the flink clusters (through the rest client) are made synchronously from the reconcile loop. These calls often take a long time due to various (compeltely normal) reasons: - Cluster is not ready -> long call + timeoutexception - Operation takes a long time -> cancel/savepoint operations are often expected to take seconds/minutes Both the observer and reconciler components make these calls. We should come up with a way to avoid making these sync calls from the main loop while still preserving the logic of the operator. was: In the current architecture calls to the flink clusters (through the rest client) are made synchronously from the reconcile loop. These calls often take a long time due to various (compeltely normal) reasons: - Cluster is not ready -> long call + timeoutexception - Operation takes a long time -> cancel/savepoint operations are often expected to take seconds/minutes We should come up with a way to avoid making these sync calls from the main loop while still preserving the logic of the operator. > Make Flink cluster communication asynchronous > --------------------------------------------- > > Key: FLINK-26370 > URL: https://issues.apache.org/jira/browse/FLINK-26370 > Project: Flink > Issue Type: Sub-task > Components: Kubernetes Operator > Reporter: Gyula Fora > Priority: Major > > In the current architecture calls to the flink clusters (through the rest > client) are made synchronously from the reconcile loop. > These calls often take a long time due to various (compeltely normal) reasons: > - Cluster is not ready -> long call + timeoutexception > - Operation takes a long time -> cancel/savepoint operations are often > expected to take seconds/minutes > Both the observer and reconciler components make these calls. > We should come up with a way to avoid making these sync calls from the main > loop while still preserving the logic of the operator. -- This message was sent by Atlassian Jira (v8.20.1#820001)