[jira] [Commented] (FLINK-31974) JobManager crashes after KubernetesClientException exception with FatalExitExceptionHandler

2023-06-01 Thread Weijie Guo (Jira)
[ https://issues.apache.org/jira/browse/FLINK-31974?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17728363#comment-17728363 ] Weijie Guo commented on FLINK-31974: master(1.18) via 3b9f7cf8ffcd357f252f62dee62d26dbc6a76e91.

[jira] [Commented] (FLINK-31974) JobManager crashes after KubernetesClientException exception with FatalExitExceptionHandler

2023-05-30 Thread Gyula Fora (Jira)
[ https://issues.apache.org/jira/browse/FLINK-31974?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17727499#comment-17727499 ] Gyula Fora commented on FLINK-31974: No worries, I will assign it to myself and will work on this

[jira] [Commented] (FLINK-31974) JobManager crashes after KubernetesClientException exception with FatalExitExceptionHandler

2023-05-30 Thread Weijie Guo (Jira)
[ https://issues.apache.org/jira/browse/FLINK-31974?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17727382#comment-17727382 ] Weijie Guo commented on FLINK-31974: [~gyfora] Sorry, I am quite busy recently, feel free to

[jira] [Commented] (FLINK-31974) JobManager crashes after KubernetesClientException exception with FatalExitExceptionHandler

2023-05-30 Thread Gyula Fora (Jira)
[ https://issues.apache.org/jira/browse/FLINK-31974?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17727355#comment-17727355 ] Gyula Fora commented on FLINK-31974: [~Weijie Guo] are you working on this ticket? > JobManager

[jira] [Commented] (FLINK-31974) JobManager crashes after KubernetesClientException exception with FatalExitExceptionHandler

2023-05-05 Thread Xintong Song (Jira)
[ https://issues.apache.org/jira/browse/FLINK-31974?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17719744#comment-17719744 ] Xintong Song commented on FLINK-31974: -- Thanks all for the explanation and patience. It seems

[jira] [Commented] (FLINK-31974) JobManager crashes after KubernetesClientException exception with FatalExitExceptionHandler

2023-05-05 Thread Matthias Pohl (Jira)
[ https://issues.apache.org/jira/browse/FLINK-31974?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17719727#comment-17719727 ] Matthias Pohl commented on FLINK-31974: --- [~sergiosp] I guess it's not necessary to provide the

[jira] [Commented] (FLINK-31974) JobManager crashes after KubernetesClientException exception with FatalExitExceptionHandler

2023-05-04 Thread Xintong Song (Jira)
[ https://issues.apache.org/jira/browse/FLINK-31974?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17719580#comment-17719580 ] Xintong Song commented on FLINK-31974: -- cc [~wangyang0918] > JobManager crashes after

[jira] [Commented] (FLINK-31974) JobManager crashes after KubernetesClientException exception with FatalExitExceptionHandler

2023-05-04 Thread Sergio Sainz (Jira)
[ https://issues.apache.org/jira/browse/FLINK-31974?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17719525#comment-17719525 ] Sergio Sainz commented on FLINK-31974: -- Hi [~mapohl] - let me setup a new cluster later on to get

[jira] [Commented] (FLINK-31974) JobManager crashes after KubernetesClientException exception with FatalExitExceptionHandler

2023-05-04 Thread Thomas Weise (Jira)
[ https://issues.apache.org/jira/browse/FLINK-31974?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17719355#comment-17719355 ] Thomas Weise commented on FLINK-31974: -- There are many cases where errors are transient. This

[jira] [Commented] (FLINK-31974) JobManager crashes after KubernetesClientException exception with FatalExitExceptionHandler

2023-05-04 Thread Xintong Song (Jira)
[ https://issues.apache.org/jira/browse/FLINK-31974?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17719331#comment-17719331 ] Xintong Song commented on FLINK-31974: -- [~gyfora], bq. Flink treats only very few errors fatal. IO

[jira] [Commented] (FLINK-31974) JobManager crashes after KubernetesClientException exception with FatalExitExceptionHandler

2023-05-04 Thread Jira
[ https://issues.apache.org/jira/browse/FLINK-31974?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17719275#comment-17719275 ] Márton Balassi commented on FLINK-31974: In the specific case I much prefer the behaviour

[jira] [Commented] (FLINK-31974) JobManager crashes after KubernetesClientException exception with FatalExitExceptionHandler

2023-05-04 Thread Gyula Fora (Jira)
[ https://issues.apache.org/jira/browse/FLINK-31974?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17719249#comment-17719249 ] Gyula Fora commented on FLINK-31974: cc [~mbalassi] [~mxm] [~thw]  > JobManager crashes after

[jira] [Commented] (FLINK-31974) JobManager crashes after KubernetesClientException exception with FatalExitExceptionHandler

2023-05-04 Thread Gyula Fora (Jira)
[ https://issues.apache.org/jira/browse/FLINK-31974?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17719247#comment-17719247 ] Gyula Fora commented on FLINK-31974: Flink treats only very few errors fatal. IO errors, connector

[jira] [Commented] (FLINK-31974) JobManager crashes after KubernetesClientException exception with FatalExitExceptionHandler

2023-05-04 Thread Xintong Song (Jira)
[ https://issues.apache.org/jira/browse/FLINK-31974?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17719228#comment-17719228 ] Xintong Song commented on FLINK-31974: -- [~gyfora], IMO, errors that Flink cannot recover from by

[jira] [Commented] (FLINK-31974) JobManager crashes after KubernetesClientException exception with FatalExitExceptionHandler

2023-05-04 Thread Gyula Fora (Jira)
[ https://issues.apache.org/jira/browse/FLINK-31974?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17719213#comment-17719213 ] Gyula Fora commented on FLINK-31974: [~xtsong] what errors would you consider actually fatal in

[jira] [Commented] (FLINK-31974) JobManager crashes after KubernetesClientException exception with FatalExitExceptionHandler

2023-05-04 Thread Xintong Song (Jira)
[ https://issues.apache.org/jira/browse/FLINK-31974?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17719209#comment-17719209 ] Xintong Song commented on FLINK-31974: -- Not sure about never giving fatal exceptions. I personally

[jira] [Commented] (FLINK-31974) JobManager crashes after KubernetesClientException exception with FatalExitExceptionHandler

2023-05-04 Thread Gyula Fora (Jira)
[ https://issues.apache.org/jira/browse/FLINK-31974?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17719196#comment-17719196 ] Gyula Fora commented on FLINK-31974: Somewhat of a side comment: I think in native kubernetes

[jira] [Commented] (FLINK-31974) JobManager crashes after KubernetesClientException exception with FatalExitExceptionHandler

2023-05-04 Thread Xintong Song (Jira)
[ https://issues.apache.org/jira/browse/FLINK-31974?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17719187#comment-17719187 ] Xintong Song commented on FLINK-31974: -- [~mapohl], There're two paths for JobMaster to handle the

[jira] [Commented] (FLINK-31974) JobManager crashes after KubernetesClientException exception with FatalExitExceptionHandler

2023-05-04 Thread Matthias Pohl (Jira)
[ https://issues.apache.org/jira/browse/FLINK-31974?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17719176#comment-17719176 ] Matthias Pohl commented on FLINK-31974: --- Sounds good to me, too. Just for me to understand: With

[jira] [Commented] (FLINK-31974) JobManager crashes after KubernetesClientException exception with FatalExitExceptionHandler

2023-05-03 Thread Weijie Guo (Jira)
[ https://issues.apache.org/jira/browse/FLINK-31974?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17719144#comment-17719144 ] Weijie Guo commented on FLINK-31974: Thanks Xintong for the analysis and proposal, It makes sense to

[jira] [Commented] (FLINK-31974) JobManager crashes after KubernetesClientException exception with FatalExitExceptionHandler

2023-05-03 Thread Xintong Song (Jira)
[ https://issues.apache.org/jira/browse/FLINK-31974?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17719116#comment-17719116 ] Xintong Song commented on FLINK-31974: -- Thanks [~sergiosp] for reporting, and thanks [~mapohl] for

[jira] [Commented] (FLINK-31974) JobManager crashes after KubernetesClientException exception with FatalExitExceptionHandler

2023-05-03 Thread Matthias Pohl (Jira)
[ https://issues.apache.org/jira/browse/FLINK-31974?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17718964#comment-17718964 ] Matthias Pohl commented on FLINK-31974: --- I'm still wondering what the desired behavior in that

[jira] [Commented] (FLINK-31974) JobManager crashes after KubernetesClientException exception with FatalExitExceptionHandler

2023-05-03 Thread Matthias Pohl (Jira)
[ https://issues.apache.org/jira/browse/FLINK-31974?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17718958#comment-17718958 ] Matthias Pohl commented on FLINK-31974: --- Thanks for reporting. This is caused by the changes that