[jira] [Commented] (FLINK-16215) Start redundant TaskExecutor when JM failed

2021-04-29 Thread Flink Jira Bot (Jira)
[ https://issues.apache.org/jira/browse/FLINK-16215?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17336302#comment-17336302 ] Flink Jira Bot commented on FLINK-16215: This issue was labeled "stale-major" 7 ago and has not

[jira] [Commented] (FLINK-16215) Start redundant TaskExecutor when JM failed

2021-04-22 Thread Flink Jira Bot (Jira)
[ https://issues.apache.org/jira/browse/FLINK-16215?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17328033#comment-17328033 ] Flink Jira Bot commented on FLINK-16215: This major issue is unassigned and itself and all of

[jira] [Commented] (FLINK-16215) Start redundant TaskExecutor when JM failed

2020-03-03 Thread Yangze Guo (Jira)
[ https://issues.apache.org/jira/browse/FLINK-16215?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17050678#comment-17050678 ] Yangze Guo commented on FLINK-16215: [~liuyufei] Hi, after a deeper investigation, we believe

[jira] [Commented] (FLINK-16215) Start redundant TaskExecutor when JM failed

2020-02-27 Thread YufeiLiu (Jira)
[ https://issues.apache.org/jira/browse/FLINK-16215?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17046515#comment-17046515 ] YufeiLiu commented on FLINK-16215: -- [~xintongsong] LGTM. I think can handle almost all scenarios. And I

[jira] [Commented] (FLINK-16215) Start redundant TaskExecutor when JM failed

2020-02-27 Thread Xintong Song (Jira)
[ https://issues.apache.org/jira/browse/FLINK-16215?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17046417#comment-17046417 ] Xintong Song commented on FLINK-16215: -- [~liuyufei], Is it possible that we first assume all the

[jira] [Commented] (FLINK-16215) Start redundant TaskExecutor when JM failed

2020-02-27 Thread YufeiLiu (Jira)
[ https://issues.apache.org/jira/browse/FLINK-16215?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17046335#comment-17046335 ] YufeiLiu commented on FLINK-16215: -- [~xintongsong] I'm thinking about to reuse the slots will be

[jira] [Commented] (FLINK-16215) Start redundant TaskExecutor when JM failed

2020-02-26 Thread Xintong Song (Jira)
[ https://issues.apache.org/jira/browse/FLINK-16215?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17046241#comment-17046241 ] Xintong Song commented on FLINK-16215: -- I believe the problem of identifying recovered containers

[jira] [Commented] (FLINK-16215) Start redundant TaskExecutor when JM failed

2020-02-26 Thread Xintong Song (Jira)
[ https://issues.apache.org/jira/browse/FLINK-16215?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17046207#comment-17046207 ] Xintong Song commented on FLINK-16215: -- Just trying to understand, why do we need to block

[jira] [Commented] (FLINK-16215) Start redundant TaskExecutor when JM failed

2020-02-26 Thread Yangze Guo (Jira)
[ https://issues.apache.org/jira/browse/FLINK-16215?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17046198#comment-17046198 ] Yangze Guo commented on FLINK-16215: [~liuyufei] SlotManager will decide what and how many

[jira] [Commented] (FLINK-16215) Start redundant TaskExecutor when JM failed

2020-02-26 Thread YufeiLiu (Jira)
[ https://issues.apache.org/jira/browse/FLINK-16215?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17046195#comment-17046195 ] YufeiLiu commented on FLINK-16215: -- [~xintongsong] This sounds good. We can do some actual recovery

[jira] [Commented] (FLINK-16215) Start redundant TaskExecutor when JM failed

2020-02-26 Thread Xintong Song (Jira)
[ https://issues.apache.org/jira/browse/FLINK-16215?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17046062#comment-17046062 ] Xintong Song commented on FLINK-16215: -- [~liuyufei] I see your point. You mean for FLINK-15959, in

[jira] [Commented] (FLINK-16215) Start redundant TaskExecutor when JM failed

2020-02-26 Thread YufeiLiu (Jira)
[ https://issues.apache.org/jira/browse/FLINK-16215?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17045565#comment-17045565 ] YufeiLiu commented on FLINK-16215: -- [~xintongsong] I understand your concern, so we can't know how many

[jira] [Commented] (FLINK-16215) Start redundant TaskExecutor when JM failed

2020-02-23 Thread Xintong Song (Jira)
[ https://issues.apache.org/jira/browse/FLINK-16215?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17043137#comment-17043137 ] Xintong Song commented on FLINK-16215: -- I share [~trohrmann]'s concern. On Yarn deployment,

[jira] [Commented] (FLINK-16215) Start redundant TaskExecutor when JM failed

2020-02-23 Thread Yang Wang (Jira)
[ https://issues.apache.org/jira/browse/FLINK-16215?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17043134#comment-17043134 ] Yang Wang commented on FLINK-16215: --- I think even we make {{recoverWokerNode}} as interface and do the

[jira] [Commented] (FLINK-16215) Start redundant TaskExecutor when JM failed

2020-02-21 Thread Till Rohrmann (Jira)
[ https://issues.apache.org/jira/browse/FLINK-16215?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17042036#comment-17042036 ] Till Rohrmann commented on FLINK-16215: --- I think at the moment I would not recommend to do it

[jira] [Commented] (FLINK-16215) Start redundant TaskExecutor when JM failed

2020-02-21 Thread Andrey Zagrebin (Jira)
[ https://issues.apache.org/jira/browse/FLINK-16215?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17041869#comment-17041869 ] Andrey Zagrebin commented on FLINK-16215: - I assume it is about some active RM integration, e.g.