[jira] [Commented] (YARN-3811) NM restarts could lead to app failures

2015-06-19 Thread Vinod Kumar Vavilapalli (JIRA)
[ https://issues.apache.org/jira/browse/YARN-3811?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14593852#comment-14593852 ] Vinod Kumar Vavilapalli commented on YARN-3811: --- bq. For NM work-preserving

[jira] [Commented] (YARN-3811) NM restarts could lead to app failures

2015-06-17 Thread Vinod Kumar Vavilapalli (JIRA)
[ https://issues.apache.org/jira/browse/YARN-3811?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14590164#comment-14590164 ] Vinod Kumar Vavilapalli commented on YARN-3811: --- bq. We should also consider

[jira] [Commented] (YARN-3811) NM restarts could lead to app failures

2015-06-17 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-3811?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14590207#comment-14590207 ] Jason Lowe commented on YARN-3811: -- bq. this is not possible to do as the NM needs to

[jira] [Commented] (YARN-3811) NM restarts could lead to app failures

2015-06-17 Thread Jian He (JIRA)
[ https://issues.apache.org/jira/browse/YARN-3811?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14590319#comment-14590319 ] Jian He commented on YARN-3811: --- bq. this is not possible to do as the NM needs to report the

[jira] [Commented] (YARN-3811) NM restarts could lead to app failures

2015-06-17 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-3811?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14589893#comment-14589893 ] Jason Lowe commented on YARN-3811: -- I agree with Jian that we probably don't need the not

[jira] [Commented] (YARN-3811) NM restarts could lead to app failures

2015-06-17 Thread Karthik Kambatla (JIRA)
[ https://issues.apache.org/jira/browse/YARN-3811?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14589888#comment-14589888 ] Karthik Kambatla commented on YARN-3811: We should also consider graceful NM

[jira] [Commented] (YARN-3811) NM restarts could lead to app failures

2015-06-16 Thread Karthik Kambatla (JIRA)
[ https://issues.apache.org/jira/browse/YARN-3811?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14588984#comment-14588984 ] Karthik Kambatla commented on YARN-3811: We ran into this in our rolling upgrade

[jira] [Commented] (YARN-3811) NM restarts could lead to app failures

2015-06-16 Thread Karthik Kambatla (JIRA)
[ https://issues.apache.org/jira/browse/YARN-3811?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14588995#comment-14588995 ] Karthik Kambatla commented on YARN-3811: The issue is with counting

[jira] [Commented] (YARN-3811) NM restarts could lead to app failures

2015-06-16 Thread Vinod Kumar Vavilapalli (JIRA)
[ https://issues.apache.org/jira/browse/YARN-3811?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14589016#comment-14589016 ] Vinod Kumar Vavilapalli commented on YARN-3811: --- This is a long standing

[jira] [Commented] (YARN-3811) NM restarts could lead to app failures

2015-06-16 Thread Jian He (JIRA)
[ https://issues.apache.org/jira/browse/YARN-3811?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14589256#comment-14589256 ] Jian He commented on YARN-3811: --- I'm actually thinking do we still need the

[jira] [Commented] (YARN-3811) NM restarts could lead to app failures

2015-06-16 Thread Vinod Kumar Vavilapalli (JIRA)
[ https://issues.apache.org/jira/browse/YARN-3811?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14589240#comment-14589240 ] Vinod Kumar Vavilapalli commented on YARN-3811: --- bq. I kind of agree, but

[jira] [Commented] (YARN-3811) NM restarts could lead to app failures

2015-06-16 Thread Karthik Kambatla (JIRA)
[ https://issues.apache.org/jira/browse/YARN-3811?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14589059#comment-14589059 ] Karthik Kambatla commented on YARN-3811: This wasn't as big an issue without

[jira] [Commented] (YARN-3811) NM restarts could lead to app failures

2015-06-16 Thread Karthik Kambatla (JIRA)
[ https://issues.apache.org/jira/browse/YARN-3811?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14589098#comment-14589098 ] Karthik Kambatla commented on YARN-3811: By the way, here is the stack trace: