[jira] [Commented] (YARN-3998) Add retry-times to let NM re-launch container when it fails to run

2016-04-28 Thread Varun Vasudev (JIRA)
[ https://issues.apache.org/jira/browse/YARN-3998?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15262586#comment-15262586 ] Varun Vasudev commented on YARN-3998: - I'm going to commit the latest patch tomorrow if no one objects.

[jira] [Commented] (YARN-3998) Add retry-times to let NM re-launch container when it fails to run

2016-04-27 Thread Varun Vasudev (JIRA)
[ https://issues.apache.org/jira/browse/YARN-3998?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15260696#comment-15260696 ] Varun Vasudev commented on YARN-3998: - [~vinodkv] - do you want to review this further or can I go

[jira] [Commented] (YARN-3998) Add retry-times to let NM re-launch container when it fails to run

2016-04-13 Thread Varun Vasudev (JIRA)
[ https://issues.apache.org/jira/browse/YARN-3998?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15239448#comment-15239448 ] Varun Vasudev commented on YARN-3998: - The latest patch looks good to me. The points that still need to

[jira] [Commented] (YARN-3998) Add retry-times to let NM re-launch container when it fails to run

2016-04-11 Thread Hadoop QA (JIRA)
[ https://issues.apache.org/jira/browse/YARN-3998?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15235395#comment-15235395 ] Hadoop QA commented on YARN-3998: - | (x) *{color:red}-1 overall{color}* | \\ \\ || Vote || Subsystem ||

[jira] [Commented] (YARN-3998) Add retry-times to let NM re-launch container when it fails to run

2016-04-11 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-3998?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15235213#comment-15235213 ] Jun Gong commented on YARN-3998: Thanks [~vvasudev] for the review and comments! Attach a rebased patch

[jira] [Commented] (YARN-3998) Add retry-times to let NM re-launch container when it fails to run

2016-04-11 Thread Varun Vasudev (JIRA)
[ https://issues.apache.org/jira/browse/YARN-3998?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15234983#comment-15234983 ] Varun Vasudev commented on YARN-3998: - Thanks for the patch [~hex108]. It looks good. Few things that

[jira] [Commented] (YARN-3998) Add retry-times to let NM re-launch container when it fails to run

2016-04-06 Thread Hadoop QA (JIRA)
[ https://issues.apache.org/jira/browse/YARN-3998?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15227816#comment-15227816 ] Hadoop QA commented on YARN-3998: - | (x) *{color:red}-1 overall{color}* | \\ \\ || Vote || Subsystem ||

[jira] [Commented] (YARN-3998) Add retry-times to let NM re-launch container when it fails to run

2016-04-05 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-3998?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15227697#comment-15227697 ] Jun Gong commented on YARN-3998: [~vvasudev] Thanks for the review and comments. Attach a new patch

[jira] [Commented] (YARN-3998) Add retry-times to let NM re-launch container when it fails to run

2016-04-04 Thread Varun Vasudev (JIRA)
[ https://issues.apache.org/jira/browse/YARN-3998?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15225716#comment-15225716 ] Varun Vasudev commented on YARN-3998: - Thanks for the patch [~hex108] # {code}+ List

[jira] [Commented] (YARN-3998) Add retry-times to let NM re-launch container when it fails to run

2016-03-28 Thread Hadoop QA (JIRA)
[ https://issues.apache.org/jira/browse/YARN-3998?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15214969#comment-15214969 ] Hadoop QA commented on YARN-3998: - | (x) *{color:red}-1 overall{color}* | \\ \\ || Vote || Subsystem ||

[jira] [Commented] (YARN-3998) Add retry-times to let NM re-launch container when it fails to run

2016-03-28 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-3998?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15214532#comment-15214532 ] Jun Gong commented on YARN-3998: Very sorry for late. I just a attached a new patch 07.patch. In the new

[jira] [Commented] (YARN-3998) Add retry-times to let NM re-launch container when it fails to run

2016-03-09 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-3998?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15188681#comment-15188681 ] Jun Gong commented on YARN-3998: Yes, it seems we need to deal with platform errors, however it still seems

[jira] [Commented] (YARN-3998) Add retry-times to let NM re-launch container when it fails to run

2016-03-09 Thread Vinod Kumar Vavilapalli (JIRA)
[ https://issues.apache.org/jira/browse/YARN-3998?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15187920#comment-15187920 ] Vinod Kumar Vavilapalli commented on YARN-3998: --- Instead of making it an arbitrary

[jira] [Commented] (YARN-3998) Add retry-times to let NM re-launch container when it fails to run

2016-03-08 Thread Junping Du (JIRA)
[ https://issues.apache.org/jira/browse/YARN-3998?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15184968#comment-15184968 ] Junping Du commented on YARN-3998: -- bq. We could specify retry policy to RETRY_ON_SPECIFIC_ERROR_CODE to

[jira] [Commented] (YARN-3998) Add retry-times to let NM re-launch container when it fails to run

2016-03-08 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-3998?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15184951#comment-15184951 ] Jun Gong commented on YARN-3998: We could specify retry policy to RETRY_ON_SPECIFIC_ERROR_CODE to handle

[jira] [Commented] (YARN-3998) Add retry-times to let NM re-launch container when it fails to run

2016-03-08 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-3998?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15184952#comment-15184952 ] Jun Gong commented on YARN-3998: We could specify retry policy to RETRY_ON_SPECIFIC_ERROR_CODE to handle

[jira] [Commented] (YARN-3998) Add retry-times to let NM re-launch container when it fails to run

2016-03-08 Thread Junping Du (JIRA)
[ https://issues.apache.org/jira/browse/YARN-3998?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15184809#comment-15184809 ] Junping Du commented on YARN-3998: -- Thanks [~hex108] for the patch. In addition, I think we should be

[jira] [Commented] (YARN-3998) Add retry-times to let NM re-launch container when it fails to run

2016-03-07 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-3998?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15184167#comment-15184167 ] Jun Gong commented on YARN-3998: Thanks [~vvasudev] and [~vinodkv] for the comments and suggestions. I will

[jira] [Commented] (YARN-3998) Add retry-times to let NM re-launch container when it fails to run

2016-03-07 Thread Vinod Kumar Vavilapalli (JIRA)
[ https://issues.apache.org/jira/browse/YARN-3998?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15183634#comment-15183634 ] Vinod Kumar Vavilapalli commented on YARN-3998: --- Okay, seems like we are in general

[jira] [Commented] (YARN-3998) Add retry-times to let NM re-launch container when it fails to run

2016-03-06 Thread Varun Vasudev (JIRA)
[ https://issues.apache.org/jira/browse/YARN-3998?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15182570#comment-15182570 ] Varun Vasudev commented on YARN-3998: - My apologies for not responding earlier. I think creating a

[jira] [Commented] (YARN-3998) Add retry-times to let NM re-launch container when it fails to run

2016-02-24 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-3998?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15166567#comment-15166567 ] Jun Gong commented on YARN-3998: Thanks [~vinodkv] for explaining it. {quote} My point was mainly about

[jira] [Commented] (YARN-3998) Add retry-times to let NM re-launch container when it fails to run

2016-02-23 Thread Vinod Kumar Vavilapalli (JIRA)
[ https://issues.apache.org/jira/browse/YARN-3998?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15159891#comment-15159891 ] Vinod Kumar Vavilapalli commented on YARN-3998: --- [~vvasudev], [~hex108] bq. Unification with

[jira] [Commented] (YARN-3998) Add retry-times to let NM re-launch container when it fails to run

2016-02-23 Thread Arun Suresh (JIRA)
[ https://issues.apache.org/jira/browse/YARN-3998?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15159810#comment-15159810 ] Arun Suresh commented on YARN-3998: --- By the way, Thanks a ton for raising this [~hex108].. Extremely

[jira] [Commented] (YARN-3998) Add retry-times to let NM re-launch container when it fails to run

2016-02-23 Thread Arun Suresh (JIRA)
[ https://issues.apache.org/jira/browse/YARN-3998?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15159791#comment-15159791 ] Arun Suresh commented on YARN-3998: --- Was spending some time thinking about this.. Would it make sense to

[jira] [Commented] (YARN-3998) Add retry-times to let NM re-launch container when it fails to run

2016-02-23 Thread Vinod Kumar Vavilapalli (JIRA)
[ https://issues.apache.org/jira/browse/YARN-3998?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15159768#comment-15159768 ] Vinod Kumar Vavilapalli commented on YARN-3998: --- Making this a sub-task of YARN-4725 where we

[jira] [Commented] (YARN-3998) Add retry-times to let NM re-launch container when it fails to run

2016-02-16 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-3998?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15149654#comment-15149654 ] Jun Gong commented on YARN-3998: Sorry for late reply, I was on holiday. Thanks [~vinodkv] and [~vvasudev]

[jira] [Commented] (YARN-3998) Add retry-times to let NM re-launch container when it fails to run

2016-02-05 Thread Varun Vasudev (JIRA)
[ https://issues.apache.org/jira/browse/YARN-3998?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15134366#comment-15134366 ] Varun Vasudev commented on YARN-3998: - Thanks for your comments Vinod. Overall, I mostly agree with

[jira] [Commented] (YARN-3998) Add retry-times to let NM re-launch container when it fails to run

2016-02-04 Thread Vinod Kumar Vavilapalli (JIRA)
[ https://issues.apache.org/jira/browse/YARN-3998?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15133367#comment-15133367 ] Vinod Kumar Vavilapalli commented on YARN-3998: --- Thanks for working on this [~hex108]! I've

[jira] [Commented] (YARN-3998) Add retry-times to let NM re-launch container when it fails to run

2016-02-03 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-3998?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15130109#comment-15130109 ] Jun Gong commented on YARN-3998: [~vvasudev], I just attached a new patch to address above problems. Thanks

[jira] [Commented] (YARN-3998) Add retry-times to let NM re-launch container when it fails to run

2016-02-03 Thread Hadoop QA (JIRA)
[ https://issues.apache.org/jira/browse/YARN-3998?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15130229#comment-15130229 ] Hadoop QA commented on YARN-3998: - | (x) *{color:red}-1 overall{color}* | \\ \\ || Vote || Subsystem ||

[jira] [Commented] (YARN-3998) Add retry-times to let NM re-launch container when it fails to run

2016-02-02 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-3998?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15127929#comment-15127929 ] Jun Gong commented on YARN-3998: Thanks [~vvasudev] for the detailed review and comments. I will update the

[jira] [Commented] (YARN-3998) Add retry-times to let NM re-launch container when it fails to run

2016-02-02 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-3998?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15127952#comment-15127952 ] Jun Gong commented on YARN-3998: Thanks for clarifying. Yes, it will be a problem, I will handle it. > Add

[jira] [Commented] (YARN-3998) Add retry-times to let NM re-launch container when it fails to run

2016-02-02 Thread Varun Vasudev (JIRA)
[ https://issues.apache.org/jira/browse/YARN-3998?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15127937#comment-15127937 ] Varun Vasudev commented on YARN-3998: - {quote} getLocalPathForWrite is used for allocating a new

[jira] [Commented] (YARN-3998) Add retry-times to let NM re-launch container when it fails to run

2016-02-01 Thread Varun Vasudev (JIRA)
[ https://issues.apache.org/jira/browse/YARN-3998?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15127734#comment-15127734 ] Varun Vasudev commented on YARN-3998: - A couple of additional thoughts - I would like to restart

[jira] [Commented] (YARN-3998) Add retry-times to let NM re-launch container when it fails to run

2016-02-01 Thread Varun Vasudev (JIRA)
[ https://issues.apache.org/jira/browse/YARN-3998?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15127732#comment-15127732 ] Varun Vasudev commented on YARN-3998: - My apologies for the slow response [~hex108]. Thank you for your

[jira] [Commented] (YARN-3998) Add retry-times to let NM re-launch container when it fails to run

2016-01-31 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-3998?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15125597#comment-15125597 ] Jun Gong commented on YARN-3998: hi [~vvasudev], could you please help review the latest patch if you have

[jira] [Commented] (YARN-3998) Add retry-times to let NM re-launch container when it fails to run

2016-01-15 Thread Hadoop QA (JIRA)
[ https://issues.apache.org/jira/browse/YARN-3998?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15101922#comment-15101922 ] Hadoop QA commented on YARN-3998: - | (x) *{color:red}-1 overall{color}* | \\ \\ || Vote || Subsystem ||

[jira] [Commented] (YARN-3998) Add retry-times to let NM re-launch container when it fails to run

2016-01-15 Thread Hadoop QA (JIRA)
[ https://issues.apache.org/jira/browse/YARN-3998?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15101483#comment-15101483 ] Hadoop QA commented on YARN-3998: - | (x) *{color:red}-1 overall{color}* | \\ \\ || Vote || Subsystem ||

[jira] [Commented] (YARN-3998) Add retry-times to let NM re-launch container when it fails to run

2016-01-15 Thread Hadoop QA (JIRA)
[ https://issues.apache.org/jira/browse/YARN-3998?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15101552#comment-15101552 ] Hadoop QA commented on YARN-3998: - | (x) *{color:red}-1 overall{color}* | \\ \\ || Vote || Subsystem ||

[jira] [Commented] (YARN-3998) Add retry-times to let NM re-launch container when it fails to run

2016-01-14 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-3998?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15101383#comment-15101383 ] Jun Gong commented on YARN-3998: [~vvasudev] Thanks for the detailed review and suggestions! I just

[jira] [Commented] (YARN-3998) Add retry-times to let NM re-launch container when it fails to run

2016-01-07 Thread Varun Vasudev (JIRA)
[ https://issues.apache.org/jira/browse/YARN-3998?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15087392#comment-15087392 ] Varun Vasudev commented on YARN-3998: - Thanks for the patch [~hex108]! In your implementation, the

[jira] [Commented] (YARN-3998) Add retry-times to let NM re-launch container when it fails to run

2015-12-26 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-3998?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15072022#comment-15072022 ] Jun Gong commented on YARN-3998: Attach a new patch to fix above errors, add test cases. > Add retry-times

[jira] [Commented] (YARN-3998) Add retry-times to let NM re-launch container when it fails to run

2015-12-26 Thread Hadoop QA (JIRA)
[ https://issues.apache.org/jira/browse/YARN-3998?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15072026#comment-15072026 ] Hadoop QA commented on YARN-3998: - | (x) *{color:red}-1 overall{color}* | \\ \\ || Vote || Subsystem ||

[jira] [Commented] (YARN-3998) Add retry-times to let NM re-launch container when it fails to run

2015-12-24 Thread Hadoop QA (JIRA)
[ https://issues.apache.org/jira/browse/YARN-3998?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15071166#comment-15071166 ] Hadoop QA commented on YARN-3998: - | (x) *{color:red}-1 overall{color}* | \\ \\ || Vote || Subsystem ||

[jira] [Commented] (YARN-3998) Add retry-times to let NM re-launch container when it fails to run

2015-12-24 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-3998?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15071113#comment-15071113 ] Jun Gong commented on YARN-3998: Sorry for late. I just attached a patch for review, will add test cases

[jira] [Commented] (YARN-3998) Add retry-times to let NM re-launch container when it fails to run

2015-12-10 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-3998?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15052043#comment-15052043 ] Jun Gong commented on YARN-3998: [~vvasudev] Thanks for the attention and suggestion. We have a patch for

[jira] [Commented] (YARN-3998) Add retry-times to let NM re-launch container when it fails to run

2015-12-10 Thread Varun Vasudev (JIRA)
[ https://issues.apache.org/jira/browse/YARN-3998?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15051024#comment-15051024 ] Varun Vasudev commented on YARN-3998: - [~hex108] - do you wish to work on this? Do you have a patch

[jira] [Commented] (YARN-3998) Add retry-times to let NM re-launch container when it fails to run

2015-12-09 Thread Varun Vasudev (JIRA)
[ https://issues.apache.org/jira/browse/YARN-3998?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15049050#comment-15049050 ] Varun Vasudev commented on YARN-3998: - I'm thinking of re-opening this issue because we've seen a use

[jira] [Commented] (YARN-3998) Add retry-times to let NM re-launch container when it fails to run

2015-08-03 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-3998?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14651887#comment-14651887 ] Jun Gong commented on YARN-3998: Thanks [~jlowe] and [~ste...@apache.org] for the detailed

[jira] [Commented] (YARN-3998) Add retry-times to let NM re-launch container when it fails to run

2015-07-31 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-3998?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14649347#comment-14649347 ] Jason Lowe commented on YARN-3998: -- I think it's more effort for YARN to support this than

[jira] [Commented] (YARN-3998) Add retry-times to let NM re-launch container when it fails to run

2015-07-31 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-3998?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14649089#comment-14649089 ] Jun Gong commented on YARN-3998: [~jlowe] Thanks for the comment. I agree with you. I am

[jira] [Commented] (YARN-3998) Add retry-times to let NM re-launch container when it fails to run

2015-07-30 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-3998?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14647755#comment-14647755 ] Jason Lowe commented on YARN-3998: -- Is this really a feature that YARN needs to provide?