[jira] [Updated] (YARN-1572) Low chance to hit NPE issue in AppSchedulingInfo#allocateNodeLocal

2014-11-17 Thread yangping wu (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1572?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] yangping wu updated YARN-1572: -- Description: We have a low chance of hitting an NPE in allocateNodeLocal when running the benchmark (hit 4 out of 20 times)

[jira] [Assigned] (YARN-2322) Provide Cli to refesh Admin Acls for Timeline server

2014-11-17 Thread Varun Saxena (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2322?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Varun Saxena reassigned YARN-2322: -- Assignee: Varun Saxena > Provide Cli to refesh Admin Acls for Timeline server >

[jira] [Commented] (YARN-2718) Create a CompositeConatainerExecutor that combines DockerContainerExecutor and DefaultContainerExecutor

2014-11-17 Thread Chun Chen (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2718?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14214690#comment-14214690 ] Chun Chen commented on YARN-2718: - [~ashahab], if you don't mind, can I work on this. I hav

[jira] [Updated] (YARN-2871) TestRMRestart#testRMRestartGetApplicationList sometime fails in trunk

2014-11-17 Thread Ted Yu (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2871?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ted Yu updated YARN-2871: - Description: From trunk build #746 (https://builds.apache.org/job/Hadoop-Yarn-trunk/746): {code} Failed tests: T

[jira] [Created] (YARN-2871) TestRMRestart#testRMRestartGetApplicationList sometime fails in trunk

2014-11-17 Thread Ted Yu (JIRA)
Ted Yu created YARN-2871: Summary: TestRMRestart#testRMRestartGetApplicationList sometime fails in trunk Key: YARN-2871 URL: https://issues.apache.org/jira/browse/YARN-2871 Project: Hadoop YARN Issu

[jira] [Commented] (YARN-2578) NM does not failover timely if RM node network connection fails

2014-11-17 Thread Harsh J (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2578?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14214806#comment-14214806 ] Harsh J commented on YARN-2578: --- bq. We never implemented health monitoring like in ZKFC with

[jira] [Assigned] (YARN-2043) Rename internal names to being Timeline Service instead of application history

2014-11-17 Thread Naganarasimha G R (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2043?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Naganarasimha G R reassigned YARN-2043: --- Assignee: Naganarasimha G R > Rename internal names to being Timeline Service instead

[jira] [Commented] (YARN-2522) AHSClient may be not necessary

2014-11-17 Thread Naganarasimha G R (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2522?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14214885#comment-14214885 ] Naganarasimha G R commented on YARN-2522: - As per discussion in [YARN-2838] Need to

[jira] [Commented] (YARN-2862) RM might not start if the machine was hard shutdown and FileSystemRMStateStore was used

2014-11-17 Thread Ming Ma (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2862?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14214895#comment-14214895 ] Ming Ma commented on YARN-2862: --- Thanks, [~jira.shegalov], [~jianhe], [~zjshen]. I am able t

[jira] [Commented] (YARN-2838) Issues with TimeLineServer (Application History)

2014-11-17 Thread Naganarasimha G R (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2838?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14214899#comment-14214899 ] Naganarasimha G R commented on YARN-2838: - Thanks [~zjshen] for reviewing and guidi

[jira] [Resolved] (YARN-2838) Issues with TimeLineServer (Application History)

2014-11-17 Thread Zhijie Shen (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2838?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhijie Shen resolved YARN-2838. --- Resolution: Not a Problem Close the ticket and work on separate jiras. > Issues with TimeLineServer (A

[jira] [Commented] (YARN-2863) ResourceManager will shutdown when job's queuename is empty

2014-11-17 Thread Wei Yan (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2863?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14214973#comment-14214973 ] Wei Yan commented on YARN-2863: --- [~397090770], in the hadoop trunk, this problem looks alread

[jira] [Commented] (YARN-2865) Application recovery continuously fails with "Application with id already present. Cannot duplicate"

2014-11-17 Thread Jian He (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2865?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14214989#comment-14214989 ] Jian He commented on YARN-2865: --- patch looks good overall. The patch should be enough to fix

[jira] [Commented] (YARN-2868) Add metric for initial container launch time

2014-11-17 Thread Anubhav Dhoot (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2868?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14215025#comment-14215025 ] Anubhav Dhoot commented on YARN-2868: - Seems like the QueueMetrics change can be revert

[jira] [Commented] (YARN-2868) Add metric for initial container launch time

2014-11-17 Thread Hadoop QA (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2868?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14215064#comment-14215064 ] Hadoop QA commented on YARN-2868: - {color:red}-1 overall{color}. Here are the results of t

[jira] [Commented] (YARN-2868) Add metric for initial container launch time

2014-11-17 Thread Ray Chiang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2868?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14215150#comment-14215150 ] Ray Chiang commented on YARN-2868: -- Thanks. I'll update the patch and test it out. > Add

[jira] [Commented] (YARN-2414) RM web UI: app page will crash if app is failed before any attempt has been created

2014-11-17 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2414?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14215174#comment-14215174 ] Jason Lowe commented on YARN-2414: -- +1 lgtm. Committing this. > RM web UI: app page will

[jira] [Commented] (YARN-2414) RM web UI: app page will crash if app is failed before any attempt has been created

2014-11-17 Thread Wangda Tan (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2414?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14215177#comment-14215177 ] Wangda Tan commented on YARN-2414: -- Thanks [~zjshen] reporting this issue, and [~jlowe]'s

[jira] [Commented] (YARN-2414) RM web UI: app page will crash if app is failed before any attempt has been created

2014-11-17 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2414?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14215205#comment-14215205 ] Hudson commented on YARN-2414: -- FAILURE: Integrated in Hadoop-trunk-Commit #6559 (See [https:

[jira] [Commented] (YARN-2578) NM does not failover timely if RM node network connection fails

2014-11-17 Thread Karthik Kambatla (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2578?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14215235#comment-14215235 ] Karthik Kambatla commented on YARN-2578: Since the RM doesn't (yet) suffer from the

[jira] [Commented] (YARN-2745) YARN new pluggable scheduler which does multi-resource packing

2014-11-17 Thread Karthik Kambatla (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2745?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14215274#comment-14215274 ] Karthik Kambatla commented on YARN-2745: Read the paper and synced up with Srikanth

[jira] [Commented] (YARN-2865) Application recovery continuously fails with "Application with id already present. Cannot duplicate"

2014-11-17 Thread Karthik Kambatla (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2865?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14215440#comment-14215440 ] Karthik Kambatla commented on YARN-2865: bq. probably we should have a separate cla

[jira] [Commented] (YARN-2738) Add FairReservationSystem for FairScheduler

2014-11-17 Thread Karthik Kambatla (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2738?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14215495#comment-14215495 ] Karthik Kambatla commented on YARN-2738: I am just wary of adding scheduler configs

[jira] [Commented] (YARN-2690) Make ReservationSystem and its dependent classes independent of Scheduler type

2014-11-17 Thread Karthik Kambatla (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2690?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14215500#comment-14215500 ] Karthik Kambatla commented on YARN-2690: Trusting my previous review, +1. Checking

[jira] [Commented] (YARN-2574) Add support for FairScheduler to the ReservationSystem

2014-11-17 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2574?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14215521#comment-14215521 ] Hudson commented on YARN-2574: -- FAILURE: Integrated in Hadoop-trunk-Commit #6563 (See [https:

[jira] [Commented] (YARN-2690) Make ReservationSystem and its dependent classes independent of Scheduler type

2014-11-17 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2690?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14215520#comment-14215520 ] Hudson commented on YARN-2690: -- FAILURE: Integrated in Hadoop-trunk-Commit #6563 (See [https:

[jira] [Created] (YARN-2872) CapacityScheduler: Add disk I/O resource to DRF

2014-11-17 Thread Karthik Kambatla (JIRA)
Karthik Kambatla created YARN-2872: -- Summary: CapacityScheduler: Add disk I/O resource to DRF Key: YARN-2872 URL: https://issues.apache.org/jira/browse/YARN-2872 Project: Hadoop YARN Issue T

[jira] [Created] (YARN-2873) improve LevelDB error handling for missing files DBException to avoid NM start failure.

2014-11-17 Thread zhihai xu (JIRA)
zhihai xu created YARN-2873: --- Summary: improve LevelDB error handling for missing files DBException to avoid NM start failure. Key: YARN-2873 URL: https://issues.apache.org/jira/browse/YARN-2873 Project: Ha

[jira] [Commented] (YARN-2745) YARN new pluggable scheduler which does multi-resource packing

2014-11-17 Thread Wangda Tan (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2745?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14215604#comment-14215604 ] Wangda Tan commented on YARN-2745: -- Read through the paper as well. Thanks for great works

[jira] [Updated] (YARN-2873) improve LevelDB error handling for missing files DBException to avoid NM start failure.

2014-11-17 Thread zhihai xu (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2873?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhihai xu updated YARN-2873: Attachment: YARN-2873.000.patch > improve LevelDB error handling for missing files DBException to avoid NM >

[jira] [Commented] (YARN-2873) improve LevelDB error handling for missing files DBException to avoid NM start failure.

2014-11-17 Thread zhihai xu (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2873?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14215610#comment-14215610 ] zhihai xu commented on YARN-2873: - Uploaded a patch YARN-2873.000.patch to delete levelDB f

[jira] [Updated] (YARN-2873) improve LevelDB error handling for missing files DBException to avoid NM start failure.

2014-11-17 Thread zhihai xu (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2873?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhihai xu updated YARN-2873: Description: improve LevelDB error handling for missing files DBException to avoid NM start failure. We saw

[jira] [Commented] (YARN-2873) improve LevelDB error handling for missing files DBException to avoid NM start failure.

2014-11-17 Thread Hadoop QA (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2873?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14215646#comment-14215646 ] Hadoop QA commented on YARN-2873: - {color:red}-1 overall{color}. Here are the results of t

[jira] [Updated] (YARN-2873) improve LevelDB error handling for missing files DBException to avoid NM start failure.

2014-11-17 Thread zhihai xu (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2873?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhihai xu updated YARN-2873: Attachment: YARN-2873.001.patch > improve LevelDB error handling for missing files DBException to avoid NM >

[jira] [Commented] (YARN-2873) improve LevelDB error handling for missing files DBException to avoid NM start failure.

2014-11-17 Thread zhihai xu (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2873?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14215658#comment-14215658 ] zhihai xu commented on YARN-2873: - Attached a new patch YARN-2873.001.patch to fix findbugs

[jira] [Updated] (YARN-2863) ResourceManager will shutdown when job's queuename is empty

2014-11-17 Thread yangping wu (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2863?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] yangping wu updated YARN-2863: -- Fix Version/s: 3.0.0 > ResourceManager will shutdown when job's queuename is empty >

[jira] [Updated] (YARN-2863) ResourceManager will shutdown when job's queuename is empty

2014-11-17 Thread yangping wu (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2863?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] yangping wu updated YARN-2863: -- Labels: hadoop (was: ) > ResourceManager will shutdown when job's queuename is empty > -

[jira] [Commented] (YARN-2863) ResourceManager will shutdown when job's queuename is empty

2014-11-17 Thread yangping wu (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2863?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14215674#comment-14215674 ] yangping wu commented on YARN-2863: --- Hi Wei Yan, thank you for your reply. The bug was s

[jira] [Commented] (YARN-2873) improve LevelDB error handling for missing files DBException to avoid NM start failure.

2014-11-17 Thread Hadoop QA (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2873?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14215676#comment-14215676 ] Hadoop QA commented on YARN-2873: - {color:red}-1 overall{color}. Here are the results of t

[jira] [Commented] (YARN-2745) YARN new pluggable scheduler which does multi-resource packing

2014-11-17 Thread Srikanth Kandula (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2745?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14215720#comment-14215720 ] Srikanth Kandula commented on YARN-2745: Thanks Wangda. Re: Yarn-314, that would b

[jira] [Assigned] (YARN-1984) LeveldbTimelineStore does not handle db exceptions properly

2014-11-17 Thread Varun Saxena (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1984?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Varun Saxena reassigned YARN-1984: -- Assignee: Varun Saxena > LeveldbTimelineStore does not handle db exceptions properly > -

[jira] [Updated] (YARN-2604) Scheduler should consider max-allocation-* in conjunction with the largest node

2014-11-17 Thread Robert Kanter (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2604?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Kanter updated YARN-2604: Attachment: YARN-2604.patch New patch addresses all of Karthik's comments. It also uses the config f

[jira] [Commented] (YARN-2604) Scheduler should consider max-allocation-* in conjunction with the largest node

2014-11-17 Thread Hadoop QA (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2604?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14215817#comment-14215817 ] Hadoop QA commented on YARN-2604: - {color:green}+1 overall{color}. Here are the results of