[jira] [Created] (YARN-6396) Call verifyAndCreateRemoteLogDir at service initialization instead of application initialization to decrease load for name node

2017-03-26 Thread zhihai xu (JIRA)
zhihai xu created YARN-6396: --- Summary: Call verifyAndCreateRemoteLogDir at service initialization instead of application initialization to decrease load for name node Key: YARN-6396 URL:

[jira] [Created] (YARN-6392) add submit time to Application Summary log

2017-03-26 Thread zhihai xu (JIRA)
zhihai xu created YARN-6392: --- Summary: add submit time to Application Summary log Key: YARN-6392 URL: https://issues.apache.org/jira/browse/YARN-6392 Project: Hadoop YARN Issue Type: Improvement

[jira] [Created] (YARN-4979) FSAppAttempt adds duplicate ResourceRequest to demand in updateDemand.

2016-04-21 Thread zhihai xu (JIRA)
zhihai xu created YARN-4979: --- Summary: FSAppAttempt adds duplicate ResourceRequest to demand in updateDemand. Key: YARN-4979 URL: https://issues.apache.org/jira/browse/YARN-4979 Project: Hadoop YARN

[jira] [Created] (YARN-4458) Compilation error at branch-2.7 due to getNodeLabelExpression not defined in NMContainerStatusPBImpl.

2015-12-15 Thread zhihai xu (JIRA)
zhihai xu created YARN-4458: --- Summary: Compilation error at branch-2.7 due to getNodeLabelExpression not defined in NMContainerStatusPBImpl. Key: YARN-4458 URL: https://issues.apache.org/jira/browse/YARN-4458

[jira] [Created] (YARN-4209) RMStateStore FENCED state doesn’t work

2015-09-28 Thread zhihai xu (JIRA)
zhihai xu created YARN-4209: --- Summary: RMStateStore FENCED state doesn’t work Key: YARN-4209 URL: https://issues.apache.org/jira/browse/YARN-4209 Project: Hadoop YARN Issue Type: Bug

[jira] [Resolved] (YARN-4190) missing container information in FairScheduler preemption log.

2015-09-18 Thread zhihai xu (JIRA)
[ https://issues.apache.org/jira/browse/YARN-4190?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhihai xu resolved YARN-4190. - Resolution: Later > missing container information in FairScheduler preemption log. >

[jira] [Created] (YARN-4187) Yarn Client uses local address instead RM address as token renewer in a secure cluster when HA is enabled.

2015-09-18 Thread zhihai xu (JIRA)
zhihai xu created YARN-4187: --- Summary: Yarn Client uses local address instead RM address as token renewer in a secure cluster when HA is enabled. Key: YARN-4187 URL: https://issues.apache.org/jira/browse/YARN-4187

[jira] [Created] (YARN-4190) Add container information in FairScheduler preemption log to help debug.

2015-09-18 Thread zhihai xu (JIRA)
zhihai xu created YARN-4190: --- Summary: Add container information in FairScheduler preemption log to help debug. Key: YARN-4190 URL: https://issues.apache.org/jira/browse/YARN-4190 Project: Hadoop YARN

[jira] [Created] (YARN-4153) TestAsyncDispatcher failed at branch-2.7

2015-09-13 Thread zhihai xu (JIRA)
zhihai xu created YARN-4153: --- Summary: TestAsyncDispatcher failed at branch-2.7 Key: YARN-4153 URL: https://issues.apache.org/jira/browse/YARN-4153 Project: Hadoop YARN Issue Type: Bug

[jira] [Created] (YARN-4095) Avoid sharing AllocatorPerContext object in LocalDirAllocator between ShuffleHandler and LocalDirsHandlerService.

2015-08-30 Thread zhihai xu (JIRA)
zhihai xu created YARN-4095: --- Summary: Avoid sharing AllocatorPerContext object in LocalDirAllocator between ShuffleHandler and LocalDirsHandlerService. Key: YARN-4095 URL:

[jira] [Resolved] (YARN-3857) Memory leak in ResourceManager with SIMPLE mode

2015-08-18 Thread zhihai xu (JIRA)
[ https://issues.apache.org/jira/browse/YARN-3857?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhihai xu resolved YARN-3857. - Resolution: Fixed Memory leak in ResourceManager with SIMPLE mode

[jira] [Created] (YARN-3943) Use separate threshold configurations for disk-full detection and disk-not-full detection.

2015-07-20 Thread zhihai xu (JIRA)
zhihai xu created YARN-3943: --- Summary: Use separate threshold configurations for disk-full detection and disk-not-full detection. Key: YARN-3943 URL: https://issues.apache.org/jira/browse/YARN-3943

[jira] [Created] (YARN-3925) ContainerLogsUtils#getContainerLogFile fails to read container log files from full disks.

2015-07-14 Thread zhihai xu (JIRA)
zhihai xu created YARN-3925: --- Summary: ContainerLogsUtils#getContainerLogFile fails to read container log files from full disks. Key: YARN-3925 URL: https://issues.apache.org/jira/browse/YARN-3925 Project:

[jira] [Created] (YARN-3882) AggregatedLogFormat should close aclScanner and ownerScanner after create them.

2015-07-02 Thread zhihai xu (JIRA)
zhihai xu created YARN-3882: --- Summary: AggregatedLogFormat should close aclScanner and ownerScanner after create them. Key: YARN-3882 URL: https://issues.apache.org/jira/browse/YARN-3882 Project: Hadoop

[jira] [Resolved] (YARN-3549) use JNI-based FileStatus implementation from io.nativeio.NativeIO.POSIX#getFstat instead of shell-based implementation from RawLocalFileSystem in checkLocalDir.

2015-06-14 Thread zhihai xu (JIRA)
[ https://issues.apache.org/jira/browse/YARN-3549?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhihai xu resolved YARN-3549. - Resolution: Duplicate use JNI-based FileStatus implementation from io.nativeio.NativeIO.POSIX#getFstat

[jira] [Created] (YARN-3802) Two RMNodes for the same NodeId are used in RM sometimes after NM is reconnected.

2015-06-14 Thread zhihai xu (JIRA)
zhihai xu created YARN-3802: --- Summary: Two RMNodes for the same NodeId are used in RM sometimes after NM is reconnected. Key: YARN-3802 URL: https://issues.apache.org/jira/browse/YARN-3802 Project: Hadoop

[jira] [Created] (YARN-3780) Should use equals when compare Resource in RMNodeImpl#ReconnectNodeTransition

2015-06-07 Thread zhihai xu (JIRA)
zhihai xu created YARN-3780: --- Summary: Should use equals when compare Resource in RMNodeImpl#ReconnectNodeTransition Key: YARN-3780 URL: https://issues.apache.org/jira/browse/YARN-3780 Project: Hadoop YARN

[jira] [Created] (YARN-3777) Move all reservation-related tests from TestFairScheduler to TestFairSchedulerReservations.

2015-06-05 Thread zhihai xu (JIRA)
zhihai xu created YARN-3777: --- Summary: Move all reservation-related tests from TestFairScheduler to TestFairSchedulerReservations. Key: YARN-3777 URL: https://issues.apache.org/jira/browse/YARN-3777

[jira] [Created] (YARN-3727) For better error recovery, check if the directory exists before using it for localization.

2015-05-27 Thread zhihai xu (JIRA)
zhihai xu created YARN-3727: --- Summary: For better error recovery, check if the directory exists before using it for localization. Key: YARN-3727 URL: https://issues.apache.org/jira/browse/YARN-3727

[jira] [Created] (YARN-3713) Remove duplicate function call storeContainerDiagnostics in ContainerDiagnosticsUpdateTransition

2015-05-26 Thread zhihai xu (JIRA)
zhihai xu created YARN-3713: --- Summary: Remove duplicate function call storeContainerDiagnostics in ContainerDiagnosticsUpdateTransition Key: YARN-3713 URL: https://issues.apache.org/jira/browse/YARN-3713

[jira] [Created] (YARN-3710) FairScheduler: Should allocate more containers for assign-multiple after assignReservedContainer turns the reservation into an allocation.

2015-05-25 Thread zhihai xu (JIRA)
zhihai xu created YARN-3710: --- Summary: FairScheduler: Should allocate more containers for assign-multiple after assignReservedContainer turns the reservation into an allocation. Key: YARN-3710 URL:

[jira] [Created] (YARN-3697) FairScheduler: ContinuousSchedulingThread can't be shutdown after stop sometimes.

2015-05-21 Thread zhihai xu (JIRA)
zhihai xu created YARN-3697: --- Summary: FairScheduler: ContinuousSchedulingThread can't be shutdown after stop sometimes. Key: YARN-3697 URL: https://issues.apache.org/jira/browse/YARN-3697 Project: Hadoop

[jira] [Created] (YARN-3667) Fix findbugs warning Inconsistent synchronization of org.apache.hadoop.yarn.server.resourcemanager.recovery.FileSystemRMStateStore.isHDFS

2015-05-15 Thread zhihai xu (JIRA)
zhihai xu created YARN-3667: --- Summary: Fix findbugs warning Inconsistent synchronization of org.apache.hadoop.yarn.server.resourcemanager.recovery.FileSystemRMStateStore.isHDFS Key: YARN-3667 URL:

[jira] [Created] (YARN-3655) FairScheduler: potential deadlock due to maxAMShare limitation and container reservation

2015-05-15 Thread zhihai xu (JIRA)
zhihai xu created YARN-3655: --- Summary: FairScheduler: potential deadlock due to maxAMShare limitation and container reservation Key: YARN-3655 URL: https://issues.apache.org/jira/browse/YARN-3655 Project:

[jira] [Created] (YARN-3604) removeApplication in ZKRMStateStore should also disable watch.

2015-05-08 Thread zhihai xu (JIRA)
zhihai xu created YARN-3604: --- Summary: removeApplication in ZKRMStateStore should also disable watch. Key: YARN-3604 URL: https://issues.apache.org/jira/browse/YARN-3604 Project: Hadoop YARN

[jira] [Created] (YARN-3602) TestResourceLocalizationService.testPublicResourceInitializesLocalDir fails Intermittently due to IOException from cleanup

2015-05-08 Thread zhihai xu (JIRA)
zhihai xu created YARN-3602: --- Summary: TestResourceLocalizationService.testPublicResourceInitializesLocalDir fails Intermittently due to IOException from cleanup Key: YARN-3602 URL:

[jira] [Resolved] (YARN-3114) It would be better to consider integer(long) overflow when compare the time in DelegationTokenRenewer.

2015-05-01 Thread zhihai xu (JIRA)
[ https://issues.apache.org/jira/browse/YARN-3114?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhihai xu resolved YARN-3114. - Resolution: Not A Problem It would be better to consider integer(long) overflow when compare the time

[jira] [Resolved] (YARN-3190) NM can't aggregate logs: token can't be found in cache

2015-04-23 Thread zhihai xu (JIRA)
[ https://issues.apache.org/jira/browse/YARN-3190?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhihai xu resolved YARN-3190. - Resolution: Duplicate issue is fixed by YARN-2964 NM can't aggregate logs: token can't be found in

[jira] [Created] (YARN-3516) killing ContainerLocalizer action doesn't take effect when private localizer receives FETCH_FAILURE status.

2015-04-20 Thread zhihai xu (JIRA)
zhihai xu created YARN-3516: --- Summary: killing ContainerLocalizer action doesn't take effect when private localizer receives FETCH_FAILURE status. Key: YARN-3516 URL: https://issues.apache.org/jira/browse/YARN-3516

[jira] [Resolved] (YARN-3496) Add a configuration to disable/enable storing localization state in NMLeveldbStateStore

2015-04-17 Thread zhihai xu (JIRA)
[ https://issues.apache.org/jira/browse/YARN-3496?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhihai xu resolved YARN-3496. - Resolution: Not A Problem Add a configuration to disable/enable storing localization state in

[jira] [Created] (YARN-3491) Improve the public resource localization to do both FSDownload submission to the thread pool and completed localization handling in one thread (PublicLocalizer).

2015-04-15 Thread zhihai xu (JIRA)
zhihai xu created YARN-3491: --- Summary: Improve the public resource localization to do both FSDownload submission to the thread pool and completed localization handling in one thread (PublicLocalizer). Key: YARN-3491

[jira] [Created] (YARN-3496) Add a configuration to disable/enable storing localization state in NM StateStore

2015-04-15 Thread zhihai xu (JIRA)
zhihai xu created YARN-3496: --- Summary: Add a configuration to disable/enable storing localization state in NM StateStore Key: YARN-3496 URL: https://issues.apache.org/jira/browse/YARN-3496 Project: Hadoop

[jira] [Created] (YARN-3465) use LinkedHashMap to keep the order of LocalResourceRequest in ContainerImpl

2015-04-08 Thread zhihai xu (JIRA)
zhihai xu created YARN-3465: --- Summary: use LinkedHashMap to keep the order of LocalResourceRequest in ContainerImpl Key: YARN-3465 URL: https://issues.apache.org/jira/browse/YARN-3465 Project: Hadoop YARN

[jira] [Created] (YARN-3464) Race condition in LocalizerRunner causes container localization timeout.

2015-04-08 Thread zhihai xu (JIRA)
zhihai xu created YARN-3464: --- Summary: Race condition in LocalizerRunner causes container localization timeout. Key: YARN-3464 URL: https://issues.apache.org/jira/browse/YARN-3464 Project: Hadoop YARN

[jira] [Created] (YARN-3446) FairScheduler HeadRoom calculation should exclude nodes in the blacklist.

2015-04-03 Thread zhihai xu (JIRA)
zhihai xu created YARN-3446: --- Summary: FairScheduler HeadRoom calculation should exclude nodes in the blacklist. Key: YARN-3446 URL: https://issues.apache.org/jira/browse/YARN-3446 Project: Hadoop YARN

[jira] [Created] (YARN-3429) TestAMRMTokens.testTokenExpiry fails Intermittently with error message:Invalid AMRMToken from appattempt_1427804754787_0001_000001

2015-03-31 Thread zhihai xu (JIRA)
zhihai xu created YARN-3429: --- Summary: TestAMRMTokens.testTokenExpiry fails Intermittently with error message:Invalid AMRMToken from appattempt_1427804754787_0001_01 Key: YARN-3429 URL:

[jira] [Created] (YARN-3395) Handle the user name correctly when submit application and use user name as default queue name.

2015-03-24 Thread zhihai xu (JIRA)
zhihai xu created YARN-3395: --- Summary: Handle the user name correctly when submit application and use user name as default queue name. Key: YARN-3395 URL: https://issues.apache.org/jira/browse/YARN-3395

[jira] [Created] (YARN-3385) Race condition: KeeperException$NoNodeException will cause RM shutdown during ZK node deletion(Op.delete).

2015-03-22 Thread zhihai xu (JIRA)
zhihai xu created YARN-3385: --- Summary: Race condition: KeeperException$NoNodeException will cause RM shutdown during ZK node deletion(Op.delete). Key: YARN-3385 URL: https://issues.apache.org/jira/browse/YARN-3385

[jira] [Created] (YARN-3363) add localization and container launch time to ContainerMetrics at NM to show these timing information for each active container.

2015-03-17 Thread zhihai xu (JIRA)
zhihai xu created YARN-3363: --- Summary: add localization and container launch time to ContainerMetrics at NM to show these timing information for each active container. Key: YARN-3363 URL:

[jira] [Created] (YARN-3355) findbugs warning:Inconsistent synchronization of org.apache.hadoop.yarn.server.resourcemanager.scheduler.fair.FairScheduler.allocConf

2015-03-16 Thread zhihai xu (JIRA)
zhihai xu created YARN-3355: --- Summary: findbugs warning:Inconsistent synchronization of org.apache.hadoop.yarn.server.resourcemanager.scheduler.fair.FairScheduler.allocConf Key: YARN-3355 URL:

[jira] [Created] (YARN-3349) treat all exceptions as failure in testFSRMStateStoreClientRetry

2015-03-15 Thread zhihai xu (JIRA)
zhihai xu created YARN-3349: --- Summary: treat all exceptions as failure in testFSRMStateStoreClientRetry Key: YARN-3349 URL: https://issues.apache.org/jira/browse/YARN-3349 Project: Hadoop YARN

[jira] [Resolved] (YARN-3263) ContainerManagerImpl#parseCredentials don't rewind the ByteBuffer after credentials.readTokenStorageStream

2015-03-13 Thread zhihai xu (JIRA)
[ https://issues.apache.org/jira/browse/YARN-3263?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhihai xu resolved YARN-3263. - Resolution: Not a Problem This is not an issue. tokens.rewind() is called before

[jira] [Created] (YARN-3341) Fix findbugs warning:BC_UNCONFIRMED_CAST at FSSchedulerNode.reserveResource

2015-03-12 Thread zhihai xu (JIRA)
zhihai xu created YARN-3341: --- Summary: Fix findbugs warning:BC_UNCONFIRMED_CAST at FSSchedulerNode.reserveResource Key: YARN-3341 URL: https://issues.apache.org/jira/browse/YARN-3341 Project: Hadoop YARN

[jira] [Created] (YARN-3336) FileSystem memory leak in DelegationTokenRenewer

2015-03-11 Thread zhihai xu (JIRA)
zhihai xu created YARN-3336: --- Summary: FileSystem memory leak in DelegationTokenRenewer Key: YARN-3336 URL: https://issues.apache.org/jira/browse/YARN-3336 Project: Hadoop YARN Issue Type: Bug

[jira] [Created] (YARN-3263) ContainerManagerImpl#parseCredentials don't rewind the ByteBuffer after credentials.readTokenStorageStream

2015-02-25 Thread zhihai xu (JIRA)
zhihai xu created YARN-3263: --- Summary: ContainerManagerImpl#parseCredentials don't rewind the ByteBuffer after credentials.readTokenStorageStream Key: YARN-3263 URL: https://issues.apache.org/jira/browse/YARN-3263

[jira] [Created] (YARN-3247) TestQueueMappings failure for FairScheduler

2015-02-23 Thread zhihai xu (JIRA)
zhihai xu created YARN-3247: --- Summary: TestQueueMappings failure for FairScheduler Key: YARN-3247 URL: https://issues.apache.org/jira/browse/YARN-3247 Project: Hadoop YARN Issue Type: Bug

[jira] [Created] (YARN-3242) Old ZK client session watcher event messed up new ZK client session due to ZooKeeper asynchronously closing client session.

2015-02-21 Thread zhihai xu (JIRA)
zhihai xu created YARN-3242: --- Summary: Old ZK client session watcher event messed up new ZK client session due to ZooKeeper asynchronously closing client session. Key: YARN-3242 URL:

[jira] [Created] (YARN-3241) Leading space, trailing space and empty sub queue name may cause MetricsException for fair scheduler

2015-02-20 Thread zhihai xu (JIRA)
zhihai xu created YARN-3241: --- Summary: Leading space, trailing space and empty sub queue name may cause MetricsException for fair scheduler Key: YARN-3241 URL: https://issues.apache.org/jira/browse/YARN-3241

[jira] [Created] (YARN-3236) cleanup RMAuthenticationFilter#AUTH_HANDLER_PROPERTY.

2015-02-19 Thread zhihai xu (JIRA)
zhihai xu created YARN-3236: --- Summary: cleanup RMAuthenticationFilter#AUTH_HANDLER_PROPERTY. Key: YARN-3236 URL: https://issues.apache.org/jira/browse/YARN-3236 Project: Hadoop YARN Issue Type:

[jira] [Created] (YARN-3205) FileSystemRMStateStore should disable FileSystem Cache to avoid get a Filesystem with an old configuration.

2015-02-16 Thread zhihai xu (JIRA)
zhihai xu created YARN-3205: --- Summary: FileSystemRMStateStore should disable FileSystem Cache to avoid get a Filesystem with an old configuration. Key: YARN-3205 URL: https://issues.apache.org/jira/browse/YARN-3205

[jira] [Created] (YARN-3114) It would be better to consider integer(long) overflow when compare the time in DelegationTokenRenewer.

2015-01-29 Thread zhihai xu (JIRA)
zhihai xu created YARN-3114: --- Summary: It would be better to consider integer(long) overflow when compare the time in DelegationTokenRenewer. Key: YARN-3114 URL: https://issues.apache.org/jira/browse/YARN-3114

[jira] [Created] (YARN-3106) The message in IllegalArgumentException gave wrong information in NMTokenSecretManagerInRM.java and RMContainerTokenSecretManager.java

2015-01-28 Thread zhihai xu (JIRA)
zhihai xu created YARN-3106: --- Summary: The message in IllegalArgumentException gave wrong information in NMTokenSecretManagerInRM.java and RMContainerTokenSecretManager.java Key: YARN-3106 URL:

[jira] [Created] (YARN-3079) Scheduler should also update maximumAllocation when updateNodeResource.

2015-01-21 Thread zhihai xu (JIRA)
zhihai xu created YARN-3079: --- Summary: Scheduler should also update maximumAllocation when updateNodeResource. Key: YARN-3079 URL: https://issues.apache.org/jira/browse/YARN-3079 Project: Hadoop YARN

[jira] [Resolved] (YARN-2679) Add metric for container launch duration

2015-01-13 Thread zhihai xu (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2679?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhihai xu resolved YARN-2679. - Resolution: Fixed Add metric for container launch duration

[jira] [Created] (YARN-3056) add verification for containerLaunchDuration in TestNodeManagerMetrics.

2015-01-13 Thread zhihai xu (JIRA)
zhihai xu created YARN-3056: --- Summary: add verification for containerLaunchDuration in TestNodeManagerMetrics. Key: YARN-3056 URL: https://issues.apache.org/jira/browse/YARN-3056 Project: Hadoop YARN

[jira] [Created] (YARN-3023) Race condition in ZKRMStateStore#createWithRetries from ZooKeeper cause RM crash

2015-01-08 Thread zhihai xu (JIRA)
zhihai xu created YARN-3023: --- Summary: Race condition in ZKRMStateStore#createWithRetries from ZooKeeper cause RM crash Key: YARN-3023 URL: https://issues.apache.org/jira/browse/YARN-3023 Project: Hadoop

[jira] [Resolved] (YARN-3023) Race condition in ZKRMStateStore#createWithRetries from ZooKeeper cause RM crash

2015-01-08 Thread zhihai xu (JIRA)
[ https://issues.apache.org/jira/browse/YARN-3023?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhihai xu resolved YARN-3023. - Resolution: Duplicate Race condition in ZKRMStateStore#createWithRetries from ZooKeeper cause RM crash

[jira] [Created] (YARN-2873) improve LevelDB error handling for missing files DBException to avoid NM start failure.

2014-11-17 Thread zhihai xu (JIRA)
zhihai xu created YARN-2873: --- Summary: improve LevelDB error handling for missing files DBException to avoid NM start failure. Key: YARN-2873 URL: https://issues.apache.org/jira/browse/YARN-2873 Project:

[jira] [Created] (YARN-2831) NM should kill and cleanup the leaked containers.

2014-11-07 Thread zhihai xu (JIRA)
zhihai xu created YARN-2831: --- Summary: NM should kill and cleanup the leaked containers. Key: YARN-2831 URL: https://issues.apache.org/jira/browse/YARN-2831 Project: Hadoop YARN Issue Type: Bug

[jira] [Created] (YARN-2820) Improve FileSystemRMStateStore update failure exception handling to not shutdown RM.

2014-11-06 Thread zhihai xu (JIRA)
zhihai xu created YARN-2820: --- Summary: Improve FileSystemRMStateStore update failure exception handling to not shutdown RM. Key: YARN-2820 URL: https://issues.apache.org/jira/browse/YARN-2820 Project:

[jira] [Created] (YARN-2816) NM fail to start with NPE during container recovery

2014-11-05 Thread zhihai xu (JIRA)
zhihai xu created YARN-2816: --- Summary: NM fail to start with NPE during container recovery Key: YARN-2816 URL: https://issues.apache.org/jira/browse/YARN-2816 Project: Hadoop YARN Issue Type: Bug

[jira] [Created] (YARN-2802) add AM container launch and register delay metrics in QueueMetrics to help diagnose performance issue.

2014-11-03 Thread zhihai xu (JIRA)
zhihai xu created YARN-2802: --- Summary: add AM container launch and register delay metrics in QueueMetrics to help diagnose performance issue. Key: YARN-2802 URL: https://issues.apache.org/jira/browse/YARN-2802

[jira] [Created] (YARN-2799) cleanup TestLogAggregationService based on the change in YARN-90

2014-11-02 Thread zhihai xu (JIRA)
zhihai xu created YARN-2799: --- Summary: cleanup TestLogAggregationService based on the change in YARN-90 Key: YARN-2799 URL: https://issues.apache.org/jira/browse/YARN-2799 Project: Hadoop YARN

[jira] [Created] (YARN-2753) potential NPE in checkRemoveLabelsFromNode of CommonNodeLabelsManager

2014-10-27 Thread zhihai xu (JIRA)
zhihai xu created YARN-2753: --- Summary: potential NPE in checkRemoveLabelsFromNode of CommonNodeLabelsManager Key: YARN-2753 URL: https://issues.apache.org/jira/browse/YARN-2753 Project: Hadoop YARN

[jira] [Created] (YARN-2754) addToCluserNodeLabels should be protected by writeLock in RMNodeLabelsManager.java.

2014-10-27 Thread zhihai xu (JIRA)
zhihai xu created YARN-2754: --- Summary: addToCluserNodeLabels should be protected by writeLock in RMNodeLabelsManager.java. Key: YARN-2754 URL: https://issues.apache.org/jira/browse/YARN-2754 Project:

[jira] [Created] (YARN-2756) use static variable (Resources.none()) for not-running Node.resource in CommonNodeLabelsManager to save memory.

2014-10-27 Thread zhihai xu (JIRA)
zhihai xu created YARN-2756: --- Summary: use static variable (Resources.none()) for not-running Node.resource in CommonNodeLabelsManager to save memory. Key: YARN-2756 URL: https://issues.apache.org/jira/browse/YARN-2756

[jira] [Created] (YARN-2757) potential NPE in checkNodeLabelExpression of SchedulerUtils for nodeLabels.

2014-10-27 Thread zhihai xu (JIRA)
zhihai xu created YARN-2757: --- Summary: potential NPE in checkNodeLabelExpression of SchedulerUtils for nodeLabels. Key: YARN-2757 URL: https://issues.apache.org/jira/browse/YARN-2757 Project: Hadoop YARN

[jira] [Created] (YARN-2759) addToCluserNodeLabels should not change the value in labelCollections if the key already exists to avoid the Label.resource is reset.

2014-10-27 Thread zhihai xu (JIRA)
zhihai xu created YARN-2759: --- Summary: addToCluserNodeLabels should not change the value in labelCollections if the key already exists to avoid the Label.resource is reset. Key: YARN-2759 URL:

[jira] [Created] (YARN-2735) diskUtilizationPercentageCutoff and diskUtilizationSpaceCutoff are initialized twice in DirectoryCollection

2014-10-23 Thread zhihai xu (JIRA)
zhihai xu created YARN-2735: --- Summary: diskUtilizationPercentageCutoff and diskUtilizationSpaceCutoff are initialized twice in DirectoryCollection Key: YARN-2735 URL: https://issues.apache.org/jira/browse/YARN-2735

[jira] [Created] (YARN-2682) WindowsSecureContainerExecutor should not depend on DefaultContainerExecutor#getFirstApplicationDir.

2014-10-13 Thread zhihai xu (JIRA)
zhihai xu created YARN-2682: --- Summary: WindowsSecureContainerExecutor should not depend on DefaultContainerExecutor#getFirstApplicationDir. Key: YARN-2682 URL: https://issues.apache.org/jira/browse/YARN-2682

[jira] [Created] (YARN-2641) improve node decommission latency in RM.

2014-10-02 Thread zhihai xu (JIRA)
zhihai xu created YARN-2641: --- Summary: improve node decommission latency in RM. Key: YARN-2641 URL: https://issues.apache.org/jira/browse/YARN-2641 Project: Hadoop YARN Issue Type: Improvement

[jira] [Created] (YARN-2623) Linux container executor only use the first local directory to copy token file in container-executor.c.

2014-09-29 Thread zhihai xu (JIRA)
zhihai xu created YARN-2623: --- Summary: Linux container executor only use the first local directory to copy token file in container-executor.c. Key: YARN-2623 URL: https://issues.apache.org/jira/browse/YARN-2623

[jira] [Created] (YARN-2566) IOException happen in startLocalizer of DefaultContainerExecutor due to not enough disk space for the first localDir.

2014-09-17 Thread zhihai xu (JIRA)
zhihai xu created YARN-2566: --- Summary: IOException happen in startLocalizer of DefaultContainerExecutor due to not enough disk space for the first localDir. Key: YARN-2566 URL:

[jira] [Created] (YARN-2534) FairScheduler: totalMaxShare is not calculated correctly in computeSharesInternal

2014-09-10 Thread zhihai xu (JIRA)
zhihai xu created YARN-2534: --- Summary: FairScheduler: totalMaxShare is not calculated correctly in computeSharesInternal Key: YARN-2534 URL: https://issues.apache.org/jira/browse/YARN-2534 Project: Hadoop

[jira] [Created] (YARN-2452) TestRMApplicationHistoryWriter is failed for FairScheduler

2014-08-25 Thread zhihai xu (JIRA)
zhihai xu created YARN-2452: --- Summary: TestRMApplicationHistoryWriter is failed for FairScheduler Key: YARN-2452 URL: https://issues.apache.org/jira/browse/YARN-2452 Project: Hadoop YARN Issue

[jira] [Created] (YARN-2453) TestProportionalCapacityPreemptionPolicy is failed for FairScheduler

2014-08-25 Thread zhihai xu (JIRA)
zhihai xu created YARN-2453: --- Summary: TestProportionalCapacityPreemptionPolicy is failed for FairScheduler Key: YARN-2453 URL: https://issues.apache.org/jira/browse/YARN-2453 Project: Hadoop YARN

[jira] [Created] (YARN-2376) Too many threads blocking on the global JobTracker lock from getJobCounters, optimize getJobCounters to release global JobTracker lock before access the per job counter in

2014-07-31 Thread zhihai xu (JIRA)
zhihai xu created YARN-2376: --- Summary: Too many threads blocking on the global JobTracker lock from getJobCounters, optimize getJobCounters to release global JobTracker lock before access the per job counter in JobInProgress Key:

[jira] [Resolved] (YARN-2376) Too many threads blocking on the global JobTracker lock from getJobCounters, optimize getJobCounters to release global JobTracker lock before access the per job counter i

2014-07-31 Thread zhihai xu (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2376?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhihai xu resolved YARN-2376. - Resolution: Duplicate Too many threads blocking on the global JobTracker lock from getJobCounters,

[jira] [Created] (YARN-2359) Application is hung without timeout and retry after DNS/network is down.

2014-07-25 Thread zhihai xu (JIRA)
zhihai xu created YARN-2359: --- Summary: Application is hung without timeout and retry after DNS/network is down. Key: YARN-2359 URL: https://issues.apache.org/jira/browse/YARN-2359 Project: Hadoop YARN

[jira] [Created] (YARN-2361) remove duplicate entries (EXPIRE event) in the EnumSet of event type in RMAppAttempt state machine

2014-07-25 Thread zhihai xu (JIRA)
zhihai xu created YARN-2361: --- Summary: remove duplicate entries (EXPIRE event) in the EnumSet of event type in RMAppAttempt state machine Key: YARN-2361 URL: https://issues.apache.org/jira/browse/YARN-2361

[jira] [Created] (YARN-2337) remove duplication function call (setClientRMService) in resource manage class

2014-07-23 Thread zhihai xu (JIRA)
zhihai xu created YARN-2337: --- Summary: remove duplication function call (setClientRMService) in resource manage class Key: YARN-2337 URL: https://issues.apache.org/jira/browse/YARN-2337 Project: Hadoop

[jira] [Created] (YARN-2324) Race condition in continuousScheduling for FairScheduler

2014-07-20 Thread zhihai xu (JIRA)
zhihai xu created YARN-2324: --- Summary: Race condition in continuousScheduling for FairScheduler Key: YARN-2324 URL: https://issues.apache.org/jira/browse/YARN-2324 Project: Hadoop YARN Issue Type:

[jira] [Created] (YARN-2325) need check whether node is null in nodeUpdate for FairScheduler

2014-07-20 Thread zhihai xu (JIRA)
zhihai xu created YARN-2325: --- Summary: need check whether node is null in nodeUpdate for FairScheduler Key: YARN-2325 URL: https://issues.apache.org/jira/browse/YARN-2325 Project: Hadoop YARN

[jira] [Created] (YARN-2315) Should use setCurrentCapacity instead of setCapacity to configure used resource capacity for FairScheduler.

2014-07-17 Thread zhihai xu (JIRA)
zhihai xu created YARN-2315: --- Summary: Should use setCurrentCapacity instead of setCapacity to configure used resource capacity for FairScheduler. Key: YARN-2315 URL: https://issues.apache.org/jira/browse/YARN-2315