zhihai xu created YARN-6396:
---
Summary: Call verifyAndCreateRemoteLogDir at service
initialization instead of application initialization to decrease load for name
node
Key: YARN-6396
URL:
zhihai xu created YARN-6392:
---
Summary: add submit time to Application Summary log
Key: YARN-6392
URL: https://issues.apache.org/jira/browse/YARN-6392
Project: Hadoop YARN
Issue Type: Improvement
zhihai xu created YARN-4979:
---
Summary: FSAppAttempt adds duplicate ResourceRequest to demand in
updateDemand.
Key: YARN-4979
URL: https://issues.apache.org/jira/browse/YARN-4979
Project: Hadoop YARN
zhihai xu created YARN-4458:
---
Summary: Compilation error at branch-2.7 due to
getNodeLabelExpression not defined in NMContainerStatusPBImpl.
Key: YARN-4458
URL: https://issues.apache.org/jira/browse/YARN-4458
zhihai xu created YARN-4209:
---
Summary: RMStateStore FENCED state doesn’t work
Key: YARN-4209
URL: https://issues.apache.org/jira/browse/YARN-4209
Project: Hadoop YARN
Issue Type: Bug
[
https://issues.apache.org/jira/browse/YARN-4190?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
zhihai xu resolved YARN-4190.
-
Resolution: Later
> missing container information in FairScheduler preemption log.
>
zhihai xu created YARN-4187:
---
Summary: Yarn Client uses local address instead of RM address as
token renewer in a secure cluster when HA is enabled.
Key: YARN-4187
URL: https://issues.apache.org/jira/browse/YARN-4187
zhihai xu created YARN-4190:
---
Summary: Add container information in FairScheduler preemption log
to help debug.
Key: YARN-4190
URL: https://issues.apache.org/jira/browse/YARN-4190
Project: Hadoop YARN
zhihai xu created YARN-4153:
---
Summary: TestAsyncDispatcher failed at branch-2.7
Key: YARN-4153
URL: https://issues.apache.org/jira/browse/YARN-4153
Project: Hadoop YARN
Issue Type: Bug
zhihai xu created YARN-4095:
---
Summary: Avoid sharing AllocatorPerContext object in
LocalDirAllocator between ShuffleHandler and LocalDirsHandlerService.
Key: YARN-4095
URL:
[
https://issues.apache.org/jira/browse/YARN-3857?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
zhihai xu resolved YARN-3857.
-
Resolution: Fixed
Memory leak in ResourceManager with SIMPLE mode
zhihai xu created YARN-3943:
---
Summary: Use separate threshold configurations for disk-full
detection and disk-not-full detection.
Key: YARN-3943
URL: https://issues.apache.org/jira/browse/YARN-3943
zhihai xu created YARN-3925:
---
Summary: ContainerLogsUtils#getContainerLogFile fails to read
container log files from full disks.
Key: YARN-3925
URL: https://issues.apache.org/jira/browse/YARN-3925
Project:
zhihai xu created YARN-3882:
---
Summary: AggregatedLogFormat should close aclScanner and
ownerScanner after creating them.
Key: YARN-3882
URL: https://issues.apache.org/jira/browse/YARN-3882
Project: Hadoop
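As an illustration of the YARN-3882 summary above, here is a minimal sketch of closing two scanner-like resources right after use with try-with-resources. TrackingScanner and readAclAndOwner are hypothetical names invented for this sketch; this is not the AggregatedLogFormat code or the actual patch.

```java
public class ScannerCloseSketch {
    // Hypothetical scanner-like resource; it only records whether close() was called.
    static final class TrackingScanner implements AutoCloseable {
        private final String value;
        boolean closed = false;

        TrackingScanner(String value) {
            this.value = value;
        }

        String next() {
            if (closed) {
                throw new IllegalStateException("scanner already closed");
            }
            return value;
        }

        @Override
        public void close() {
            closed = true;
        }
    }

    // try-with-resources closes both scanners when the block exits,
    // whether it returns normally or next() throws.
    static String readAclAndOwner(TrackingScanner aclScanner, TrackingScanner ownerScanner) {
        try (TrackingScanner acl = aclScanner; TrackingScanner owner = ownerScanner) {
            return acl.next() + "/" + owner.next();
        }
    }

    public static void main(String[] args) {
        TrackingScanner acl = new TrackingScanner("acl");
        TrackingScanner owner = new TrackingScanner("owner");
        System.out.println(readAclAndOwner(acl, owner));
        System.out.println(acl.closed && owner.closed);
    }
}
```

The point of the design is that the caller cannot forget the close call: the resource lifetime is tied to the block that uses it.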
[
https://issues.apache.org/jira/browse/YARN-3549?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
zhihai xu resolved YARN-3549.
-
Resolution: Duplicate
use JNI-based FileStatus implementation from
io.nativeio.NativeIO.POSIX#getFstat
zhihai xu created YARN-3802:
---
Summary: Two RMNodes for the same NodeId are used in RM sometimes
after NM is reconnected.
Key: YARN-3802
URL: https://issues.apache.org/jira/browse/YARN-3802
Project: Hadoop
zhihai xu created YARN-3780:
---
Summary: Should use equals when comparing Resource in
RMNodeImpl#ReconnectNodeTransition
Key: YARN-3780
URL: https://issues.apache.org/jira/browse/YARN-3780
Project: Hadoop YARN
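The distinction named in the YARN-3780 summary above (reference comparison versus equals) can be sketched as follows. Res is a hypothetical stand-in written for this example, not Hadoop's Resource class, and the two helper methods are likewise assumptions.

```java
import java.util.Objects;

public class ResourceCompareSketch {
    // Hypothetical stand-in for a resource record; not Hadoop's Resource class.
    static final class Res {
        final long memoryMb;
        final int vcores;

        Res(long memoryMb, int vcores) {
            this.memoryMb = memoryMb;
            this.vcores = vcores;
        }

        @Override
        public boolean equals(Object o) {
            if (this == o) {
                return true;
            }
            if (!(o instanceof Res)) {
                return false;
            }
            Res r = (Res) o;
            return memoryMb == r.memoryMb && vcores == r.vcores;
        }

        @Override
        public int hashCode() {
            return Objects.hash(memoryMb, vcores);
        }
    }

    // Reference comparison: reports a change whenever the two objects are distinct instances.
    static boolean changedByReference(Res oldRes, Res newRes) {
        return oldRes != newRes;
    }

    // Value comparison: reports a change only when the contents differ.
    static boolean changedByValue(Res oldRes, Res newRes) {
        return !oldRes.equals(newRes);
    }

    public static void main(String[] args) {
        Res before = new Res(8192, 4);
        Res after = new Res(8192, 4);
        System.out.println(changedByReference(before, after)); // true
        System.out.println(changedByValue(before, after));     // false
    }
}
```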
zhihai xu created YARN-3777:
---
Summary: Move all reservation-related tests from TestFairScheduler
to TestFairSchedulerReservations.
Key: YARN-3777
URL: https://issues.apache.org/jira/browse/YARN-3777
zhihai xu created YARN-3727:
---
Summary: For better error recovery, check if the directory exists
before using it for localization.
Key: YARN-3727
URL: https://issues.apache.org/jira/browse/YARN-3727
zhihai xu created YARN-3713:
---
Summary: Remove duplicate function call storeContainerDiagnostics
in ContainerDiagnosticsUpdateTransition
Key: YARN-3713
URL: https://issues.apache.org/jira/browse/YARN-3713
zhihai xu created YARN-3710:
---
Summary: FairScheduler: Should allocate more containers for
assign-multiple after assignReservedContainer turns the reservation into an
allocation.
Key: YARN-3710
URL:
zhihai xu created YARN-3697:
---
Summary: FairScheduler: ContinuousSchedulingThread can't be
shutdown after stop sometimes.
Key: YARN-3697
URL: https://issues.apache.org/jira/browse/YARN-3697
Project: Hadoop
zhihai xu created YARN-3667:
---
Summary: Fix findbugs warning Inconsistent synchronization of
org.apache.hadoop.yarn.server.resourcemanager.recovery.FileSystemRMStateStore.isHDFS
Key: YARN-3667
URL:
zhihai xu created YARN-3655:
---
Summary: FairScheduler: potential deadlock due to maxAMShare
limitation and container reservation
Key: YARN-3655
URL: https://issues.apache.org/jira/browse/YARN-3655
Project:
zhihai xu created YARN-3604:
---
Summary: removeApplication in ZKRMStateStore should also disable
watch.
Key: YARN-3604
URL: https://issues.apache.org/jira/browse/YARN-3604
Project: Hadoop YARN
zhihai xu created YARN-3602:
---
Summary:
TestResourceLocalizationService.testPublicResourceInitializesLocalDir fails
intermittently due to IOException from cleanup
Key: YARN-3602
URL:
[
https://issues.apache.org/jira/browse/YARN-3114?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
zhihai xu resolved YARN-3114.
-
Resolution: Not A Problem
It would be better to consider integer(long) overflow when compare the time
[
https://issues.apache.org/jira/browse/YARN-3190?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
zhihai xu resolved YARN-3190.
-
Resolution: Duplicate
issue is fixed by YARN-2964
NM can't aggregate logs: token can't be found in
zhihai xu created YARN-3516:
---
Summary: killing ContainerLocalizer action doesn't take effect
when private localizer receives FETCH_FAILURE status.
Key: YARN-3516
URL: https://issues.apache.org/jira/browse/YARN-3516
[
https://issues.apache.org/jira/browse/YARN-3496?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
zhihai xu resolved YARN-3496.
-
Resolution: Not A Problem
Add a configuration to disable/enable storing localization state in
zhihai xu created YARN-3491:
---
Summary: Improve the public resource localization to do both
FSDownload submission to the thread pool and completed localization handling in
one thread (PublicLocalizer).
Key: YARN-3491
zhihai xu created YARN-3496:
---
Summary: Add a configuration to disable/enable storing
localization state in NM StateStore
Key: YARN-3496
URL: https://issues.apache.org/jira/browse/YARN-3496
Project: Hadoop
zhihai xu created YARN-3465:
---
Summary: use LinkedHashMap to keep the order of
LocalResourceRequest in ContainerImpl
Key: YARN-3465
URL: https://issues.apache.org/jira/browse/YARN-3465
Project: Hadoop YARN
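For the YARN-3465 summary above, a minimal sketch of why a LinkedHashMap helps: it iterates in insertion order, which a plain HashMap does not guarantee. The request names and the "PENDING" value are made up for the example; this is not the ContainerImpl code.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

public class RequestOrderSketch {
    // Builds a map whose iteration order matches the order in which names were added.
    static Map<String, String> buildOrdered(List<String> names) {
        Map<String, String> pending = new LinkedHashMap<>();
        for (String name : names) {
            pending.put(name, "PENDING"); // placeholder value, hypothetical
        }
        return pending;
    }

    // Returns the keys in the map's iteration order.
    static List<String> keysInIterationOrder(Map<String, String> map) {
        return new ArrayList<>(map.keySet());
    }

    public static void main(String[] args) {
        List<String> requested = Arrays.asList("job.jar", "conf.xml", "app.tar.gz");
        System.out.println(keysInIterationOrder(buildOrdered(requested)));
    }
}
```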
zhihai xu created YARN-3464:
---
Summary: Race condition in LocalizerRunner causes container
localization timeout.
Key: YARN-3464
URL: https://issues.apache.org/jira/browse/YARN-3464
Project: Hadoop YARN
zhihai xu created YARN-3446:
---
Summary: FairScheduler HeadRoom calculation should exclude nodes
in the blacklist.
Key: YARN-3446
URL: https://issues.apache.org/jira/browse/YARN-3446
Project: Hadoop YARN
zhihai xu created YARN-3429:
---
Summary: TestAMRMTokens.testTokenExpiry fails intermittently with
error message: Invalid AMRMToken from appattempt_1427804754787_0001_01
Key: YARN-3429
URL:
zhihai xu created YARN-3395:
---
Summary: Handle the user name correctly when submit application
and use user name as default queue name.
Key: YARN-3395
URL: https://issues.apache.org/jira/browse/YARN-3395
zhihai xu created YARN-3385:
---
Summary: Race condition: KeeperException$NoNodeException will
cause RM shutdown during ZK node deletion(Op.delete).
Key: YARN-3385
URL: https://issues.apache.org/jira/browse/YARN-3385
zhihai xu created YARN-3363:
---
Summary: add localization and container launch time to
ContainerMetrics at NM to show these timing information for each active
container.
Key: YARN-3363
URL:
zhihai xu created YARN-3355:
---
Summary: findbugs warning:Inconsistent synchronization of
org.apache.hadoop.yarn.server.resourcemanager.scheduler.fair.FairScheduler.allocConf
Key: YARN-3355
URL:
zhihai xu created YARN-3349:
---
Summary: treat all exceptions as failure in
testFSRMStateStoreClientRetry
Key: YARN-3349
URL: https://issues.apache.org/jira/browse/YARN-3349
Project: Hadoop YARN
[
https://issues.apache.org/jira/browse/YARN-3263?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
zhihai xu resolved YARN-3263.
-
Resolution: Not A Problem
This is not an issue.
tokens.rewind() is called before
zhihai xu created YARN-3341:
---
Summary: Fix findbugs warning:BC_UNCONFIRMED_CAST at
FSSchedulerNode.reserveResource
Key: YARN-3341
URL: https://issues.apache.org/jira/browse/YARN-3341
Project: Hadoop YARN
zhihai xu created YARN-3336:
---
Summary: FileSystem memory leak in DelegationTokenRenewer
Key: YARN-3336
URL: https://issues.apache.org/jira/browse/YARN-3336
Project: Hadoop YARN
Issue Type: Bug
zhihai xu created YARN-3263:
---
Summary: ContainerManagerImpl#parseCredentials don't rewind the
ByteBuffer after credentials.readTokenStorageStream
Key: YARN-3263
URL: https://issues.apache.org/jira/browse/YARN-3263
zhihai xu created YARN-3247:
---
Summary: TestQueueMappings failure for FairScheduler
Key: YARN-3247
URL: https://issues.apache.org/jira/browse/YARN-3247
Project: Hadoop YARN
Issue Type: Bug
zhihai xu created YARN-3242:
---
Summary: Old ZK client session watcher event messed up new ZK
client session due to ZooKeeper asynchronously closing client session.
Key: YARN-3242
URL:
zhihai xu created YARN-3241:
---
Summary: Leading space, trailing space and empty sub queue name
may cause MetricsException for fair scheduler
Key: YARN-3241
URL: https://issues.apache.org/jira/browse/YARN-3241
zhihai xu created YARN-3236:
---
Summary: cleanup RMAuthenticationFilter#AUTH_HANDLER_PROPERTY.
Key: YARN-3236
URL: https://issues.apache.org/jira/browse/YARN-3236
Project: Hadoop YARN
Issue Type:
zhihai xu created YARN-3205:
---
Summary: FileSystemRMStateStore should disable FileSystem Cache to
avoid get a Filesystem with an old configuration.
Key: YARN-3205
URL: https://issues.apache.org/jira/browse/YARN-3205
zhihai xu created YARN-3114:
---
Summary: It would be better to consider integer(long) overflow
when compare the time in DelegationTokenRenewer.
Key: YARN-3114
URL: https://issues.apache.org/jira/browse/YARN-3114
zhihai xu created YARN-3106:
---
Summary: The message in IllegalArgumentException gave wrong
information in NMTokenSecretManagerInRM.java and
RMContainerTokenSecretManager.java
Key: YARN-3106
URL:
zhihai xu created YARN-3079:
---
Summary: Scheduler should also update maximumAllocation when
updateNodeResource.
Key: YARN-3079
URL: https://issues.apache.org/jira/browse/YARN-3079
Project: Hadoop YARN
[
https://issues.apache.org/jira/browse/YARN-2679?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
zhihai xu resolved YARN-2679.
-
Resolution: Fixed
Add metric for container launch duration
zhihai xu created YARN-3056:
---
Summary: add verification for containerLaunchDuration in
TestNodeManagerMetrics.
Key: YARN-3056
URL: https://issues.apache.org/jira/browse/YARN-3056
Project: Hadoop YARN
zhihai xu created YARN-3023:
---
Summary: Race condition in ZKRMStateStore#createWithRetries from
ZooKeeper cause RM crash
Key: YARN-3023
URL: https://issues.apache.org/jira/browse/YARN-3023
Project: Hadoop
[
https://issues.apache.org/jira/browse/YARN-3023?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
zhihai xu resolved YARN-3023.
-
Resolution: Duplicate
Race condition in ZKRMStateStore#createWithRetries from ZooKeeper cause RM
crash
zhihai xu created YARN-2873:
---
Summary: improve LevelDB error handling for missing files
DBException to avoid NM start failure.
Key: YARN-2873
URL: https://issues.apache.org/jira/browse/YARN-2873
Project:
zhihai xu created YARN-2831:
---
Summary: NM should kill and cleanup the leaked containers.
Key: YARN-2831
URL: https://issues.apache.org/jira/browse/YARN-2831
Project: Hadoop YARN
Issue Type: Bug
zhihai xu created YARN-2820:
---
Summary: Improve FileSystemRMStateStore update failure exception
handling to not shutdown RM.
Key: YARN-2820
URL: https://issues.apache.org/jira/browse/YARN-2820
Project:
zhihai xu created YARN-2816:
---
Summary: NM fails to start with NPE during container recovery
Key: YARN-2816
URL: https://issues.apache.org/jira/browse/YARN-2816
Project: Hadoop YARN
Issue Type: Bug
zhihai xu created YARN-2802:
---
Summary: add AM container launch and register delay metrics in
QueueMetrics to help diagnose performance issue.
Key: YARN-2802
URL: https://issues.apache.org/jira/browse/YARN-2802
zhihai xu created YARN-2799:
---
Summary: cleanup TestLogAggregationService based on the change in
YARN-90
Key: YARN-2799
URL: https://issues.apache.org/jira/browse/YARN-2799
Project: Hadoop YARN
zhihai xu created YARN-2753:
---
Summary: potential NPE in checkRemoveLabelsFromNode of
CommonNodeLabelsManager
Key: YARN-2753
URL: https://issues.apache.org/jira/browse/YARN-2753
Project: Hadoop YARN
zhihai xu created YARN-2754:
---
Summary: addToCluserNodeLabels should be protected by writeLock in
RMNodeLabelsManager.java.
Key: YARN-2754
URL: https://issues.apache.org/jira/browse/YARN-2754
Project:
zhihai xu created YARN-2756:
---
Summary: use static variable (Resources.none()) for not-running
Node.resource in CommonNodeLabelsManager to save memory.
Key: YARN-2756
URL: https://issues.apache.org/jira/browse/YARN-2756
zhihai xu created YARN-2757:
---
Summary: potential NPE in checkNodeLabelExpression of
SchedulerUtils for nodeLabels.
Key: YARN-2757
URL: https://issues.apache.org/jira/browse/YARN-2757
Project: Hadoop YARN
zhihai xu created YARN-2759:
---
Summary: addToCluserNodeLabels should not change the value in
labelCollections if the key already exists to avoid the Label.resource is reset.
Key: YARN-2759
URL:
zhihai xu created YARN-2735:
---
Summary: diskUtilizationPercentageCutoff and
diskUtilizationSpaceCutoff are initialized twice in DirectoryCollection
Key: YARN-2735
URL: https://issues.apache.org/jira/browse/YARN-2735
zhihai xu created YARN-2682:
---
Summary: WindowsSecureContainerExecutor should not depend on
DefaultContainerExecutor#getFirstApplicationDir.
Key: YARN-2682
URL: https://issues.apache.org/jira/browse/YARN-2682
zhihai xu created YARN-2641:
---
Summary: improve node decommission latency in RM.
Key: YARN-2641
URL: https://issues.apache.org/jira/browse/YARN-2641
Project: Hadoop YARN
Issue Type: Improvement
zhihai xu created YARN-2623:
---
Summary: Linux container executor only uses the first local
directory to copy token file in container-executor.c.
Key: YARN-2623
URL: https://issues.apache.org/jira/browse/YARN-2623
zhihai xu created YARN-2566:
---
Summary: IOException happens in startLocalizer of
DefaultContainerExecutor due to not enough disk space for the first localDir.
Key: YARN-2566
URL:
zhihai xu created YARN-2534:
---
Summary: FairScheduler: totalMaxShare is not calculated correctly
in computeSharesInternal
Key: YARN-2534
URL: https://issues.apache.org/jira/browse/YARN-2534
Project: Hadoop
zhihai xu created YARN-2452:
---
Summary: TestRMApplicationHistoryWriter is failed for FairScheduler
Key: YARN-2452
URL: https://issues.apache.org/jira/browse/YARN-2452
Project: Hadoop YARN
Issue
zhihai xu created YARN-2453:
---
Summary: TestProportionalCapacityPreemptionPolicy is failed for
FairScheduler
Key: YARN-2453
URL: https://issues.apache.org/jira/browse/YARN-2453
Project: Hadoop YARN
zhihai xu created YARN-2376:
---
Summary: Too many threads blocking on the global JobTracker lock
from getJobCounters, optimize getJobCounters to release global JobTracker lock
before access the per job counter in JobInProgress
Key:
[
https://issues.apache.org/jira/browse/YARN-2376?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
zhihai xu resolved YARN-2376.
-
Resolution: Duplicate
Too many threads blocking on the global JobTracker lock from getJobCounters,
zhihai xu created YARN-2359:
---
Summary: Application is hung without timeout and retry after
DNS/network is down.
Key: YARN-2359
URL: https://issues.apache.org/jira/browse/YARN-2359
Project: Hadoop YARN
zhihai xu created YARN-2361:
---
Summary: remove duplicate entries (EXPIRE event) in the EnumSet of
event type in RMAppAttempt state machine
Key: YARN-2361
URL: https://issues.apache.org/jira/browse/YARN-2361
zhihai xu created YARN-2337:
---
Summary: remove duplication function call (setClientRMService) in
resource manage class
Key: YARN-2337
URL: https://issues.apache.org/jira/browse/YARN-2337
Project: Hadoop
zhihai xu created YARN-2324:
---
Summary: Race condition in continuousScheduling for FairScheduler
Key: YARN-2324
URL: https://issues.apache.org/jira/browse/YARN-2324
Project: Hadoop YARN
Issue Type:
zhihai xu created YARN-2325:
---
Summary: Need to check whether node is null in nodeUpdate for
FairScheduler
Key: YARN-2325
URL: https://issues.apache.org/jira/browse/YARN-2325
Project: Hadoop YARN
zhihai xu created YARN-2315:
---
Summary: Should use setCurrentCapacity instead of setCapacity to
configure used resource capacity for FairScheduler.
Key: YARN-2315
URL: https://issues.apache.org/jira/browse/YARN-2315