[ https://issues.apache.org/jira/browse/YARN-5309?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15384016#comment-15384016 ]
Hadoop QA commented on YARN-5309:
---------------------------------
| (/) *+1 overall* |

|| Vote || Subsystem || Runtime || Comment ||
| 0 | reexec | 13m 21s | Docker mode activated. |
| +1 | @author | 0m 0s | The patch does not contain any @author tags. |
| +1 | test4tests | 0m 0s | The patch appears to include 1 new or modified test files. |
| +1 | mvninstall | 7m 35s | branch-2.8 passed |
| +1 | compile | 0m 28s | branch-2.8 passed with JDK v1.8.0_91 |
| +1 | compile | 0m 31s | branch-2.8 passed with JDK v1.7.0_101 |
| +1 | checkstyle | 0m 21s | branch-2.8 passed |
| +1 | mvnsite | 0m 35s | branch-2.8 passed |
| +1 | mvneclipse | 0m 15s | branch-2.8 passed |
| +1 | findbugs | 1m 13s | branch-2.8 passed |
| +1 | javadoc | 0m 30s | branch-2.8 passed with JDK v1.8.0_91 |
| +1 | javadoc | 0m 34s | branch-2.8 passed with JDK v1.7.0_101 |
| +1 | mvninstall | 0m 29s | the patch passed |
| +1 | compile | 0m 24s | the patch passed with JDK v1.8.0_91 |
| +1 | javac | 0m 24s | the patch passed |
| +1 | compile | 0m 26s | the patch passed with JDK v1.7.0_101 |
| +1 | javac | 0m 26s | the patch passed |
| +1 | checkstyle | 0m 18s | hadoop-yarn-project/hadoop-yarn/hadoop-yarn-common: The patch generated 0 new + 21 unchanged - 2 fixed = 21 total (was 23) |
| +1 | mvnsite | 0m 31s | the patch passed |
| +1 | mvneclipse | 0m 12s | the patch passed |
| +1 | whitespace | 0m 1s | The patch has no whitespace issues. |
| +1 | xml | 0m 0s | The patch has no ill-formed XML file. |
| +1 | findbugs | 1m 29s | the patch passed |
| +1 | javadoc | 0m 28s | the patch passed with JDK v1.8.0_91 |
| +1 | javadoc | 0m 33s | the patch passed with JDK v1.7.0_101 |
| +1 | unit | 2m 15s | hadoop-yarn-common in the patch passed with JDK v1.8.0_91. |
| +1 | unit | 2m 34s | hadoop-yarn-common in the patch passed with JDK v1.7.0_101. |
| +1 | asflicense | 0m 19s | The patch does not generate ASF License warnings. |
| | | 36m 33s | |

|| Subsystem || Report/Notes ||
| Docker | Image:yetus/hadoop:5af2af1 |
| JIRA Patch URL | https://issues.apache.org/jira/secure/attachment/12818795/YARN-5309.branch-2.8.001.patch |
| JIRA Issue | YARN-5309 |
| Optional Tests | asflicense compile javac javadoc mvninstall mvnsite unit xml findbugs checkstyle |
| uname | Linux 2623ade54897 3.13.0-36-lowlatency #63-Ubuntu SMP PREEMPT Wed Sep 3 21:56:12 UTC 2014 x86_64 x86_64 x86_64 GNU/Linux |
| Build tool | maven |
| Personality | /testptch/hadoop/patchprocess/precommit/personality/provided.sh |
| git revision | branch-2.8 / cf79ac5 |
| Default Java | 1.7.0_101 |
| Multi-JDK versions | /usr/lib/jvm/java-8-oracle:1.8.0_91 /usr/lib/jvm/java-7-openjdk-amd64:1.7.0_101 |
| findbugs | v3.0.0 |
| JDK v1.7.0_101 Test Results | https://builds.apache.org/job/PreCommit-YARN-Build/12370/testReport/ |
| modules | C: hadoop-yarn-project/hadoop-yarn/hadoop-yarn-common U: hadoop-yarn-project/hadoop-yarn/hadoop-yarn-common |
| Console output | https://builds.apache.org/job/PreCommit-YARN-Build/12370/console |
| Powered by | Apache Yetus 0.3.0 http://yetus.apache.org |

This message was automatically generated.
> Fix SSLFactory truststore reloader thread leak in TimelineClientImpl
> --------------------------------------------------------------------
>
> Key: YARN-5309
> URL: https://issues.apache.org/jira/browse/YARN-5309
> Project: Hadoop YARN
> Issue Type: Bug
> Components: timelineserver, yarn
> Affects Versions: 2.7.1
> Reporter: Thomas Friedrich
> Assignee: Weiwei Yang
> Priority: Blocker
> Attachments: YARN-5309.001.patch, YARN-5309.002.patch,
> YARN-5309.003.patch, YARN-5309.004.patch, YARN-5309.005.patch,
> YARN-5309.branch-2.7.3.001.patch, YARN-5309.branch-2.8.001.patch
>
>
> We found an issue similar to HADOOP-11368 in TimelineClientImpl. The class
> creates an instance of SSLFactory in newSslConnConfigurator; initializing the
> factory creates a ReloadingX509TrustManager instance, which in turn starts a
> truststore reloader thread.
> However, the SSLFactory is never destroyed, so the truststore reloader
> threads are never stopped.
> This problem was observed by a customer who had SSL enabled in Hadoop and
> submitted many queries against HiveServer2 (HS2). After a few days, the HS2
> instance crashed, and the Java dump showed many (over 13000) threads like
> this:
> "Truststore reloader thread" #126 daemon prio=5 os_prio=0 tid=0x00007f680d2e3000 nid=0x98fd waiting on condition [0x00007f67e482c000]
>    java.lang.Thread.State: TIMED_WAITING (sleeping)
>         at java.lang.Thread.sleep(Native Method)
>         at org.apache.hadoop.security.ssl.ReloadingX509TrustManager.run(ReloadingX509TrustManager.java:225)
>         at java.lang.Thread.run(Thread.java:745)
> HiveServer2 uses the JobClient to submit a job:
> Thread [HiveServer2-Background-Pool: Thread-188] (Suspended (breakpoint at line 89 in ReloadingX509TrustManager))
> owns: Object (id=464)
> owns: Object (id=465)
> owns: Object (id=466)
> owns: ServiceLoader<S> (id=210)
> ReloadingX509TrustManager.<init>(String, String, String, long) line: 89
> FileBasedKeyStoresFactory.init(SSLFactory$Mode) line: 209
> SSLFactory.init() line: 131
> TimelineClientImpl.newSslConnConfigurator(int, Configuration) line: 532
> TimelineClientImpl.newConnConfigurator(Configuration) line: 507
> TimelineClientImpl.serviceInit(Configuration) line: 269
> TimelineClientImpl(AbstractService).init(Configuration) line: 163
> YarnClientImpl.serviceInit(Configuration) line: 169
> YarnClientImpl(AbstractService).init(Configuration) line: 163
> ResourceMgrDelegate.serviceInit(Configuration) line: 102
> ResourceMgrDelegate(AbstractService).init(Configuration) line: 163
> ResourceMgrDelegate.<init>(YarnConfiguration) line: 96
> YARNRunner.<init>(Configuration) line: 112
> YarnClientProtocolProvider.create(Configuration) line: 34
> Cluster.initialize(InetSocketAddress, Configuration) line: 95
> Cluster.<init>(InetSocketAddress, Configuration) line: 82
> Cluster.<init>(Configuration) line: 75
> JobClient.init(JobConf) line: 475
> JobClient.<init>(JobConf) line: 454
> MapRedTask(ExecDriver).execute(DriverContext) line: 401
> MapRedTask.execute(DriverContext) line: 137
> MapRedTask(Task<T>).executeTask() line: 160
> TaskRunner.runSequential() line: 88
> Driver.launchTask(Task<Serializable>, String, boolean, String, int, DriverContext) line: 1653
> Driver.execute() line: 1412
> For every job, a new instance of JobClient/YarnClientImpl/TimelineClientImpl
> is created. Because the HS2 process stays up for days, the truststore
> reloader threads from earlier jobs accumulate in the HS2 process and
> eventually exhaust the available resources.
> A fix similar to HADOOP-11368 seems to be needed in TimelineClientImpl,
> but the class does not have a destroy method to begin with.
> One way to avoid the problem is to disable the YARN timeline service
> (yarn.timeline-service.enabled=false).
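
The leak pattern in the quoted description, and the general shape of the fix it asks for, can be sketched in isolation. The following is a minimal, self-contained Java model and not Hadoop code: ReloadingFactory, DemoClient, ReloaderLeakDemo and the thread name are hypothetical stand-ins for SSLFactory, TimelineClientImpl and a long-lived process such as HS2. It assumes the remedy is for the owning client to destroy its factory when it is stopped; what the attached patches actually do is not shown here.

```java
/** Drives both variants and counts reloader threads that are still alive. */
class ReloaderLeakDemo {
    /** Counts live threads with the demo reloader name, much like reading a thread dump. */
    static long liveReloaders() {
        return Thread.getAllStackTraces().keySet().stream()
                .filter(t -> t.isAlive() && ReloadingFactory.THREAD_NAME.equals(t.getName()))
                .count();
    }

    /** Runs the given number of simulated jobs and returns how many reloader threads they left behind. */
    static long runJobs(int jobs, boolean stopClients) {
        long before = liveReloaders();
        for (int i = 0; i < jobs; i++) {
            DemoClient client = new DemoClient(); // one new client per job, as in the report
            client.start();
            if (stopClients) {
                client.stop(); // the assumed fix: the owner tears down its factory
            }
        }
        return liveReloaders() - before;
    }

    public static void main(String[] args) {
        System.out.println("leaked without stop: " + runJobs(5, false));
        System.out.println("leaked with stop:    " + runJobs(5, true));
    }
}

/** Hypothetical stand-in for a factory whose init() starts a background reloader thread. */
final class ReloadingFactory {
    static final String THREAD_NAME = "demo reloader thread";
    private volatile boolean running;
    private Thread reloader;

    void init() {
        running = true;
        reloader = new Thread(() -> {
            while (running) {
                try {
                    Thread.sleep(10_000); // stands in for the periodic truststore check
                } catch (InterruptedException e) {
                    // destroy() interrupts the sleep; the loop then re-checks 'running'
                }
            }
        }, THREAD_NAME);
        reloader.setDaemon(true);
        reloader.start();
    }

    /** Stops the reloader thread and waits for it to exit. */
    void destroy() {
        running = false;
        reloader.interrupt();
        try {
            reloader.join();
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
        }
    }
}

/** Hypothetical client that owns a factory, in the role the report gives TimelineClientImpl. */
final class DemoClient {
    private final ReloadingFactory factory = new ReloadingFactory();

    void start() { factory.init(); }

    void stop() { factory.destroy(); }
}
```

Run as written, the first line should report 5 leftover threads and the second 0. The leaked threads are daemon threads, so they do not keep the JVM alive, but in a process that stays up for days they accumulate in the same way as the threads in the dump above.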