[jira] [Commented] (HADOOP-14982) Clients using FailoverOnNetworkExceptionRetry can go into a loop if they're used without authenticating with kerberos in HA env
    [ https://issues.apache.org/jira/browse/HADOOP-14982?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16931434#comment-16931434 ]

Adam Antal commented on HADOOP-14982:
-------------------------------------

Indeed, the issue in case of AccessControlException is still unresolved. Opened HADOOP-16580 for the AccessControlException case.

> Clients using FailoverOnNetworkExceptionRetry can go into a loop if they're
> used without authenticating with kerberos in HA env
> ---
>
>                 Key: HADOOP-14982
>                 URL: https://issues.apache.org/jira/browse/HADOOP-14982
>             Project: Hadoop Common
>          Issue Type: Bug
>          Components: common
>            Reporter: Peter Bacsko
>            Assignee: Peter Bacsko
>            Priority: Major
>             Fix For: 3.1.0, 2.10.0
>
>         Attachments: HADOOP-14892-001.patch, HADOOP-14892-002.patch,
> HADOOP-14982-003.patch
>
>
> If HA is configured for the Resource Manager in a secure environment, using
> the mapred client goes into a loop if the user is not authenticated with
> Kerberos.
> {noformat}
> [root@pb6sec-1 ~]# mapred job -list
> 17/10/25 06:37:43 INFO client.ConfiguredRMFailoverProxyProvider: Failing over
> to rm36
> 17/10/25 06:37:43 WARN ipc.Client: Exception encountered while connecting to
> the server : javax.security.sasl.SaslException: GSS initiate failed [Caused
> by GSSException: No valid credentials provided (Mechanism level: Failed to
> find any Kerberos tgt)]
> 17/10/25 06:37:43 INFO retry.RetryInvocationHandler: java.io.IOException:
> Failed on local exception: java.io.IOException:
> javax.security.sasl.SaslException: GSS initiate failed [Caused by
> GSSException: No valid credentials provided (Mechanism level: Failed to find
> any Kerberos tgt)]; Host Details : local host is:
> "host_redacted/IP_redacted"; destination host is: "com.host2.redacted:8032; ,
> while invoking ApplicationClientProtocolPBClientImpl.getApplications over
> rm36 after 1 failover attempts. Trying to failover after sleeping for 160ms.
> 17/10/25 06:37:43 INFO client.ConfiguredRMFailoverProxyProvider: Failing over
> to rm25
> 17/10/25 06:37:43 INFO retry.RetryInvocationHandler:
> java.net.ConnectException: Call From host_redacted/IP_redacted to
> com.host.redacted:8032 failed on connection exception:
> java.net.ConnectException: Connection refused; For more details see:
> http://wiki.apache.org/hadoop/ConnectionRefused, while invoking
> ApplicationClientProtocolPBClientImpl.getApplications over rm25 after 2
> failover attempts. Trying to failover after sleeping for 582ms.
> 17/10/25 06:37:44 INFO client.ConfiguredRMFailoverProxyProvider: Failing over
> to rm36
> 17/10/25 06:37:44 WARN ipc.Client: Exception encountered while connecting to
> the server : javax.security.sasl.SaslException: GSS initiate failed [Caused
> by GSSException: No valid credentials provided (Mechanism level: Failed to
> find any Kerberos tgt)]
> 17/10/25 06:37:44 INFO retry.RetryInvocationHandler: java.io.IOException:
> Failed on local exception: java.io.IOException:
> javax.security.sasl.SaslException: GSS initiate failed [Caused by
> GSSException: No valid credentials provided (Mechanism level: Failed to find
> any Kerberos tgt)]; Host Details : local host is:
> "host_redacted/IP_redacted"; destination host is: "com.host2.redacted:8032; ,
> while invoking ApplicationClientProtocolPBClientImpl.getApplications over
> rm36 after 3 failover attempts. Trying to failover after sleeping for 977ms.
> 17/10/25 06:37:45 INFO client.ConfiguredRMFailoverProxyProvider: Failing over
> to rm25
> 17/10/25 06:37:45 INFO retry.RetryInvocationHandler:
> java.net.ConnectException: Call From host_redacted/IP_redacted to
> com.host.redacted:8032 failed on connection exception:
> java.net.ConnectException: Connection refused; For more details see:
> http://wiki.apache.org/hadoop/ConnectionRefused, while invoking
> ApplicationClientProtocolPBClientImpl.getApplications over rm25 after 4
> failover attempts. Trying to failover after sleeping for 1667ms.
> 17/10/25 06:37:46 INFO client.ConfiguredRMFailoverProxyProvider: Failing over
> to rm36
> 17/10/25 06:37:46 WARN ipc.Client: Exception encountered while connecting to
> the server : javax.security.sasl.SaslException: GSS initiate failed [Caused
> by GSSException: No valid credentials provided (Mechanism level: Failed to
> find any Kerberos tgt)]
> 17/10/25 06:37:46 INFO retry.RetryInvocationHandler: java.io.IOException:
> Failed on local exception: java.io.IOException:
> javax.security.sasl.SaslException: GSS initiate failed [Caused by
> GSSException: No valid credentials provided (Mechanism level: Failed to find
> any Kerberos tgt)]; Host Details : local host is:
> "host_redacted/IP_redacted"; destination host is:
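
In the quoted log, the SASL failure reaches the retry handler wrapped in two IOExceptions, and the client keeps failing over even though no other Resource Manager could accept the missing credentials. A minimal sketch of the fail-fast idea is shown below. It is an illustration only, not the code from the attached patches or from RetryPolicies.java; the class FailFastOnSaslSketch, the Decision enum, and the decide() helper are hypothetical names introduced here.

```java
import java.io.IOException;
import java.net.ConnectException;
import javax.security.sasl.SaslException;

// Hypothetical illustration; none of these names come from Hadoop.
class FailFastOnSaslSketch {

    enum Decision { FAIL, FAILOVER_AND_RETRY }

    // Walk the cause chain, because the log shows the SaslException
    // wrapped inside IOException("Failed on local exception: ...").
    static boolean hasSaslCause(Throwable t) {
        for (Throwable cur = t; cur != null; cur = cur.getCause()) {
            if (cur instanceof SaslException) {
                return true;
            }
        }
        return false;
    }

    // Assumed decision rule: an authentication failure will not be cured
    // by talking to another server, so give up instead of failing over.
    static Decision decide(Exception e, int failovers, int maxFailovers) {
        if (hasSaslCause(e)) {
            return Decision.FAIL;
        }
        if (failovers >= maxFailovers) {
            return Decision.FAIL;
        }
        if (e instanceof IOException) {
            // Covers ConnectException and other network-level errors.
            return Decision.FAILOVER_AND_RETRY;
        }
        return Decision.FAIL;
    }

    public static void main(String[] args) {
        Exception noTgt = new IOException("Failed on local exception",
                new IOException(new SaslException("GSS initiate failed")));
        Exception refused = new ConnectException("Connection refused");

        System.out.println("SASL failure:       " + decide(noTgt, 1, 15));
        System.out.println("Connection refused: " + decide(refused, 2, 15));
    }
}
```

The check is done on the cause chain rather than on the top-level exception type, since the top-level exception in the log is a plain java.io.IOException. The maxFailovers value of 15 in main() is an arbitrary placeholder.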
[jira] [Commented] (HADOOP-14982) Clients using FailoverOnNetworkExceptionRetry can go into a loop if they're used without authenticating with kerberos in HA env
    [ https://issues.apache.org/jira/browse/HADOOP-14982?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16688422#comment-16688422 ]

Steve Loughran commented on HADOOP-14982:
-----------------------------------------

I'm seeing this happen, or something similar today, on a 3.1+ branch.

{code}
2018-11-15 17:34:44,548 [main] INFO tools.DistCp (DistCp.java:run(144)) - Input Options: DistCpOptions{atomicCommit=false, syncFolder=false, deleteMissing=false, ignoreFailures=false, overwrite=false, append=false, useDiff=false, useRdiff=false, fromSnapshot=null, toSnapshot=null, skipCRC=false, blocking=true, numListstatusThreads=0, maxMaps=20, mapBandwidth=0.0, copyStrategy='uniformsize', preserveStatus=[BLOCKSIZE], atomicWorkPath=null, logPath=null, sourceFileListing=null, sourcePaths=[s3a://hwdev-steve-ireland-new/examples], targetPath=s3a://hwdev-steve-new/dest, filtersFile='null', blocksPerChunk=0, copyBufferSize=8192, verboseLog=false}, sourcePaths=[s3a://hwdev-steve-ireland-new/examples], targetPathExists=true, preserveRawXattrsfalse
2018-11-15 17:34:45,039 [main] INFO client.AHSProxy (AHSProxy.java:createAHSProxy(42)) - Connecting to Application History server at host-03/172.27.20.152:10200
2018-11-15 17:34:45,716 [main] WARN shortcircuit.DomainSocketFactory (DomainSocketFactory.java:(116)) - The short-circuit local reads feature cannot be used because libhadoop cannot be loaded.
2018-11-15 17:34:46,138 [main] WARN ipc.Client (Client.java:run(752)) - Exception encountered while connecting to the server : org.apache.hadoop.security.AccessControlException: Client cannot authenticate via:[TOKEN, KERBEROS]
2018-11-15 17:34:46,439 [main] WARN ipc.Client (Client.java:run(752)) - Exception encountered while connecting to the server : org.apache.hadoop.security.AccessControlException: Client cannot authenticate via:[TOKEN, KERBEROS]
2018-11-15 17:34:46,442 [main] INFO retry.RetryInvocationHandler (RetryInvocationHandler.java:log(411)) - java.io.IOException: DestHost:destPort host-04:8020 , LocalHost:localPort HW13176.local/192.168.99.1:0. Failed on local exception: java.io.IOException: org.apache.hadoop.security.AccessControlException: Client cannot authenticate via:[TOKEN, KERBEROS], while invoking ClientNamenodeProtocolTranslatorPB.getFileInfo over host-04/172.27.18.67:8020 after 1 failover attempts. Trying to failover after sleeping for 900ms.
2018-11-15 17:34:47,637 [main] WARN ipc.Client (Client.java:run(752)) - Exception encountered while connecting to the server : org.apache.hadoop.security.AccessControlException: Client cannot authenticate via:[TOKEN, KERBEROS]
2018-11-15 17:34:47,638 [main] INFO retry.RetryInvocationHandler (RetryInvocationHandler.java:log(411)) - java.io.IOException: DestHost:destPort host-03:8020 , LocalHost:localPort HW13176.local/192.168.99.1:0. Failed on local exception: java.io.IOException: org.apache.hadoop.security.AccessControlException: Client cannot authenticate via:[TOKEN, KERBEROS], while invoking ClientNamenodeProtocolTranslatorPB.getFileInfo over host-03/172.27.20.152:8020 after 2 failover attempts. Trying to failover after sleeping for 1096ms.
2018-11-15 17:34:49,033 [main] WARN ipc.Client (Client.java:run(752)) - Exception encountered while connecting to the server : org.apache.hadoop.security.AccessControlException: Client cannot authenticate via:[TOKEN, KERBEROS]
2018-11-15 17:34:49,034 [main] INFO retry.RetryInvocationHandler (RetryInvocationHandler.java:log(411)) - java.io.IOException: DestHost:destPort host-04:8020 , LocalHost:localPort HW13176.local/192.168.99.1:0. Failed on local exception: java.io.IOException: org.apache.hadoop.security.AccessControlException: Client cannot authenticate via:[TOKEN, KERBEROS], while invoking ClientNamenodeProtocolTranslatorPB.getFileInfo over host-04/172.27.18.67:8020 after 3 failover attempts. Trying to failover after sleeping for 5110ms.
2018-11-15 17:34:54,437 [main] WARN ipc.Client (Client.java:run(752)) - Exception encountered while connecting to the server : org.apache.hadoop.security.AccessControlException: Client cannot authenticate via:[TOKEN, KERBEROS]
2018-11-15 17:34:54,440 [main] INFO retry.RetryInvocationHandler (RetryInvocationHandler.java:log(411)) - java.io.IOException: DestHost:destPort host-03:8020 , LocalHost:localPort HW13176.local/192.168.99.1:0. Failed on local exception: java.io.IOException: org.apache.hadoop.security.AccessControlException: Client cannot authenticate via:[TOKEN, KERBEROS], while invoking ClientNamenodeProtocolTranslatorPB.getFileInfo over host-03/172.27.20.152:8020 after 4 failover attempts. Trying to failover after sleeping for 8056ms.
^C^C^C^C^C^C^C^C^C^C^C^C2018-11-15 17:35:02,922 [main] WARN ipc.Client (Client.java:run(752)) - Exception encountered while connecting to the server : org.apache.hadoop.security.AccessControlException: Client cannot
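
The sleep times in the log above (900ms, 1096ms, 5110ms, 8056ms) grow roughly geometrically with the failover count, which is why the client appears to hang for longer and longer instead of exiting. The sketch below assumes a capped exponential backoff with a random factor in [0.5, 1.5). The formula, the 500ms base, and the 15000ms cap are assumptions chosen because they are consistent with the logged values; they are not read from the Hadoop source or from this cluster's configuration, and BackoffSketch is a hypothetical class.

```java
import java.util.concurrent.ThreadLocalRandom;

// Hypothetical illustration of capped exponential backoff with jitter.
class BackoffSketch {

    // base * 2^failovers, limited by the cap. The shift is clamped so the
    // multiplication cannot overflow for very large failover counts.
    static long cappedBase(long baseMillis, int failovers, long capMillis) {
        int shift = Math.min(failovers, 40);
        return Math.min(baseMillis * (1L << shift), capMillis);
    }

    // Smallest sleep the assumed formula can produce (inclusive).
    static long minSleep(long baseMillis, int failovers, long capMillis) {
        return (long) (cappedBase(baseMillis, failovers, capMillis) * 0.5);
    }

    // Upper bound of the sleep the assumed formula can produce (exclusive).
    static long maxSleep(long baseMillis, int failovers, long capMillis) {
        return (long) (cappedBase(baseMillis, failovers, capMillis) * 1.5);
    }

    // One random draw from the assumed distribution.
    static long sleepMillis(long baseMillis, int failovers, long capMillis) {
        double jitter = ThreadLocalRandom.current().nextDouble() + 0.5;
        return (long) (cappedBase(baseMillis, failovers, capMillis) * jitter);
    }

    // True if a logged sleep lies inside the range the formula allows.
    static boolean consistent(long observed, long baseMillis, int failovers,
                              long capMillis) {
        return observed >= minSleep(baseMillis, failovers, capMillis)
                && observed < maxSleep(baseMillis, failovers, capMillis);
    }

    public static void main(String[] args) {
        long base = 500;    // assumed base sleep in milliseconds
        long cap = 15000;   // assumed maximum base sleep in milliseconds
        long[] logged = {900, 1096, 5110, 8056};  // from the log above
        for (int i = 0; i < logged.length; i++) {
            int failovers = i + 1;
            System.out.println("failover " + failovers
                    + ": logged " + logged[i] + "ms, allowed ["
                    + minSleep(base, failovers, cap) + ", "
                    + maxSleep(base, failovers, cap) + ") consistent="
                    + consistent(logged[i], base, failovers, cap));
        }
    }
}
```

Under these assumptions the waits stop growing once the cap is reached, but the loop itself only ends when a failover limit is hit or the exception is treated as non-retriable.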
[jira] [Commented] (HADOOP-14982) Clients using FailoverOnNetworkExceptionRetry can go into a loop if they're used without authenticating with kerberos in HA env
    [ https://issues.apache.org/jira/browse/HADOOP-14982?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16255866#comment-16255866 ]

Hudson commented on HADOOP-14982:
---------------------------------

SUCCESS: Integrated in Jenkins build Hadoop-trunk-Commit #13249 (See [https://builds.apache.org/job/Hadoop-trunk-Commit/13249/])
HADOOP-14982. Clients using FailoverOnNetworkExceptionRetry can go into (rkanter: rev f2efaf013f7577948061abbb49c6d17c375e92cc)
* (edit) hadoop-common-project/hadoop-common/src/test/java/org/apache/hadoop/io/retry/UnreliableImplementation.java
* (edit) hadoop-common-project/hadoop-common/src/test/java/org/apache/hadoop/io/retry/TestRetryProxy.java
* (edit) hadoop-common-project/hadoop-common/src/main/java/org/apache/hadoop/io/retry/RetryPolicies.java
* (edit) hadoop-common-project/hadoop-common/src/test/java/org/apache/hadoop/io/retry/UnreliableInterface.java

> Clients using FailoverOnNetworkExceptionRetry can go into a loop if they're
> used without authenticating with kerberos in HA env
> ---
>
>                 Key: HADOOP-14982
>                 URL: https://issues.apache.org/jira/browse/HADOOP-14982
>             Project: Hadoop Common
>          Issue Type: Bug
>          Components: common
>            Reporter: Peter Bacsko
>            Assignee: Peter Bacsko
>             Fix For: 3.1.0, 2.10.0
>
>         Attachments: HADOOP-14892-001.patch, HADOOP-14892-002.patch,
> HADOOP-14982-003.patch
>
>
> If HA is configured for the Resource Manager in a secure environment, using
> the mapred client goes into a loop if the user is not authenticated with
> Kerberos.
[jira] [Commented] (HADOOP-14982) Clients using FailoverOnNetworkExceptionRetry can go into a loop if they're used without authenticating with kerberos in HA env
    [ https://issues.apache.org/jira/browse/HADOOP-14982?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16246503#comment-16246503 ]

Robert Kanter commented on HADOOP-14982:
----------------------------------------

LGTM +1

Any other comments [~daryn]?

> Clients using FailoverOnNetworkExceptionRetry can go into a loop if they're
> used without authenticating with kerberos in HA env
> ---
>
>                 Key: HADOOP-14982
>                 URL: https://issues.apache.org/jira/browse/HADOOP-14982
>             Project: Hadoop Common
>          Issue Type: Bug
>          Components: common
>            Reporter: Peter Bacsko
>            Assignee: Peter Bacsko
>         Attachments: HADOOP-14892-001.patch, HADOOP-14892-002.patch,
> HADOOP-14982-003.patch
>
>
> If HA is configured for the Resource Manager in a secure environment, using
> the mapred client goes into a loop if the user is not authenticated with
> Kerberos.
[jira] [Commented] (HADOOP-14982) Clients using FailoverOnNetworkExceptionRetry can go into a loop if they're used without authenticating with kerberos in HA env
    [ https://issues.apache.org/jira/browse/HADOOP-14982?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16226951#comment-16226951 ]

Hadoop QA commented on HADOOP-14982:
------------------------------------

| (/) *+1 overall* |

|| Vote || Subsystem || Runtime || Comment ||
| 0 | reexec | 15m 49s | Docker mode activated. |
|| || || || Prechecks ||
| +1 | @author | 0m 0s | The patch does not contain any @author tags. |
| +1 | test4tests | 0m 0s | The patch appears to include 3 new or modified test files. |
|| || || || trunk Compile Tests ||
| +1 | mvninstall | 17m 29s | trunk passed |
| +1 | compile | 12m 36s | trunk passed |
| +1 | checkstyle | 0m 38s | trunk passed |
| +1 | mvnsite | 1m 11s | trunk passed |
| +1 | shadedclient | 11m 36s | branch has no errors when building and testing our client artifacts. |
| +1 | findbugs | 1m 26s | trunk passed |
| +1 | javadoc | 0m 59s | trunk passed |
|| || || || Patch Compile Tests ||
| +1 | mvninstall | 0m 49s | the patch passed |
| +1 | compile | 11m 19s | the patch passed |
| +1 | javac | 11m 19s | the patch passed |
| -0 | checkstyle | 0m 35s | hadoop-common-project/hadoop-common: The patch generated 12 new + 104 unchanged - 0 fixed = 116 total (was 104) |
| +1 | mvnsite | 0m 58s | the patch passed |
| +1 | whitespace | 0m 0s | The patch has no whitespace issues. |
| +1 | shadedclient | 9m 59s | patch has no errors when building and testing our client artifacts. |
| +1 | findbugs | 1m 39s | the patch passed |
| +1 | javadoc | 0m 51s | the patch passed |
|| || || || Other Tests ||
| +1 | unit | 8m 26s | hadoop-common in the patch passed. |
| +1 | asflicense | 0m 28s | The patch does not generate ASF License warnings. |
| | | 96m 22s | |

|| Subsystem || Report/Notes ||
| Docker | Client=17.05.0-ce Server=17.05.0-ce Image:yetus/hadoop:5b98639 |
| JIRA Issue | HADOOP-14982 |
| JIRA Patch URL | https://issues.apache.org/jira/secure/attachment/12894978/HADOOP-14982-003.patch |
| Optional Tests | asflicense compile javac javadoc mvninstall mvnsite unit shadedclient findbugs checkstyle |
| uname | Linux b3ad061a5702 3.13.0-119-generic #166-Ubuntu SMP Wed May 3 12:18:55 UTC 2017 x86_64 x86_64 x86_64 GNU/Linux |
| Build tool | maven |
| Personality | /testptch/patchprocess/precommit/personality/provided.sh |
| git revision | trunk / c02d2ba |
| maven | version: Apache Maven 3.3.9 |
| Default Java | 1.8.0_131 |
| findbugs | v3.1.0-RC1 |
| checkstyle | https://builds.apache.org/job/PreCommit-HADOOP-Build/13605/artifact/out/diff-checkstyle-hadoop-common-project_hadoop-common.txt |
| Test Results | https://builds.apache.org/job/PreCommit-HADOOP-Build/13605/testReport/ |
| modules | C: hadoop-common-project/hadoop-common U: hadoop-common-project/hadoop-common |
| Console output | https://builds.apache.org/job/PreCommit-HADOOP-Build/13605/console |
| Powered by | Apache Yetus 0.7.0-SNAPSHOT http://yetus.apache.org |

This message
[jira] [Commented] (HADOOP-14982) Clients using FailoverOnNetworkExceptionRetry can go into a loop if they're used without authenticating with kerberos in HA env
    [ https://issues.apache.org/jira/browse/HADOOP-14982?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16226913#comment-16226913 ]

Hadoop QA commented on HADOOP-14982:
------------------------------------

| (x) *-1 overall* |

|| Vote || Subsystem || Runtime || Comment ||
| 0 | reexec | 0m 10s | Docker mode activated. |
|| || || || Prechecks ||
| +1 | @author | 0m 0s | The patch does not contain any @author tags. |
| +1 | test4tests | 0m 0s | The patch appears to include 3 new or modified test files. |
|| || || || trunk Compile Tests ||
| +1 | mvninstall | 17m 30s | trunk passed |
| +1 | compile | 13m 37s | trunk passed |
| +1 | checkstyle | 0m 39s | trunk passed |
| +1 | mvnsite | 1m 14s | trunk passed |
| +1 | shadedclient | 11m 42s | branch has no errors when building and testing our client artifacts. |
| +1 | findbugs | 1m 22s | trunk passed |
| +1 | javadoc | 0m 50s | trunk passed |
|| || || || Patch Compile Tests ||
| +1 | mvninstall | 0m 41s | the patch passed |
| +1 | compile | 10m 42s | the patch passed |
| +1 | javac | 10m 42s | the patch passed |
| -0 | checkstyle | 0m 37s | hadoop-common-project/hadoop-common: The patch generated 12 new + 104 unchanged - 0 fixed = 116 total (was 104) |
| +1 | mvnsite | 0m 58s | the patch passed |
| +1 | whitespace | 0m 0s | The patch has no whitespace issues. |
| +1 | shadedclient | 9m 52s | patch has no errors when building and testing our client artifacts. |
| +1 | findbugs | 1m 40s | the patch passed |
| +1 | javadoc | 0m 57s | the patch passed |
|| || || || Other Tests ||
| -1 | unit | 8m 3s | hadoop-common in the patch failed. |
| +1 | asflicense | 0m 29s | The patch does not generate ASF License warnings. |
| | | 80m 32s | |

|| Reason || Tests ||
| Failed junit tests | hadoop.ha.TestZKFailoverController |

|| Subsystem || Report/Notes ||
| Docker | Client=17.05.0-ce Server=17.05.0-ce Image:yetus/hadoop:5b98639 |
| JIRA Issue | HADOOP-14982 |
| JIRA Patch URL | https://issues.apache.org/jira/secure/attachment/12894978/HADOOP-14982-003.patch |
| Optional Tests | asflicense compile javac javadoc mvninstall mvnsite unit shadedclient findbugs checkstyle |
| uname | Linux 7b1811e3b477 3.13.0-117-generic #164-Ubuntu SMP Fri Apr 7 11:05:26 UTC 2017 x86_64 x86_64 x86_64 GNU/Linux |
| Build tool | maven |
| Personality | /testptch/patchprocess/precommit/personality/provided.sh |
| git revision | trunk / c02d2ba |
| maven | version: Apache Maven 3.3.9 |
| Default Java | 1.8.0_131 |
| findbugs | v3.1.0-RC1 |
| checkstyle | https://builds.apache.org/job/PreCommit-HADOOP-Build/13604/artifact/out/diff-checkstyle-hadoop-common-project_hadoop-common.txt |
| unit | https://builds.apache.org/job/PreCommit-HADOOP-Build/13604/artifact/out/patch-unit-hadoop-common-project_hadoop-common.txt |
| Test Results | https://builds.apache.org/job/PreCommit-HADOOP-Build/13604/testReport/ |
| modules | C: hadoop-common-project/hadoop-common U:
[jira] [Commented] (HADOOP-14982) Clients using FailoverOnNetworkExceptionRetry can go into a loop if they're used without authenticating with kerberos in HA env
    [ https://issues.apache.org/jira/browse/HADOOP-14982?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16226774#comment-16226774 ]

Hadoop QA commented on HADOOP-14982:
------------------------------------

| (x) *-1 overall* |

|| Vote || Subsystem || Runtime || Comment ||
| 0 | reexec | 0m 0s | Docker mode activated. |
| -1 | docker | 5m 59s | Docker failed to build yetus/hadoop:5b98639. |

|| Subsystem || Report/Notes ||
| JIRA Issue | HADOOP-14982 |
| JIRA Patch URL | https://issues.apache.org/jira/secure/attachment/12894975/HADOOP-14892-002.patch |
| Console output | https://builds.apache.org/job/PreCommit-HADOOP-Build/13603/console |
| Powered by | Apache Yetus 0.7.0-SNAPSHOT http://yetus.apache.org |

This message was automatically generated.

> Clients using FailoverOnNetworkExceptionRetry can go into a loop if they're
> used without authenticating with kerberos in HA env
> ---
>
>                 Key: HADOOP-14982
>                 URL: https://issues.apache.org/jira/browse/HADOOP-14982
>             Project: Hadoop Common
>          Issue Type: Bug
>          Components: common
>            Reporter: Peter Bacsko
>            Assignee: Peter Bacsko
>         Attachments: HADOOP-14892-001.patch, HADOOP-14892-002.patch
>
>
> If HA is configured for the Resource Manager in a secure environment, using
> the mapred client goes into a loop if the user is not authenticated with
> Kerberos.
[jira] [Commented] (HADOOP-14982) Clients using FailoverOnNetworkExceptionRetry can go into a loop if they're used without authenticating with kerberos in HA env
[ https://issues.apache.org/jira/browse/HADOOP-14982?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=1657#comment-1657 ]

Peter Bacsko commented on HADOOP-14982:
---
[~daryn] how do you get 1011 lines of output? I set the logging level to DEBUG and even in that case it's only 215 lines (in case of Hadoop 3).
[jira] [Commented] (HADOOP-14982) Clients using FailoverOnNetworkExceptionRetry can go into a loop if they're used without authenticating with kerberos in HA env
[ https://issues.apache.org/jira/browse/HADOOP-14982?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16220972#comment-16220972 ]

Hadoop QA commented on HADOOP-14982:

| (/) *{color:green}+1 overall{color}* |
\\
\\
|| Vote || Subsystem || Runtime || Comment ||
| {color:blue}0{color} | {color:blue} reexec {color} | {color:blue} 4m 32s{color} | {color:blue} Docker mode activated. {color} |
|| || || || {color:brown} Prechecks {color} ||
| {color:green}+1{color} | {color:green} @author {color} | {color:green} 0m 0s{color} | {color:green} The patch does not contain any @author tags. {color} |
| {color:green}+1{color} | {color:green} test4tests {color} | {color:green} 0m 0s{color} | {color:green} The patch appears to include 3 new or modified test files. {color} |
|| || || || {color:brown} trunk Compile Tests {color} ||
| {color:green}+1{color} | {color:green} mvninstall {color} | {color:green} 16m 51s{color} | {color:green} trunk passed {color} |
| {color:green}+1{color} | {color:green} compile {color} | {color:green} 20m 4s{color} | {color:green} trunk passed {color} |
| {color:green}+1{color} | {color:green} checkstyle {color} | {color:green} 0m 41s{color} | {color:green} trunk passed {color} |
| {color:green}+1{color} | {color:green} mvnsite {color} | {color:green} 1m 25s{color} | {color:green} trunk passed {color} |
| {color:green}+1{color} | {color:green} shadedclient {color} | {color:green} 11m 59s{color} | {color:green} branch has no errors when building and testing our client artifacts. {color} |
| {color:green}+1{color} | {color:green} findbugs {color} | {color:green} 1m 37s{color} | {color:green} trunk passed {color} |
| {color:green}+1{color} | {color:green} javadoc {color} | {color:green} 0m 53s{color} | {color:green} trunk passed {color} |
|| || || || {color:brown} Patch Compile Tests {color} ||
| {color:green}+1{color} | {color:green} mvninstall {color} | {color:green} 0m 43s{color} | {color:green} the patch passed {color} |
| {color:green}+1{color} | {color:green} compile {color} | {color:green} 15m 5s{color} | {color:green} the patch passed {color} |
| {color:green}+1{color} | {color:green} javac {color} | {color:green} 15m 5s{color} | {color:green} the patch passed {color} |
| {color:orange}-0{color} | {color:orange} checkstyle {color} | {color:orange} 0m 40s{color} | {color:orange} hadoop-common-project/hadoop-common: The patch generated 15 new + 104 unchanged - 0 fixed = 119 total (was 104) {color} |
| {color:green}+1{color} | {color:green} mvnsite {color} | {color:green} 1m 6s{color} | {color:green} the patch passed {color} |
| {color:green}+1{color} | {color:green} whitespace {color} | {color:green} 0m 0s{color} | {color:green} The patch has no whitespace issues. {color} |
| {color:green}+1{color} | {color:green} shadedclient {color} | {color:green} 8m 48s{color} | {color:green} patch has no errors when building and testing our client artifacts. {color} |
| {color:green}+1{color} | {color:green} findbugs {color} | {color:green} 1m 37s{color} | {color:green} the patch passed {color} |
| {color:green}+1{color} | {color:green} javadoc {color} | {color:green} 0m 53s{color} | {color:green} the patch passed {color} |
|| || || || {color:brown} Other Tests {color} ||
| {color:green}+1{color} | {color:green} unit {color} | {color:green} 7m 48s{color} | {color:green} hadoop-common in the patch passed. {color} |
| {color:green}+1{color} | {color:green} asflicense {color} | {color:green} 0m 33s{color} | {color:green} The patch does not generate ASF License warnings. {color} |
| {color:black}{color} | {color:black} {color} | {color:black} 95m 29s{color} | {color:black} {color} |
\\
\\
|| Subsystem || Report/Notes ||
| Docker | Image:yetus/hadoop:5b98639 |
| JIRA Issue | HADOOP-14982 |
| JIRA Patch URL | https://issues.apache.org/jira/secure/attachment/12894144/HADOOP-14892-001.patch |
| Optional Tests | asflicense compile javac javadoc mvninstall mvnsite unit shadedclient findbugs checkstyle |
| uname | Linux 04ab0b9d8043 3.13.0-129-generic #178-Ubuntu SMP Fri Aug 11 12:48:20 UTC 2017 x86_64 x86_64 x86_64 GNU/Linux |
| Build tool | maven |
| Personality | /testptch/hadoop/patchprocess/precommit/personality/provided.sh |
| git revision | trunk / 625039e |
| maven | version: Apache Maven 3.3.9 |
| Default Java | 1.8.0_131 |
| findbugs | v3.1.0-RC1 |
| checkstyle | https://builds.apache.org/job/PreCommit-HADOOP-Build/13582/artifact/patchprocess/diff-checkstyle-hadoop-common-project_hadoop-common.txt |
| Test Results | https://builds.apache.org/job/PreCommit-HADOOP-Build/13582/testReport/ |
| modules | C: hadoop-common-project/hadoop-common U: hadoop-common-project/hadoop-common |
| Console output | https://builds.apache.org/job/PreCommit-HADOOP-Build/13582/console |
| Powered by | Apache Yetus 0.7.0-SNAPSHOT http://yetus.apache.org |

This message was automatically
[jira] [Commented] (HADOOP-14982) Clients using FailoverOnNetworkExceptionRetry can go into a loop if they're used without authenticating with kerberos in HA env
[ https://issues.apache.org/jira/browse/HADOOP-14982?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16220762#comment-16220762 ]

Peter Bacsko commented on HADOOP-14982:
---
Thanks [~daryn], will modify the patch accordingly.
[jira] [Commented] (HADOOP-14982) Clients using FailoverOnNetworkExceptionRetry can go into a loop if they're used without authenticating with kerberos in HA env
[ https://issues.apache.org/jira/browse/HADOOP-14982?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16220709#comment-16220709 ]

Daryn Sharp commented on HADOOP-14982:
--
The patch should eliminate this disaster:
{noformat}
$ hadoop fs -ls / 2>&1 | wc -l
1011
{noformat}
_1011 lines of output for a fatal condition???_
[jira] [Commented] (HADOOP-14982) Clients using FailoverOnNetworkExceptionRetry can go into a loop if they're used without authenticating with kerberos in HA env
[ https://issues.apache.org/jira/browse/HADOOP-14982?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16220705#comment-16220705 ]

Daryn Sharp commented on HADOOP-14982:
--
Unnecessary retries on authentication failure have been a general issue, (re)introduced around 2.7 or so, that I fixed long ago and haven't had time to re-fix. The IPC layer itself has already attempted retries for SASL exceptions. If one bubbles out, it's game over. Don't make this Kerberos-specific at all. Just make SaslException fatal.
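
The suggestion in this comment (treat any SaslException that escapes the IPC layer as fatal instead of failing over again) could be sketched roughly as below. This is only an illustration under stated assumptions, not the attached patch: the class `SaslFatalRetrySketch`, the `Decision` enum, and the `decide` method with its `maxFailovers` parameter are hypothetical names and do not correspond to Hadoop's retry policy API. The only real types used are the JDK exception classes.

```java
import java.io.IOException;
import java.net.ConnectException;
import javax.security.sasl.SaslException;

public class SaslFatalRetrySketch {

    /** Hypothetical decision type, invented for this sketch. */
    public enum Decision { FAIL, FAILOVER_AND_RETRY }

    /** Returns true if the exception or any of its causes is a SaslException. */
    public static boolean hasSaslCause(Throwable t) {
        // Bound the walk so a cyclic cause chain cannot loop forever.
        for (int depth = 0; t != null && depth < 32; depth++, t = t.getCause()) {
            if (t instanceof SaslException) {
                return true;
            }
        }
        return false;
    }

    /** Decides whether a failed call should fail over or give up. */
    public static Decision decide(Exception e, int failovers, int maxFailovers) {
        if (hasSaslCause(e)) {
            // Authentication already failed below this layer: retrying cannot help.
            return Decision.FAIL;
        }
        if (failovers >= maxFailovers) {
            return Decision.FAIL;
        }
        // Assumption: other I/O errors (e.g. connection refused) are worth a failover.
        return (e instanceof IOException) ? Decision.FAILOVER_AND_RETRY : Decision.FAIL;
    }

    public static void main(String[] args) {
        // Mirrors the nesting seen in the log: IOException -> IOException -> SaslException.
        Exception sasl = new IOException("Failed on local exception",
                new IOException(new SaslException("GSS initiate failed")));
        Exception refused = new ConnectException("Connection refused");

        System.out.println(decide(sasl, 1, 15));    // prints FAIL
        System.out.println(decide(refused, 2, 15)); // prints FAILOVER_AND_RETRY
    }
}
```

Checking the whole cause chain rather than only the top-level exception matters here because, in the log above, the SaslException is wrapped in two IOExceptions by the time it reaches the retry handler.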