[ 
https://issues.apache.org/jira/browse/MAPREDUCE-3007?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13105186#comment-13105186
 ] 

Vinod Kumar Vavilapalli commented on MAPREDUCE-3007:
----------------------------------------------------

bq. [..] I started getting some other error on getTaskCompletions(). I think 
thats more related to the handshake for history file transfer not getting 
completed becasue of immediate AppMaster shutdown.
Yes, that is MAPREDUCE-3006.

bq. In all, the patch looks good with minor changes to removing the conf 
settings mentioned above.
The conf setting is used in so many other places too. I filed MAPREDUCE-3013 
for removing it.

bq. The only issue with HS is that both the AM and HS implement the same 
interface (MRClientProtocol) one of which is token based and the other kerberos 
info based (the fact that getKerberos in MRClientSecurityInfo returns null 
saves the day).
Yes. This is the reason why I think the configuration way is better, you can 
set the authentication mechanism per implementation of protocol instead of 
per-protocol.

bq. I tried out the patch on a cluster. Had to remove the sleep in AppMaster to 
reproduce the issue. The patch worked for me [..]
Thanks Mahadev!

> JobClient cannot talk to JobHistory server in secure mode
> ---------------------------------------------------------
>
>                 Key: MAPREDUCE-3007
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3007
>             Project: Hadoop Map/Reduce
>          Issue Type: Bug
>          Components: jobhistoryserver, mrv2
>    Affects Versions: 0.23.0
>            Reporter: Vinod Kumar Vavilapalli
>            Assignee: Vinod Kumar Vavilapalli
>             Fix For: 0.23.0
>
>         Attachments: MAPREDUCE-3007-20110914.2.txt, 
> MAPREDUCE-3007-20110914.txt
>
>
> In secure mode, Jobclient cannot connect to HistoryServer. Thanks to 
> [~karams] for finding this out.
> {code}
> 11/09/14 09:57:51 INFO mapred.ClientServiceDelegate: Application state is 
> completed. Redirecting to job history server
> 11/09/14 09:57:51 INFO security.ApplicationTokenSelector: Looking for a token 
> with service <history-server>:10020
> 11/09/14 09:57:51 INFO security.ApplicationTokenSelector: Token kind is 
> YARN_APPLICATION_TOKEN and the token's service name is <Am-ip>:46257
> 11/09/14 09:57:51 INFO security.UserGroupInformation: Initiating logout for 
> <user-principal>
> 11/09/14 09:57:51 INFO security.UserGroupInformation: Initiating re-login for 
> <user-principal>
> 11/09/14 09:57:55 WARN security.UserGroupInformation: Not attempting to 
> re-login since the last re-login was attempted less than 600 seconds before.
> 11/09/14 09:57:56 WARN security.UserGroupInformation: Not attempting to 
> re-login since the last re-login was attempted less than 600 seconds before.
> 11/09/14 09:58:00 WARN security.UserGroupInformation: Not attempting to 
> re-login since the last re-login was attempted less than 600 seconds before.
> 11/09/14 09:58:05 WARN security.UserGroupInformation: Not attempting to 
> re-login since the last re-login was attempted less than 600 seconds before.
> 11/09/14 09:58:05 WARN ipc.Client: Couldn't setup connection for 
> <user-principal> to null
> 11/09/14 09:58:05 INFO mapred.ClientServiceDelegate: Failed to contact 
> AM/History for job job_1315993268700_0001  Will retry..
> {code}
> Am surprised no one working with YARN+MR ever ran into this!

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Reply via email to