[
https://issues.apache.org/jira/browse/HDFS-5322?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13791504#comment-13791504
]
Daryn Sharp commented on HDFS-5322:
-----------------------------------
I don't like the {{SaslRpcServer}} becoming aware of {{StandbyException}} and
{{RetriableException}} - it violates the abstract of the rpc server from the
NN. These exceptions aren't relevant to every server.
Downgrading the visibility of
{{AbstractDelegationTokenSecretManager#getPassword}} to non-public also seems
like a problematic API change for other secret managers.
Again, the basic question driving this change is why
{{FSNamesystem#checkOperation(OperationCategory.WRITE)}} is not throwing during
a transition to active? The namespace is _not writable_ until active so that
behavior is incorrect and should be fixed. There should not be a discrepancy
with how token and non-token connections (plain, kerberos) are handled.
Likewise other calls within the NN are being "lied" to about the state of the
namespace.
> HDFS delegation token not found in cache errors seen on secure HA clusters
> --------------------------------------------------------------------------
>
> Key: HDFS-5322
> URL: https://issues.apache.org/jira/browse/HDFS-5322
> Project: Hadoop HDFS
> Issue Type: Bug
> Components: ha
> Affects Versions: 2.1.1-beta
> Reporter: Arpit Gupta
> Assignee: Jing Zhao
> Attachments: HDFS-5322.000.patch, HDFS-5322.000.patch,
> HDFS-5322.001.patch, HDFS-5322.002.patch, HDFS-5322.003.patch,
> HDFS-5322.004.patch
>
>
> While running HA tests we have seen issues were we see HDFS delegation token
> not found in cache errors causing jobs running to fail.
> {code}
> at
> org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1491)
> at org.apache.hadoop.mapred.YarnChild.main(YarnChild.java:157)
> |2013-10-06 20:14:51,193 INFO [main] mapreduce.Job: Task Id :
> attempt_1381090351344_0001_m_000007_0, Status : FAILED
> Error:
> org.apache.hadoop.ipc.RemoteException(org.apache.hadoop.security.token.SecretManager$InvalidToken):
> token (HDFS_DELEGATION_TOKEN token 11 for hrt_qa) can't be found in cache
> at org.apache.hadoop.ipc.Client.call(Client.java:1347)
> at org.apache.hadoop.ipc.Client.call(Client.java:1300)
> at
> org.apache.hadoop.ipc.ProtobufRpcEngine$Invoker.invoke(ProtobufRpcEngine.java:206)
> at com.sun.proxy.$Proxy10.getBlockLocations(Unknown Source)
> {code}
--
This message was sent by Atlassian JIRA
(v6.1#6144)