[ 
https://issues.apache.org/jira/browse/HDFS-14134?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Lukas Majercak updated HDFS-14134:
----------------------------------
    Description: 
Currently, some operations that throw IOException on the NameNode are evaluated 
by RetryPolicy as FAILOVER_AND_RETRY, but they should just fail fast.

For example, when calling getXAttr("user.some_attr", file") where file does not 
have the attribute, NN throws an IOException with message "could not find 
attr". The current client retry policy determines the action for that to be 
FAILOVER_AND_RETRY. The client then fails over and retries until it reaches the 
maximum number of retries. Supposedly, the client should be able to tell that 
this exception is normal and fail fast. 

Moreover, even if the action was FAIL, the RetryInvocationHandler looks at all 
the retry actions from all requests, and FAILOVER_AND_RETRY takes precedence 
over FAIL action.

> Idempotent operations throwing RemoteException should not be retried by the 
> client
> ----------------------------------------------------------------------------------
>
>                 Key: HDFS-14134
>                 URL: https://issues.apache.org/jira/browse/HDFS-14134
>             Project: Hadoop HDFS
>          Issue Type: Bug
>          Components: hdfs, hdfs-client, ipc
>            Reporter: Lukas Majercak
>            Assignee: Lukas Majercak
>            Priority: Critical
>
> Currently, some operations that throw IOException on the NameNode are 
> evaluated by RetryPolicy as FAILOVER_AND_RETRY, but they should just fail 
> fast.
> For example, when calling getXAttr("user.some_attr", file") where file does 
> not have the attribute, NN throws an IOException with message "could not find 
> attr". The current client retry policy determines the action for that to be 
> FAILOVER_AND_RETRY. The client then fails over and retries until it reaches 
> the maximum number of retries. Supposedly, the client should be able to tell 
> that this exception is normal and fail fast. 
> Moreover, even if the action was FAIL, the RetryInvocationHandler looks at 
> all the retry actions from all requests, and FAILOVER_AND_RETRY takes 
> precedence over FAIL action.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to