[ https://issues.apache.org/jira/browse/HDFS-15900?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Harunobu Daikoku updated HDFS-15900:
------------------------------------
Description:
We observed that when a NameNode becomes UNAVAILABLE, the corresponding
blockpool id in MembershipStoreImpl#activeNamespaces on dfsrouter is
unintentionally reset to empty, its initial value.
!image.png|height=250!
As a result, concat operations through dfsrouter fail with the following
error, because the router cannot resolve the block pool id among the
recognized active namespaces.
{noformat}
Caused by:
org.apache.hadoop.ipc.RemoteException(org.apache.hadoop.ipc.RemoteException):
Cannot locate a nameservice for block pool BP-...
{noformat}
A possible fix is to ignore UNAVAILABLE NameNode registrations, and set proper
namespace information obtained from available NameNode registrations when
constructing the cache of active namespaces.
[https://github.com/apache/hadoop/blob/rel/release-3.3.0/hadoop-hdfs-project/hadoop-hdfs-rbf/src/main/java/org/apache/hadoop/hdfs/server/federation/store/impl/MembershipStoreImpl.java#L207-L221]
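The proposed fix can be illustrated with a minimal, self-contained sketch. It is not the actual MembershipStoreImpl code: the class ActiveNamespaceCacheSketch, the Registration record, the State enum, and the block pool id "BP-example-1" are all hypothetical stand-ins, and the cache is simplified to a map from nameservice id to block pool id. The sketch assumes that an UNAVAILABLE registration carries an empty block pool id, as described above.

```java
import java.util.Arrays;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

/** Illustrative sketch only; these types are hypothetical, not the Hadoop RBF classes. */
final class ActiveNamespaceCacheSketch {

  /** Hypothetical subset of NameNode registration states. */
  enum State { ACTIVE, STANDBY, UNAVAILABLE }

  /** Hypothetical, simplified NameNode registration record. */
  static final class Registration {
    final String nameserviceId;
    final String blockPoolId;
    final State state;

    Registration(String nameserviceId, String blockPoolId, State state) {
      this.nameserviceId = nameserviceId;
      this.blockPoolId = blockPoolId;
      this.state = state;
    }
  }

  /**
   * Builds a nameservice id -> block pool id map from the given registrations.
   * UNAVAILABLE registrations are skipped, so their empty block pool id can
   * never replace the value reported by an available NameNode.
   */
  static Map<String, String> buildActiveNamespaces(List<Registration> registrations) {
    Map<String, String> namespaces = new LinkedHashMap<>();
    for (Registration reg : registrations) {
      if (reg.state == State.UNAVAILABLE) {
        continue; // ignore registrations that carry no usable namespace info
      }
      // Keep the first value seen from an available registration.
      namespaces.putIfAbsent(reg.nameserviceId, reg.blockPoolId);
    }
    return namespaces;
  }

  public static void main(String[] args) {
    List<Registration> registrations = Arrays.asList(
        new Registration("ns1", "", State.UNAVAILABLE),
        new Registration("ns1", "BP-example-1", State.STANDBY));
    System.out.println(buildActiveNamespaces(registrations));
  }
}
```

In this sketch, a nameservice whose registrations are all UNAVAILABLE does not appear in the map at all. Whether the real cache should drop such a nameservice or keep its last known value is a design decision for the actual patch.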
was:
We observed that when a NameNode becomes UNAVAILABLE, the corresponding
blockpool id in MembershipStoreImpl#activeNamespaces on dfsrouter is
unintentionally reset to empty, its initial value.
!image.png|height=250!
As a result, concat operations through dfsrouter fail with the following
error, because the router cannot resolve the block pool id among the
recognized active namespaces.
{noformat}
Caused by:
org.apache.hadoop.ipc.RemoteException(org.apache.hadoop.ipc.RemoteException):
Cannot locate a nameservice for block pool BP-...
{noformat}
A possible fix is to ignore UNAVAILABLE NameNode registrations, and set proper
namespace information obtained from available NameNodes when constructing the
cache of active namespaces.
[https://github.com/apache/hadoop/blob/rel/release-3.3.0/hadoop-hdfs-project/hadoop-hdfs-rbf/src/main/java/org/apache/hadoop/hdfs/server/federation/store/impl/MembershipStoreImpl.java#L207-L221]
> RBF: empty blockpool id on dfsrouter caused by UNAVAILABLE NameNode
> -------------------------------------------------------------------
>
> Key: HDFS-15900
> URL: https://issues.apache.org/jira/browse/HDFS-15900
> Project: Hadoop HDFS
> Issue Type: Bug
> Components: rbf
> Affects Versions: 3.3.0
> Reporter: Harunobu Daikoku
> Priority: Minor
> Attachments: image.png