[ 
https://issues.apache.org/jira/browse/HDFS-15900?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17332434#comment-17332434
 ] 

Harunobu Daikoku commented on HDFS-15900:
-----------------------------------------

Hmm, the test is passing for me.
Are you sure you're on branch-3.1, or did you make some local changes there?
{code:none}
[INFO] -------------------------------------------------------
[INFO]  T E S T S
[INFO] -------------------------------------------------------
[INFO] Running org.apache.hadoop.hdfs.server.federation.router.TestRouterRPCClientRetries
[INFO] Tests run: 3, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 61.895 s - in org.apache.hadoop.hdfs.server.federation.router.TestRouterRPCClientRetries
[INFO] 
[INFO] Results:
[INFO] 
[INFO] Tests run: 3, Failures: 0, Errors: 0, Skipped: 0
[INFO] 
[INFO] ------------------------------------------------------------------------
[INFO] BUILD SUCCESS
[INFO] ------------------------------------------------------------------------
[INFO] Total time:  01:33 min
[INFO] Finished at: 2021-04-26T13:45:32Z
[INFO] ------------------------------------------------------------------------
$ git log --oneline -1
71a9885c978 (HEAD -> branch-3.1, origin/branch-3.1) HDFS-15621. Datanode DirectoryScanner uses excessive memory (#2849). Contributed by Stephen O'Donnell
{code}

> RBF: empty blockpool id on dfsrouter caused by UNAVAILABLE NameNode
> -------------------------------------------------------------------
>
>                 Key: HDFS-15900
>                 URL: https://issues.apache.org/jira/browse/HDFS-15900
>             Project: Hadoop HDFS
>          Issue Type: Bug
>          Components: rbf
>    Affects Versions: 3.3.0
>            Reporter: Harunobu Daikoku
>            Assignee: Harunobu Daikoku
>            Priority: Major
>              Labels: pull-request-available
>             Fix For: 3.3.1, 3.4.0, 3.1.5, 3.2.3
>
>         Attachments: image.png
>
>          Time Spent: 5h 50m
>  Remaining Estimate: 0h
>
> We observed that when a NameNode becomes UNAVAILABLE, the corresponding 
> blockpool id in MembershipStoreImpl#activeNamespaces on dfsrouter 
> is unintentionally reset to empty, its initial value.
>  !image.png|height=250!
> As a result, concat operations through dfsrouter fail with the following 
> error, because the router cannot resolve the block pool id among the 
> recognized active namespaces.
> {noformat}
> Caused by: org.apache.hadoop.ipc.RemoteException(org.apache.hadoop.ipc.RemoteException): Cannot locate a nameservice for block pool BP-...
> {noformat}
> A possible fix is to ignore UNAVAILABLE NameNode registrations, and set 
> proper namespace information obtained from available NameNode registrations 
> when constructing the cache of active namespaces.
>  
> [https://github.com/apache/hadoop/blob/rel/release-3.3.0/hadoop-hdfs-project/hadoop-hdfs-rbf/src/main/java/org/apache/hadoop/hdfs/server/federation/store/impl/MembershipStoreImpl.java#L207-L221]
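
The proposed fix can be illustrated with a minimal, self-contained sketch. This is not the actual MembershipStoreImpl code or the committed patch: the class ActiveNamespacesSketch, the Registration type, the State enum, and the activeBlockPools method are all hypothetical names made up for illustration, and the active namespaces are reduced to a simple map from nameservice id to block pool id. It assumes, as the description suggests, that an UNAVAILABLE registration carries an empty block pool id.

```java
import java.util.Arrays;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

// Hypothetical, simplified model; none of these types exist in Hadoop.
class ActiveNamespacesSketch {

  // Simplified stand-in for the states a NameNode registration can report.
  enum State { ACTIVE, STANDBY, UNAVAILABLE }

  // Simplified stand-in for one NameNode registration seen by the router.
  static final class Registration {
    final String nameserviceId;
    final String blockPoolId;
    final State state;

    Registration(String nameserviceId, String blockPoolId, State state) {
      this.nameserviceId = nameserviceId;
      this.blockPoolId = blockPoolId;
      this.state = state;
    }
  }

  // Builds a nameservice id -> block pool id view of the active namespaces.
  // UNAVAILABLE registrations are skipped, so their empty block pool id
  // never overwrites the value reported by an available NameNode.
  static Map<String, String> activeBlockPools(List<Registration> registrations) {
    Map<String, String> result = new TreeMap<>();
    for (Registration r : registrations) {
      if (r.state == State.UNAVAILABLE) {
        continue; // assumption: carries only empty initial values
      }
      result.put(r.nameserviceId, r.blockPoolId);
    }
    return result;
  }

  public static void main(String[] args) {
    List<Registration> registrations = Arrays.asList(
        new Registration("ns0", "BP-example-0", State.ACTIVE),
        new Registration("ns0", "", State.UNAVAILABLE),
        new Registration("ns1", "", State.UNAVAILABLE));
    // ns0 keeps the block pool id from its available NameNode;
    // ns1 has no available NameNode and is left out entirely.
    System.out.println(activeBlockPools(registrations));
  }
}
```

In this sketch the order of registrations does not matter, because an UNAVAILABLE entry is never written to the map. A nameservice whose NameNodes are all UNAVAILABLE simply does not appear, which is one possible reading of "ignore UNAVAILABLE NameNode registrations".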



--
This message was sent by Atlassian Jira
(v8.3.4#803005)
