[ 
https://issues.apache.org/jira/browse/HDFS-10467?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15381726#comment-15381726
 ] 

Ming Ma commented on HDFS-10467:
--------------------------------

[~elgoiri], nice work! Here are couple more questions about the design. I will 
post code review questions in sub jiras.

* Support for mergeFs HADOOP-8298. We should be able to extend the design to 
support this.There might be some issues around how to provision a new sub 
folder (which namespace should own that) and how it works with rebalancer. This 
could be a good addition for future work section.
* Handling of inconsistent state. Given routers cache which namenodes are 
active, the state could be different from the actual namenode at that moment. 
Thus routers might get {{StandbyException}} and need to retry on another 
namenode. If so, does it mean the routers should leverage ipc 
{{FailoverOnNetworkExceptionRetry}} or use DFSClient with hint for active 
namenode?
* Soft state vs hard state. while subcluster active namenode machine and 
load/space are soft state that can be reconstructed from namenodes; mount table 
is hard state that need to be persisted. Is there any benefit separating them 
out to use different state stores as they have different persistence 
requirement, access patterns(mount table does't change much while load/space 
update is frequent) and admin interface? For example, admin might want to 
update mount table on demand; but not load/space state.
* Usage of subcluster load/space state. Is it correct that the only consumer of 
subcluster's load/space state is the rebalancer? I image initially we would run 
rebalancer manually. For that, the rebalancer can just pull subcluster's 
load/space state from namenodes on demand. Then we don't have to store 
subcluster load/space state in state store.
* Admin's modification of mount table. Besides rebalancer, admin might want to 
update mount table during cluster initial setup as well as addition of new 
namespace with new mount entry. If we continue to use mounttable.xml, then 
admins can push the update the same way as viewFs setup. If we use ZK store, 
them we need to provide tools to update state store.
* What is the performance optimization in your latest patch, based on async RPC 
client?

> Router-based HDFS federation
> ----------------------------
>
>                 Key: HDFS-10467
>                 URL: https://issues.apache.org/jira/browse/HDFS-10467
>             Project: Hadoop HDFS
>          Issue Type: New Feature
>          Components: fs
>    Affects Versions: 2.7.2
>            Reporter: Inigo Goiri
>            Assignee: Inigo Goiri
>         Attachments: HDFS Router Federation.pdf, HDFS-10467.PoC.001.patch, 
> HDFS-10467.PoC.patch, HDFS-Router-Federation-Prototype.patch
>
>
> Add a Router to provide a federated view of multiple HDFS clusters.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to