[ 
https://issues.apache.org/jira/browse/HDFS-15294?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17347493#comment-17347493
 ] 

zhengchenyu commented on HDFS-15294:
------------------------------------

Thanks for this great work! But I have some question, if source directory be 
writting all the time, is it means Federation balance will never exit? 

In our cluster, we have tool like this. We use "distcp diff snapshot" firstly, 
but gave up it. Then I use multi dest nameservice mountable, write to the dst 
nameservice. Then copy the source data to dst. Then I have only one issue: keep 
data consistent , so I submit HDFS-15750.


> Federation balance tool
> -----------------------
>
>                 Key: HDFS-15294
>                 URL: https://issues.apache.org/jira/browse/HDFS-15294
>             Project: Hadoop HDFS
>          Issue Type: New Feature
>            Reporter: Jinglun
>            Assignee: Jinglun
>            Priority: Major
>             Fix For: 3.4.0
>
>         Attachments: BalanceProcedureScheduler.png, HDFS-15294.001.patch, 
> HDFS-15294.002.patch, HDFS-15294.003.patch, HDFS-15294.003.reupload.patch, 
> HDFS-15294.004.patch, HDFS-15294.005.patch, HDFS-15294.006.patch, 
> HDFS-15294.007.patch, distcp-balance.pdf, distcp-balance.v2.pdf
>
>
> This jira introduces a new HDFS federation balance tool to balance data 
> across different federation namespaces. It uses Distcp to copy data from the 
> source path to the target path.
> The process is:
>  1. Use distcp and snapshot diff to sync data between src and dst until they 
> are the same.
>  2. Update mount table in Router if we specified RBF mode.
>  3. Deal with src data, move to trash, delete or skip them.
> The design of fedbalance tool comes from the discussion in HDFS-15087.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: hdfs-issues-unsubscr...@hadoop.apache.org
For additional commands, e-mail: hdfs-issues-h...@hadoop.apache.org

Reply via email to