[ 
https://issues.apache.org/jira/browse/HDFS-15640?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17218259#comment-17218259
 ] 

Jinglun commented on HDFS-15640:
--------------------------------

Hi [~linyiqun], thanks your nice comments and clear explanation !  I agree with 
you. A downside of the original time threshold is it creates an illusion that 
the write-blocking could finish within the specified time. Using the number of 
diff entries is clearer.

Upload v02 using the number of diff entries. Pending jenkins.

> RBF: Add fast distcp threshold to FedBalance.
> ---------------------------------------------
>
>                 Key: HDFS-15640
>                 URL: https://issues.apache.org/jira/browse/HDFS-15640
>             Project: Hadoop HDFS
>          Issue Type: Sub-task
>            Reporter: Jinglun
>            Assignee: Jinglun
>            Priority: Major
>         Attachments: HDFS-15640.001.patch, HDFS-15640.002.patch
>
>
> Currently in the DistCpProcedure it must submit distcp round by round until 
> there is no diff to go to the final distcp stage. The condition is very 
> strict. If the distcp could finish in an acceptable period then we don't need 
> to wait for no diff. For example if 3 consecutive distcp jobs all finish 
> within 10 minutes then we can predict the final distcp could also finish 
> within 10 minutes. So we can start the final distcp directly.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to