[ 
https://issues.apache.org/jira/browse/HDFS-7048?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15170214#comment-15170214
 ] 

Tsz Wo Nicholas Sze commented on HDFS-7048:
-------------------------------------------

Thanks  Chengbing for working on this.   Some suggestion:

- Since the wait time is at most 1 second, how about we simply change the 
wait(..) to sleep(..) and completely remove the notify(..) calls?
- The new log message may not be useful for common users.  How about removing 
it or changing it to debug?

> Incorrect Dispatcher#Source wait/notify leads to early termination
> ------------------------------------------------------------------
>
>                 Key: HDFS-7048
>                 URL: https://issues.apache.org/jira/browse/HDFS-7048
>             Project: Hadoop HDFS
>          Issue Type: Sub-task
>          Components: balancer & mover
>    Affects Versions: 2.6.0, 2.7.0
>            Reporter: Andrew Wang
>            Assignee: Chengbing Liu
>         Attachments: HDFS-7048.01.patch
>
>
> Split off from HDFS-6621. The Balancer attempts to wake up scheduler threads 
> early as sources finish, but the synchronization with wait and notify is 
> incorrect. This ticks the failure count, which can lead to early termination.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to