[
https://issues.apache.org/jira/browse/HDFS-7048?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15170214#comment-15170214
]
Tsz Wo Nicholas Sze commented on HDFS-7048:
-------------------------------------------
Thanks Chengbing for working on this. Some suggestion:
- Since the wait time is at most 1 second, how about we simply change the
wait(..) to sleep(..) and completely remove the notify(..) calls?
- The new log message may not be useful for common users. How about removing
it or changing it to debug?
> Incorrect Dispatcher#Source wait/notify leads to early termination
> ------------------------------------------------------------------
>
> Key: HDFS-7048
> URL: https://issues.apache.org/jira/browse/HDFS-7048
> Project: Hadoop HDFS
> Issue Type: Sub-task
> Components: balancer & mover
> Affects Versions: 2.6.0, 2.7.0
> Reporter: Andrew Wang
> Assignee: Chengbing Liu
> Attachments: HDFS-7048.01.patch
>
>
> Split off from HDFS-6621. The Balancer attempts to wake up scheduler threads
> early as sources finish, but the synchronization with wait and notify is
> incorrect. This ticks the failure count, which can lead to early termination.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)