[ 
https://issues.apache.org/jira/browse/HDFS-7048?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15973952#comment-15973952
 ] 

Weizhan Zeng commented on HDFS-7048:
------------------------------------

[~szetszwo], could you help assign this to [~chengbing.liu]? I have tested it
on my online cluster, and judging from the output it seems to work. I will look
deeper into the code. I also think [~szetszwo]'s idea is good:

bq. Since the wait time is at most 1 second, how about we simply change the
wait(..) to sleep(..) and completely remove the notify(..) calls?
The new log message may not be useful for common users. How about removing it 
or changing it to debug?
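
To make the quoted suggestion concrete, here is a minimal sketch of polling with a bounded sleep(..) instead of wait(..)/notify(..). It is not the Balancer's Dispatcher code: the class name, pollUntilReady, the pending counter and the retry budget are all hypothetical, and the 1000 ms cap is only taken from the "at most 1 second" remark above.

```java
import java.util.concurrent.atomic.AtomicInteger;

public class SleepInsteadOfWait {
    // Assumed upper bound on a single pause, from the "at most 1 second" remark.
    static final long MAX_WAIT_MS = 1000L;

    /**
     * Polls until work is pending or the retry budget is used up.
     * Returns the number of sleeps performed. No monitor is held and no
     * notify(..) is needed: the loop re-checks the condition after each sleep.
     */
    static int pollUntilReady(AtomicInteger pending, long sleepMs, int maxRetries) {
        int retries = 0;
        while (pending.get() == 0 && retries < maxRetries) {
            try {
                Thread.sleep(Math.min(sleepMs, MAX_WAIT_MS));
            } catch (InterruptedException e) {
                // Restore the interrupt flag and stop polling.
                Thread.currentThread().interrupt();
                break;
            }
            retries++;
        }
        return retries;
    }

    public static void main(String[] args) throws InterruptedException {
        AtomicInteger pending = new AtomicInteger(0);
        // Hypothetical producer that makes work available after a short delay.
        Thread producer = new Thread(() -> {
            try {
                Thread.sleep(50);
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt();
            }
            pending.incrementAndGet();
        });
        producer.start();
        int retries = pollUntilReady(pending, 10, 100);
        producer.join();
        System.out.println("pending=" + pending.get() + " retries=" + retries);
    }
}
```

The trade-off is that a waiter may notice new work up to one sleep interval late, which seems acceptable when the interval is capped at 1 second, and in exchange there is no notify(..) call that can be missed or issued on the wrong monitor.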


> Incorrect Dispatcher#Source wait/notify leads to early termination
> ------------------------------------------------------------------
>
>                 Key: HDFS-7048
>                 URL: https://issues.apache.org/jira/browse/HDFS-7048
>             Project: Hadoop HDFS
>          Issue Type: Sub-task
>          Components: balancer & mover
>    Affects Versions: 2.6.0, 2.7.0
>            Reporter: Andrew Wang
>         Attachments: HDFS-7048.01.patch
>
>
> Split off from HDFS-6621. The Balancer attempts to wake up scheduler threads 
> early as sources finish, but the synchronization with wait and notify is 
> incorrect. This ticks the failure count, which can lead to early termination.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]