HDFS Federation: MiniDFSCluster#waitActive() bug causes some tests to fail 
---------------------------------------------------------------------------

                 Key: HDFS-1718
                 URL: https://issues.apache.org/jira/browse/HDFS-1718
             Project: Hadoop HDFS
          Issue Type: Sub-task
          Components: test
    Affects Versions: Federation Branch
            Reporter: Suresh Srinivas
            Assignee: Suresh Srinivas
             Fix For: Federation Branch


The MiniDFSCluster#shouldWait() method waits for all the datanodes to come up 
and register with the namenode.
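
For illustration only, here is a minimal sketch of the kind of polling wait 
described above. It is not the MiniDFSCluster code: the class WaitActiveSketch, 
the method waitUntilActive() and its parameters are hypothetical names, and the 
shouldWait condition is passed in as a plain BooleanSupplier.

```java
import java.util.function.BooleanSupplier;

// Hypothetical helper, not part of Hadoop: polls a "should we keep waiting?"
// condition until it clears or a timeout expires.
class WaitActiveSketch {

    /**
     * Returns true once shouldWait reports false, or false if the timeout
     * expires (or the calling thread is interrupted) first.
     */
    static boolean waitUntilActive(BooleanSupplier shouldWait,
                                   long timeoutMillis, long pollMillis) {
        long deadline = System.nanoTime() + timeoutMillis * 1_000_000L;
        while (shouldWait.getAsBoolean()) {
            if (System.nanoTime() - deadline >= 0) {
                return false; // still waiting when the timeout expired
            }
            try {
                Thread.sleep(pollMillis);
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt(); // preserve interrupt status
                return false;
            }
        }
        return true;
    }

    public static void main(String[] args) {
        long start = System.currentTimeMillis();
        // Stand-in condition: "datanodes" count as registered after ~50 ms.
        boolean active = waitUntilActive(
                () -> System.currentTimeMillis() - start < 50, 1000, 5);
        System.out.println("cluster active: " + active);
    }
}
```

In a loop of this shape, a liveness check that gives the wrong answer keeps the 
caller waiting until the timeout, which is one way a threading bug in the check 
can surface as a test failure.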

Due to threading issues, some of the tests fail, for two reasons:
# Datanode#isDatanodeUp() fails even if all the BPOfferService threads have 
exited. This is because Thread.isAlive() returns true even though the thread 
has exited. Adding an additional check of BPOfferService#shouldServiceRun 
fixes this issue.
# shouldWait(), where isBPServiceAlive() is called, does not work when a 
BPOfferService thread fails before the datanode has discovered the BPID from 
the handshake with the namenode. This can be fixed by checking the thread 
state using the InetSocketAddress, instead of the BPID, to determine the 
BPOfferService.
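
The two fixes above can be sketched roughly as follows. This is a simplified 
stand-in, not the Hadoop source: BpService and DataNodeSketch are hypothetical 
classes, the failHandshake flag and the BPID value are made up for the 
example, and only the names shouldServiceRun, isBPServiceAlive() and 
isDatanodeUp() are borrowed from the description. The sketch assumes one 
service thread per namenode address.

```java
import java.net.InetSocketAddress;
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.CountDownLatch;

class BpServiceLivenessSketch {

    /** Hypothetical stand-in for one BPOfferService and its thread. */
    static final class BpService implements Runnable {
        private final InetSocketAddress nnAddr;   // known before any handshake
        private final boolean failHandshake;      // test hook, not in Hadoop
        private final CountDownLatch stopSignal = new CountDownLatch(1);
        private final Thread thread;
        private volatile String bpid;             // null until the handshake succeeds
        private volatile boolean shouldServiceRun = true;

        BpService(InetSocketAddress nnAddr, boolean failHandshake) {
            this.nnAddr = nnAddr;
            this.failHandshake = failHandshake;
            this.thread = new Thread(this, "bp-" + nnAddr);
        }

        void start() { thread.start(); }
        void stop() { stopSignal.countDown(); }
        String getBpid() { return bpid; }

        @Override
        public void run() {
            try {
                if (failHandshake) {
                    return; // simulated failure: exit before a BPID is known
                }
                bpid = "BP-" + nnAddr.getPort(); // made-up BPID for the sketch
                stopSignal.await();              // "serve" until asked to stop
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt();
            } finally {
                shouldServiceRun = false; // cleared before the thread terminates
            }
        }

        /** Fix 1: check the flag in addition to Thread.isAlive(). */
        boolean isAlive() {
            return shouldServiceRun && thread.isAlive();
        }

        void awaitExit() {
            try {
                thread.join();
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt();
            }
        }
    }

    /** Hypothetical stand-in for the datanode's service registry. */
    static final class DataNodeSketch {
        // Fix 2: key services by namenode address, which exists even when
        // the handshake never completes, rather than by BPID.
        private final Map<InetSocketAddress, BpService> services =
                new ConcurrentHashMap<>();

        BpService addService(InetSocketAddress nnAddr, boolean failHandshake) {
            BpService service = new BpService(nnAddr, failHandshake);
            services.put(nnAddr, service);
            service.start();
            return service;
        }

        boolean isBPServiceAlive(InetSocketAddress nnAddr) {
            BpService service = services.get(nnAddr);
            return service != null && service.isAlive();
        }

        boolean isDatanodeUp() {
            for (BpService service : services.values()) {
                if (service.isAlive()) {
                    return true;
                }
            }
            return false;
        }

        void shutdown() {
            for (BpService service : services.values()) {
                service.stop();
                service.awaitExit();
            }
        }
    }

    public static void main(String[] args) {
        InetSocketAddress good = InetSocketAddress.createUnresolved("nn1.example", 8020);
        InetSocketAddress bad = InetSocketAddress.createUnresolved("nn2.example", 8020);
        DataNodeSketch dn = new DataNodeSketch();
        dn.addService(good, false);
        dn.addService(bad, true).awaitExit();
        System.out.println("nn1 service alive: " + dn.isBPServiceAlive(good));
        System.out.println("nn2 service alive: " + dn.isBPServiceAlive(bad));
        dn.shutdown();
        System.out.println("datanode up after shutdown: " + dn.isDatanodeUp());
    }
}
```

Because the failed service never learns a BPID, a lookup keyed by BPID would 
have nothing to search for, whereas the address-keyed lookup can still find the 
service and report it as not alive.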


-- 
This message is automatically generated by JIRA.
-
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Reply via email to