Daryn Sharp created HDFS-12142:
----------------------------------

             Summary: Files may be closed before streamer is done
                 Key: HDFS-12142
                 URL: https://issues.apache.org/jira/browse/HDFS-12142
             Project: Hadoop HDFS
          Issue Type: Bug
          Components: hdfs
    Affects Versions: 2.8.0
            Reporter: Daryn Sharp


We're encountering multiple cases of clients calling updateBlockForPipeline on 
completed blocks.  Initial analysis suggests that the client closes a file, 
completeFile succeeds, and the client then immediately attempts recovery.  The 
exception is swallowed on the client and only logged on the NN by checkUCBlock.

The problem "appears" to be benign (no data loss), but it is unproven whether 
the issue always occurs for successfully closed files.  There appears to be 
very poor coordination between the DFS output stream's threads, which leads to 
races that confuse the streamer thread.  The streamer thread probably should 
have been joined before returning from close.
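
To illustrate the suggestion of joining the streamer before close returns, 
here is a minimal sketch.  It is not the actual DFSOutputStream or DataStreamer 
code: the class JoinOnCloseSketch, its fields, and its methods are all 
hypothetical stand-ins, and the "completeFile" step is modelled as a simple 
flag.  The only point is the ordering: stop the background thread, join it, and 
only then mark the file complete, so the thread cannot act on a completed file.

```java
// Hypothetical sketch only; not the real HDFS client classes.
final class JoinOnCloseSketch {
    private volatile boolean stopRequested = false;
    private volatile boolean fileCompleted = false;      // stand-in for a successful completeFile
    private volatile boolean actedAfterComplete = false; // stand-in for a late recovery attempt
    private final Thread streamer;

    JoinOnCloseSketch() {
        streamer = new Thread(this::runStreamer, "streamer-sketch");
        streamer.start();
    }

    // Background loop standing in for the streamer thread's work.
    private void runStreamer() {
        while (!stopRequested) {
            if (fileCompleted) {
                // Would correspond to touching a block after the file is complete.
                actedAfterComplete = true;
            }
            Thread.yield();
        }
    }

    // Stops and joins the streamer BEFORE marking the file complete.
    void close() {
        stopRequested = true;
        boolean interrupted = false;
        while (true) {
            try {
                streamer.join();
                break;
            } catch (InterruptedException e) {
                interrupted = true; // remember the interrupt and keep waiting
            }
        }
        if (interrupted) {
            Thread.currentThread().interrupt(); // restore the caller's interrupt status
        }
        fileCompleted = true;
    }

    boolean isStreamerAlive() { return streamer.isAlive(); }
    boolean isFileCompleted() { return fileCompleted; }
    boolean actedAfterComplete() { return actedAfterComplete; }

    public static void main(String[] args) {
        JoinOnCloseSketch stream = new JoinOnCloseSketch();
        stream.close();
        System.out.println("streamer alive after close: " + stream.isStreamerAlive());
        System.out.println("acted after complete: " + stream.actedAfterComplete());
    }
}
```

Because the flag is set only after join() returns, the background thread has 
already exited by the time the file counts as complete.  Whether the real 
client can be restructured this way is an open question for this issue.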


