[ 
https://issues.apache.org/jira/browse/HDFS-7611?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14284651#comment-14284651
 ] 

Konstantin Shvachko commented on HDFS-7611:
-------------------------------------------

I was looking at {{TestOpenFilesWithSnapshot}}, which also restarts the NameNode and 
fails intermittently with the same timeout. I see behavior similar to what Byron 
described.
The test creates two files, {{/test/test/test2}} and {{/test/test/test3}}, then 
aborts the streams, creates a snapshot, deletes the files, and restarts the 
NameNode. If any replica of the files was created on any of the DNs, the test 
succeeds. If the streams are aborted before the replicas are created, the test 
fails.
So some blocks that were deleted before the NN restart are not being garbage 
collected on restart, and the NN then cannot get out of safe mode.
This test does not use truncate, but does use snapshots.
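
To illustrate why leftover blocks could keep the NN in safe mode, here is a 
rough, self-contained model of a block-threshold check. It is only a sketch 
and not the NameNode's actual code: the class and method names are made up, 
and it assumes the required count is the expected block total scaled by a 
threshold (0.999, which I believe is the default of 
{{dfs.namenode.safemode.threshold-pct}}) and truncated to an integer.

```java
// Simplified, hypothetical model of a safe-mode block threshold check.
// Not taken from the HDFS source; names and rounding are assumptions.
class SafeModeSketch {

    /**
     * Returns true if enough blocks have been reported to leave safe mode.
     *
     * @param expectedBlocks blocks the NN believes exist after restart
     * @param reportedBlocks blocks for which some DN reported a replica
     * @param thresholdPct   fraction of expected blocks that must be reported
     */
    static boolean canLeaveSafeMode(long expectedBlocks, long reportedBlocks,
                                    double thresholdPct) {
        // Assumption: the required count is truncated, not rounded up.
        long needed = (long) (expectedBlocks * thresholdPct);
        return reportedBlocks >= needed;
    }

    public static void main(String[] args) {
        double pct = 0.999;
        // Two stale blocks still counted, no replica ever written: stuck.
        System.out.println(canLeaveSafeMode(2, 0, pct));
        // At least one replica made it to a DN: the threshold is met.
        System.out.println(canLeaveSafeMode(2, 1, pct));
        // Stale blocks cleaned up on restart: nothing to wait for.
        System.out.println(canLeaveSafeMode(0, 0, pct));
    }
}
```

Under these assumptions, two expected blocks with no reported replicas never 
reach the threshold, while a single reported replica is enough, which would 
be consistent with the intermittent pass/fail pattern described above.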

> TestFileTruncate.testTruncateEditLogLoad times out waiting for Mini HDFS 
> Cluster to start
> -----------------------------------------------------------------------------------------
>
>                 Key: HDFS-7611
>                 URL: https://issues.apache.org/jira/browse/HDFS-7611
>             Project: Hadoop HDFS
>          Issue Type: Bug
>    Affects Versions: 3.0.0
>            Reporter: Konstantin Shvachko
>            Assignee: Byron Wong
>         Attachments: testTruncateEditLogLoad.log
>
>
> I've seen it failing on Jenkins a couple of times. Somehow the cluster does 
> not become ready after the NN restart.
> Not sure if it is truncate-specific, as I've seen the same behaviour with 
> other tests that restart the NameNode.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
