[
https://issues.apache.org/jira/browse/HDFS-615?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12870045#action_12870045
]
Steve Loughran commented on HDFS-615:
-------------------------------------
I don't know the root cause of the original stack, mine is slightly different
and triggered by a null NN which only happens sometimes, and which even minor
changes to the code/system use cause to go away. This makes me think its
triggered by the namenode being set to null when the NN shuts down, which would
explain why the assertion only kicks in after 60-90 seconds: it's not a test
failure so much as race conditions in the shutdown logic.
> TestLargeDirectoryDelete fails with NullPointerException
> --------------------------------------------------------
>
> Key: HDFS-615
> URL: https://issues.apache.org/jira/browse/HDFS-615
> Project: Hadoop HDFS
> Issue Type: Bug
> Components: name-node
> Affects Versions: 0.21.0
> Environment: 64-bit debian 5, 64-bit sun java6, running in a single
> processor VM.
> Reporter: Eli Collins
> Priority: Minor
> Attachments: HDFS-615.patch, TestLargeDirectoryDelete.zip
>
>
> I've hit the following failure two out of two times running "ant test" at rev
> 813587. This test doesn't appear to be failing on hudson. All other tests
> passed except TestHDFSFileSystemContract which timed out, so perhaps there's
> a race due to the test executing slowly.
> [junit] Running
> org.apache.hadoop.hdfs.server.namenode.TestLargeDirectoryDelete
> [junit] Exception in thread "Thread-30148" java.lang.NullPointerException
> [junit] at
> org.apache.hadoop.hdfs.server.namenode.NameNodeAdapter.getNamesystem(NameNodeAdapter.java:32)
> [junit] at
> org.apache.hadoop.hdfs.MiniDFSCluster.getNamesystem(MiniDFSCluster.java:522)
> [junit] at
> org.apache.hadoop.hdfs.server.namenode.TestLargeDirectoryDelete.getBlockCount(TestLargeDirectoryDelete.java:75)
> [junit] at
> org.apache.hadoop.hdfs.server.namenode.TestLargeDirectoryDelete.access$000(TestLargeDirectoryDelete.java:38)
> [junit] at
> org.apache.hadoop.hdfs.server.namenode.TestLargeDirectoryDelete$1.run(TestLargeDirectoryDelete.java:90)
> [junit] Tests run: 1, Failures: 0, Errors: 0, Time elapsed: 94.264 sec
> ^^^^^^^^^^^^^^^^^^^^^^^^^^^^ No failures
> or errors?
> public static FSNamesystem getNamesystem(NameNode namenode) {
> return namenode.getNamesystem();
> <===========
> }
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.