[ 
https://issues.apache.org/jira/browse/HDFS-12711?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16219020#comment-16219020
 ] 

Allen Wittenauer commented on HDFS-12711:
-----------------------------------------

The problem that we're seeing is that if we run the full gamut of Yetus tests 
on branch-2, it invariably hard crashes the node on the ASF build 
infrastructure.  Always. Every time.  It also always happens during the hdfs 
unit tests. I'm attempting to isolate it to see if it is *just* the HDFS unit 
tests that trigger the crash or if I need to run through common, etc, first 
too.  This will do two things:

* drop the reproducible test case down from 5+ hours to 1+ hours
* confirm that it is entirely in the hdfs unit tests and not from something 
else in the path

My ultimate goal is to get Yetus to configure the Docker container to at least 
prevent the crashes.  But I need a 'faster' way to test...

FWIW, I'm still unsure if it is a kernel panic or just breaking Jenkins enough 
that it thinks the node is catatonic.  From the *one* time I was able to see 
logs before they disappeared, there were a ton of OOM errors, a core dump, and 
more.  This leads me to believe that it is likely a kernel panic caused by the 
OOM killer going nuts, since it's well established how badly the Linux kernel 
behaves when memory is low.  (That's also why I can't really test at home... 
I'm not using Linux on my "big box".)

> deadly hdfs test
> ----------------
>
>                 Key: HDFS-12711
>                 URL: https://issues.apache.org/jira/browse/HDFS-12711
>             Project: Hadoop HDFS
>          Issue Type: Test
>            Reporter: Allen Wittenauer
>         Attachments: HDFS-12711.branch-2.00.patch
>
>

--
This message was sent by Atlassian JIRA
(v6.4.14#64029)
