[ 
https://issues.apache.org/jira/browse/HBASE-18541?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16125890#comment-16125890
 ] 

Ted Yu commented on HBASE-18541:
--------------------------------

Another instance of segfault:
{code}
Program terminated with signal SIGSEGV, Segmentation fault.
#0  0x00007fb387315dc8 in os::write_memory_serialize_page (thread=0x2af3000) at 
/build/openjdk-8-pZyJp3/openjdk-8-8u131-b11/src/hotspot/src/share/vm/runtime/os.hpp:419
419     
/build/openjdk-8-pZyJp3/openjdk-8-8u131-b11/src/hotspot/src/share/vm/runtime/os.hpp:
 No such file or directory.
[Current thread is 1 (Thread 0x7fb387dbe840 (LWP 9221))]
Installing openjdk unwinder
(gdb) bt
#0  0x00007fb387315dc8 in 
ThreadStateTransition::transition_and_fence(JavaThread*, JavaThreadState, 
JavaThreadState) (thread=0x2af3000)
    at 
/build/openjdk-8-pZyJp3/openjdk-8-8u131-b11/src/hotspot/src/share/vm/runtime/os.hpp:419
#1  0x00007fb387315dc8 in 
ThreadStateTransition::transition_and_fence(JavaThread*, JavaThreadState, 
JavaThreadState) (thread=0x2af3000)
    at 
/build/openjdk-8-pZyJp3/openjdk-8-8u131-b11/src/hotspot/src/os/linux/vm/interfaceSupport_linux.hpp:31
#2  0x00007fb387315dc8 in 
ThreadStateTransition::transition_and_fence(JavaThread*, JavaThreadState, 
JavaThreadState) (thread=thread@entry=0x2af3000, to=_thread_in_native, 
from=_thread_in_vm) at 
/build/openjdk-8-pZyJp3/openjdk-8-8u131-b11/src/hotspot/src/share/vm/runtime/interfaceSupport.hpp:179
#3  0x00007fb38731719f in JVM_FillInStackTrace(JNIEnv*, jobject) 
(to=_thread_in_native, from=_thread_in_vm, this=<synthetic pointer>)
    at 
/build/openjdk-8-pZyJp3/openjdk-8-8u131-b11/src/hotspot/src/share/vm/runtime/interfaceSupport.hpp:232
#4  0x00007fb38731719f in JVM_FillInStackTrace(JNIEnv*, jobject) 
(this=<synthetic pointer>, __in_chrg=<optimized out>)
    at 
/build/openjdk-8-pZyJp3/openjdk-8-8u131-b11/src/hotspot/src/share/vm/runtime/interfaceSupport.hpp:281
#5  0x00007fb38731719f in JVM_FillInStackTrace(JNIEnv*, jobject) 
(env=<optimized out>, receiver=receiver@entry=0x7ffde93448a0)
    at 
/build/openjdk-8-pZyJp3/openjdk-8-8u131-b11/src/hotspot/src/share/vm/prims/jvm.cpp:516
#6  0x00007fb38395e851 in Java_java_lang_Throwable_fillInStackTrace 
(env=<optimized out>, throwable=0x7ffde93448a0, dummy=<optimized out>)
    at 
/build/openjdk-8-pZyJp3/openjdk-8-8u131-b11/src/jdk/src/share/native/java/lang/Throwable.c:49
#7  0x00007fb373eb9a28 in [native offset=0xa8] 
java.lang.Throwable.fillInStackTrace(int) () at java/lang/Throwable.java
#8  0x00007fb3743472a4 in [compiled offset=0x84] 
java.lang.Throwable.fillInStackTrace() () at java/lang/Throwable.java:781
#9  0x00007fb3743bc914 in [compiled offset=0x194] java.lang.Throwable.<init>() 
() at java/lang/Throwable.java:249
#10 0x00007fb37421a0d4 in [compiled offset=0x1b4] 
org.apache.log4j.helpers.PatternParser$LocationPatternConverter.convert(org.apache.log4j.spi.LoggingEvent)
 ()
    at org/apache/log4j/helpers/PatternParser.java:500
#11 0x00007fb37417eab4 in [compiled offset=0x114] 
org.apache.log4j.helpers.PatternConverter.format(java.lang.StringBuffer,org.apache.log4j.spi.LoggingEvent)
 ()
    at org/apache/log4j/helpers/PatternConverter.java:65
#12 0x00007fb37426315c in [inlined] java.lang.StringBuffer.setLength(int) () at 
java/lang/StringBuffer.java:193
0x00007fb37426315c in [compiled offset=0x71c] 
org.apache.log4j.PatternLayout.format(org.apache.log4j.spi.LoggingEvent) () at 
org/apache/log4j/PatternLayout.java:503
#13 0x00007fb37454484c in [compiled offset=0x12c] 
org.apache.log4j.WriterAppender.subAppend(org.apache.log4j.spi.LoggingEvent) () 
at org/apache/log4j/WriterAppender.java:310
#14 0x00007fb374538aac in [compiled offset=0x1ec] 
org.apache.log4j.WriterAppender.append(org.apache.log4j.spi.LoggingEvent) () at 
org/apache/log4j/WriterAppender.java:160
#15 0x00007fb37454793c in [compiled offset=0x113c] 
org.apache.log4j.AppenderSkeleton.doAppend(org.apache.log4j.spi.LoggingEvent) ()
    at org/apache/log4j/AppenderSkeleton.java:251
#16 0x00007fb374074204 in [compiled offset=0x4c4] 
org.apache.log4j.helpers.AppenderAttachableImpl.appendLoopOnAppenders(org.apache.log4j.spi.LoggingEvent)
 ()
    at org/apache/log4j/helpers/AppenderAttachableImpl.java:66
#17 0x00007fb3742b5f24 in [compiled offset=0x1e4] 
org.apache.log4j.Category.callAppenders(org.apache.log4j.spi.LoggingEvent) () 
at org/apache/log4j/Category.java:200
#18 0x00007fb374208d5c in [inlined] 
org.apache.log4j.Category.forcedLog(java.lang.String,org.apache.log4j.Priority,java.lang.Object,java.lang.Throwable)
 ()
    at org/apache/log4j/Category.java:392
0x00007fb374208d5c in [compiled offset=0x67c] 
org.apache.log4j.Category.log(java.lang.String,org.apache.log4j.Priority,java.lang.Object,java.lang.Throwable)
 ()
    at org/apache/log4j/Category.java:858
#19 0x00007fb37454b374 in [compiled offset=0x154] 
org.apache.commons.logging.impl.Log4JLogger.info(java.lang.Object) () at 
org/apache/commons/logging/impl/Log4JLogger.java:177
#20 0x00007fb373cee042 in [interpreted: bc = 50] 
org.apache.hadoop.hbase.regionserver.HRegionServer.stop(java.lang.String) ()
    at org/apache/hadoop/hbase/regionserver/HRegionServer.java:1925
{code}
No zookeeper involved. But the line number in os.hpp was the same.

> [C++] Segfaults from JNI
> ------------------------
>
>                 Key: HBASE-18541
>                 URL: https://issues.apache.org/jira/browse/HBASE-18541
>             Project: HBase
>          Issue Type: Sub-task
>            Reporter: Enis Soztutar
>            Assignee: Ted Yu
>
> retry-test and multi-retry-test fails flakily when run with 
> {code}
> buck test --all --no-results-cache
> {code}
> or when run in a loop:
> {code}
> for i in `seq 1 10`; do buck test --no-results-cache core:retry-test || break 
> 1; done
> {code}
> The problem seems to be within the JNI internals and usually happens at the 
> create table method call. I was not able to inspect much, but the comments in 
> our mini-cluster indicate that we may need to use global references instead 
> of local ones. I suspect the problem happens when there is a GC run for the 
> test since the failure happens usually after some time (but almost always in 
> create table method). 
> [~ted_yu] do you mind taking a look at this. 



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

Reply via email to