[
https://issues.apache.org/jira/browse/HBASE-18541?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16124523#comment-16124523
]
Ted Yu commented on HBASE-18541:
--------------------------------
Managed to generate core dump where:
{code}
Core was generated by
`/usr/src/hbase/hbase-native-client/buck-out/gen/core/retry-test
--gtest_color=n'.
Program terminated with signal SIGSEGV, Segmentation fault.
#0 0x00007fe32bc33135 in ?? () from
/usr/lib/jvm/java-8-openjdk-amd64//jre/lib/amd64/server/libjvm.so
[Current thread is 1 (Thread 0x7fe32c343840 (LWP 19436))]
(gdb) bt
#0 0x00007fe32bc33135 in ?? () from
/usr/lib/jvm/java-8-openjdk-amd64//jre/lib/amd64/server/libjvm.so
#1 0x00007fe31885053e in ?? ()
#2 0x00000000e0d55920 in ?? ()
#3 0x00000000fefd3110 in ?? ()
#4 0x00007ffdd7ce7210 in ?? ()
#5 0x00007fe32b66dfed in ?? () from
/usr/lib/jvm/java-8-openjdk-amd64//jre/lib/amd64/server/libjvm.so
#6 0x00007fe318282b10 in ?? ()
#7 0x00000000029426a0 in ?? ()
#8 0x00007fe318282b10 in ?? ()
#9 0x0000000000000010 in ?? ()
#10 0x00007fe32b4b32fd in ?? () from
/usr/lib/jvm/java-8-openjdk-amd64//jre/lib/amd64/server/libjvm.so
#11 0x00007fe318282b10 in ?? ()
#12 0x0000000000000010 in ?? ()
#13 0x00007fe31840c5c8 in ?? ()
#14 0x00000000fefd3110 in ?? ()
{code}
there was no method from native client shown above.
> [C++] Segfaults from JNI
> ------------------------
>
> Key: HBASE-18541
> URL: https://issues.apache.org/jira/browse/HBASE-18541
> Project: HBase
> Issue Type: Sub-task
> Reporter: Enis Soztutar
> Assignee: Ted Yu
>
> retry-test and multi-retry-test fails flakily when run with
> {code}
> buck test --all --no-results-cache
> {code}
> or when run in a loop:
> {code}
> for i in `seq 1 10`; do buck test --no-results-cache core:retry-test || break
> 1; done
> {code}
> The problem seems to be within the JNI internals and usually happens at the
> create table method call. I was not able to inspect much, but the comments in
> our mini-cluster indicate that we may need to use global references instead
> of local ones. I suspect the problem happens when there is a GC run for the
> test since the failure happens usually after some time (but almost always in
> create table method).
> [~ted_yu] do you mind taking a look at this.
--
This message was sent by Atlassian JIRA
(v6.4.14#64029)