[ https://issues.apache.org/jira/browse/HBASE-18541?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16124523#comment-16124523 ]
Ted Yu commented on HBASE-18541: -------------------------------- Managed to generate core dump where: {code} Core was generated by `/usr/src/hbase/hbase-native-client/buck-out/gen/core/retry-test --gtest_color=n'. Program terminated with signal SIGSEGV, Segmentation fault. #0 0x00007fe32bc33135 in ?? () from /usr/lib/jvm/java-8-openjdk-amd64//jre/lib/amd64/server/libjvm.so [Current thread is 1 (Thread 0x7fe32c343840 (LWP 19436))] (gdb) bt #0 0x00007fe32bc33135 in ?? () from /usr/lib/jvm/java-8-openjdk-amd64//jre/lib/amd64/server/libjvm.so #1 0x00007fe31885053e in ?? () #2 0x00000000e0d55920 in ?? () #3 0x00000000fefd3110 in ?? () #4 0x00007ffdd7ce7210 in ?? () #5 0x00007fe32b66dfed in ?? () from /usr/lib/jvm/java-8-openjdk-amd64//jre/lib/amd64/server/libjvm.so #6 0x00007fe318282b10 in ?? () #7 0x00000000029426a0 in ?? () #8 0x00007fe318282b10 in ?? () #9 0x0000000000000010 in ?? () #10 0x00007fe32b4b32fd in ?? () from /usr/lib/jvm/java-8-openjdk-amd64//jre/lib/amd64/server/libjvm.so #11 0x00007fe318282b10 in ?? () #12 0x0000000000000010 in ?? () #13 0x00007fe31840c5c8 in ?? () #14 0x00000000fefd3110 in ?? () {code} there was no method from native client shown above. > [C++] Segfaults from JNI > ------------------------ > > Key: HBASE-18541 > URL: https://issues.apache.org/jira/browse/HBASE-18541 > Project: HBase > Issue Type: Sub-task > Reporter: Enis Soztutar > Assignee: Ted Yu > > retry-test and multi-retry-test fails flakily when run with > {code} > buck test --all --no-results-cache > {code} > or when run in a loop: > {code} > for i in `seq 1 10`; do buck test --no-results-cache core:retry-test || break > 1; done > {code} > The problem seems to be within the JNI internals and usually happens at the > create table method call. I was not able to inspect much, but the comments in > our mini-cluster indicate that we may need to use global references instead > of local ones. I suspect the problem happens when there is a GC run for the > test since the failure happens usually after some time (but almost always in > create table method). > [~ted_yu] do you mind taking a look at this. -- This message was sent by Atlassian JIRA (v6.4.14#64029)