[ https://issues.apache.org/jira/browse/HBASE-18541?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16121916#comment-16121916 ]
Ted Yu commented on HBASE-18541:
--------------------------------
Sometimes retry-test ended with the following output:
{code}
[ RUN ] AsyncRpcRetryTest.TestFailWithOperationTimeout
2017-08-10 17:01:54,177 INFO [RpcServer.FifoWFPBQ.default.handler=1,queue=1,port=41153] master.HMaster (HMaster.java:createTable(1530)) - Client=root//172.17.0.2 create 'table6', {NAME => 'd', BLOOMFILTER => 'NONE', VERSIONS => '1', IN_MEMORY => 'false', KEEP_DELETED_CELLS => 'FALSE', IN_MEMORY_COMPACTION => 'false', DATA_BLOCK_ENCODING => 'NONE', TTL => 'FOREVER', COMPRESSION => 'NONE', MIN_VERSIONS => '0', BLOCKCACHE => 'true', BLOCKSIZE => '65536', REPLICATION_SCOPE => '0'}
2017-08-10 17:01:54,285 INFO [ProcessThread(sid:0 cport:55375):] server.PrepRequestProcessor (PrepRequestProcessor.java:pRequest(651)) - Got user-level KeeperException when processing sessionid:0x15dcd181a4d0000 type:create cxid:0xb5 zxid:0x5a txntype:-1 reqpath:n/a Error Path:/hbase/table-lock/table6 Error:KeeperErrorCode = NoNode for /hbase/table-lock/table6
2017-08-10 17:01:54,495 INFO [RegionOpenAndInitThread-table6-1] regionserver.HRegion (HRegion.java:createHRegion(6282)) - creating HRegion table6 HTD == 'table6', {NAME => 'd', BLOOMFILTER => 'NONE', VERSIONS => '1', IN_MEMORY => 'false', KEEP_DELETED_CELLS => 'FALSE', IN_MEMORY_COMPACTION => 'false', DATA_BLOCK_ENCODING => 'NONE', TTL => 'FOREVER', COMPRESSION => 'NONE', MIN_VERSIONS => '0', BLOCKCACHE => 'true', BLOCKSIZE => '65536', REPLICATION_SCOPE => '0'} RootDir = file:/usr/src/hbase/hbase-native-client/target/test-data/2db36bc6-e3b0-433a-b812-fc48cd42fd23/.tmp Table name == table6
2017-08-10 17:01:54,542 INFO [RegionOpenAndInitThread-table6-1] regionserver.HRegion (HRegion.java:doClose(1590)) - Closed table6,,1502384514176.f9265e2eb45f3087f533c45ab1eaaaa5.
2017-08-10 17:01:54,653 INFO [ProcedureExecutor-0] hbase.MetaTableAccessor (MetaTableAccessor.java:addRegionsToMeta(1571)) - Added 1
{code}
However, AsyncRpcRetryTest.TestFailWithOperationTimeout passes when run
individually.
> [C++] Segfaults from JNI
> ------------------------
>
> Key: HBASE-18541
> URL: https://issues.apache.org/jira/browse/HBASE-18541
> Project: HBase
> Issue Type: Sub-task
> Reporter: Enis Soztutar
> Assignee: Ted Yu
>
> retry-test and multi-retry-test fail intermittently when run with
> {code}
> buck test --all --no-results-cache
> {code}
> or when run in a loop:
> {code}
> for i in `seq 1 10`; do buck test --no-results-cache core:retry-test || break 1; done
> {code}
> The problem seems to be within the JNI internals and usually happens at the
> create table method call. I was not able to investigate much, but the comments
> in our mini-cluster indicate that we may need to use global references instead
> of local ones. I suspect the problem happens when a GC runs during the test,
> since the failure usually happens after some time (but almost always in the
> create table method).
> [~ted_yu] do you mind taking a look at this?