[jira] [Commented] (HBASE-18541) [C++] Segfaults from JNI

Ted Yu (JIRA) Thu, 10 Aug 2017 10:04:19 -0700

    [ 
https://issues.apache.org/jira/browse/HBASE-18541?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16121916#comment-16121916
 ]


Ted Yu commented on HBASE-18541:
--------------------------------

Sometimes the retry-test ended with:
{code}
[ RUN      ] AsyncRpcRetryTest.TestFailWithOperationTimeout
2017-08-10 17:01:54,177 INFO  
[RpcServer.FifoWFPBQ.default.handler=1,queue=1,port=41153] master.HMaster 
(HMaster.java:createTable(1530)) - Client=root//172.17.0.2 create 'table6', 
{NAME => 'd', BLOOMFILTER => 'NONE', VERSIONS => '1', IN_MEMORY => 'false', 
KEEP_DELETED_CELLS => 'FALSE', IN_MEMORY_COMPACTION => 'false', 
DATA_BLOCK_ENCODING => 'NONE', TTL => 'FOREVER', COMPRESSION => 'NONE', 
MIN_VERSIONS => '0', BLOCKCACHE => 'true', BLOCKSIZE => '65536', 
REPLICATION_SCOPE => '0'}
2017-08-10 17:01:54,285 INFO  [ProcessThread(sid:0 cport:55375):] 
server.PrepRequestProcessor (PrepRequestProcessor.java:pRequest(651)) - Got 
user-level KeeperException when processing sessionid:0x15dcd181a4d0000 
type:create cxid:0xb5 zxid:0x5a txntype:-1 reqpath:n/a Error 
Path:/hbase/table-lock/table6 Error:KeeperErrorCode = NoNode for 
/hbase/table-lock/table6
2017-08-10 17:01:54,495 INFO  [RegionOpenAndInitThread-table6-1] 
regionserver.HRegion (HRegion.java:createHRegion(6282)) - creating HRegion 
table6 HTD == 'table6', {NAME => 'd', BLOOMFILTER => 'NONE', VERSIONS => '1', 
IN_MEMORY => 'false', KEEP_DELETED_CELLS => 'FALSE', IN_MEMORY_COMPACTION => 
'false', DATA_BLOCK_ENCODING => 'NONE', TTL => 'FOREVER', COMPRESSION => 
'NONE', MIN_VERSIONS => '0', BLOCKCACHE => 'true', BLOCKSIZE => '65536', 
REPLICATION_SCOPE => '0'} RootDir = 
file:/usr/src/hbase/hbase-native-client/target/test-data/2db36bc6-e3b0-433a-b812-fc48cd42fd23/.tmp
 Table name == table6
2017-08-10 17:01:54,542 INFO  [RegionOpenAndInitThread-table6-1] 
regionserver.HRegion (HRegion.java:doClose(1590)) - Closed 
table6,,1502384514176.f9265e2eb45f3087f533c45ab1eaaaa5.
2017-08-10 17:01:54,653 INFO  [ProcedureExecutor-0] hbase.MetaTableAccessor 
(MetaTableAccessor.java:addRegionsToMeta(1571)) - Added 1
{code}
However, AsyncRpcRetryTest.TestFailWithOperationTimeout passes when run 
individually.

> [C++] Segfaults from JNI
> ------------------------
>
>                 Key: HBASE-18541
>                 URL: https://issues.apache.org/jira/browse/HBASE-18541
>             Project: HBase
>          Issue Type: Sub-task
>            Reporter: Enis Soztutar
>            Assignee: Ted Yu
>
> retry-test and multi-retry-test fails flakily when run with 
> {code}
> buck test --all --no-results-cache
> {code}
> or when run in a loop:
> {code}
> for i in `seq 1 10`; do buck test --no-results-cache core:retry-test || break 
> 1; done
> {code}
> The problem seems to be within the JNI internals and usually happens at the 
> create table method call. I was not able to inspect much, but the comments in 
> our mini-cluster indicate that we may need to use global references instead 
> of local ones. I suspect the problem happens when there is a GC run for the 
> test since the failure happens usually after some time (but almost always in 
> create table method). 
> [~ted_yu] do you mind taking a look at this. 



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

[jira] [Commented] (HBASE-18541) [C++] Segfaults from JNI

Reply via email to