[ 
https://issues.apache.org/jira/browse/HBASE-6679?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13463548#comment-13463548
 ] 

Devaraj Das commented on HBASE-6679:
------------------------------------

bq. For sure the regions was not doubly-assigned? Split happened of the region 
on one server but on another server, the same region was being compacted? You'd 
need the master logs to figure it a dbl-assign

Unfortunately, didn't save the master logs when the failure happened.. 

bq. Can you figure a place where we'd be running compactions on a region 
concurrent w/ our splitting it? Compacting we take out write lock. Doesnt look 
like any locks while SplitTransaction is running (closing parent, it'll need 
write lock... thats after daughters open though).

I can't figure out a place where this could happen in the natural execution of 
the regionserver.

bq. Storefiles are an ImmutableList.

Yes.. but that still could be exposed to the problems of memory inconsistencies 
when multiple threads are accessing the object in unsynchronized/non-volatile 
ways, no?

bq. @Deva

After a long time, someone addressed me by that name :-)

bq. So before this itself the region got closed. I feel the store file list 
should have been updated by the time. No ?

Can't say Ram for sure. There is no guarantee unless the access (read/write) 
are synchronized or the field is declared volatile..

                
> RegionServer aborts due to race between compaction and split
> ------------------------------------------------------------
>
>                 Key: HBASE-6679
>                 URL: https://issues.apache.org/jira/browse/HBASE-6679
>             Project: HBase
>          Issue Type: Bug
>            Reporter: Devaraj Das
>            Assignee: Devaraj Das
>             Fix For: 0.92.3
>
>         Attachments: rs-crash-parallel-compact-split.log
>
>
> In our nightlies, we have seen RS aborts due to compaction and split racing. 
> Original parent file gets deleted after the compaction, and hence, the 
> daughters don't find the parent data file. The RS kills itself when this 
> happens. Will attach a snippet of the relevant RS logs.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to