[
https://issues.apache.org/jira/browse/HBASE-16608?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15586767#comment-15586767
]
Anastasia Braginsky commented on HBASE-16608:
---------------------------------------------
Hi [~anoop.hbase] and [~ram_krish],
I first want to thank you for your unstoppable interest and support for this
big CompactingMemStore issue! You are getting everything very fast and see it
all in the deep details. Really pleasure to work with such smart people as you
are.
So you are saying that you want not to use any merge in order to have the
things simpler. Am I getting you right? I do not see how merges (especially if
done once in a while) can affect GC. The size of CellArrayMap is negligible
relative to the data size. It is reasonable to think that GC is busy with
releasing the chunks that are not accessible after flush to disk.
When we hold more data in the memory either due to index or data compaction we
flush bigger files to the disk and bigger chunks of memory are need to be freed
upon each flush to disk. This sounds like a reason for making GC to work
harder. If you see how merges directly affect garbage collection please
explain. All this makes me think that your suggestion (not to merge) will
result in the same GC behavior. We already had an implementation with
flattening only, and there (under stress) we have seen tens of segments in the
compaction pipeline. I wonder the performance of the gets and scans in this
structure. The process of flushing multiple segments to disk should also
prolong the flushing to disk, which is undesirable. When we had flushes waiting
for merge Ram had seen lots of blocking writes till the system became
unresponsive. So I do not clearly see why not-to-merge-at-all is a better
solution.
I am not attached to anything, but really trying to understand why one way is
better then the other. I mean can we measure the better quality of not merging
relative to intermediate merging?
Regarding the issue of N-MSLABs merge, raised in the RB, it looks irrelevant
now, till we decide whether to merge or not to merge. Thus let's discuss it
later. I also believe this is not such a big issue and we can arrange it this
way or another.
Best,
Anastasia
> Introducing the ability to merge ImmutableSegments without copy-compaction or
> SQM usage
> ---------------------------------------------------------------------------------------
>
> Key: HBASE-16608
> URL: https://issues.apache.org/jira/browse/HBASE-16608
> Project: HBase
> Issue Type: Sub-task
> Reporter: Anastasia Braginsky
> Assignee: Anastasia Braginsky
> Attachments: HBASE-16417-V02.patch, HBASE-16417-V04.patch,
> HBASE-16417-V06.patch, HBASE-16417-V07.patch, HBASE-16417-V08.patch,
> HBASE-16417-V10.patch, HBASE-16608-V01.patch, HBASE-16608-V03.patch,
> HBASE-16608-V04.patch, HBASE-16608-V08.patch
>
>
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)