[jira] [Commented] (HBASE-20188) [TESTING] Performance

Anastasia Braginsky (JIRA) Sun, 18 Mar 2018 01:45:56 -0700

    [ 
https://issues.apache.org/jira/browse/HBASE-20188?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16403920#comment-16403920
 ]


Anastasia Braginsky commented on HBASE-20188:
---------------------------------------------

{quote}With default memstore, it just a matter of iterating over a map and 
write cells. But now we have to read from multiple segments in a heap way and 
so more compares there.
{quote}
Not surprised with the performance degradation. I saw the flushes not fast 
enough in all the tests I have done. But I do not think it is due to multiple 
segments in the snapshot, at least I tried to run it with 
hbase.hregion.compacting.pipeline.segments.limit equals 1 (single segment per 
snapshot) and saw no much difference. But you can try it yourself, of course.
{quote}  I was wondering what your thoughts were regards our doing MORE 
aggressive in-memory compaction moving Cells from CSLM to your flat structures, 
could it save on the number of overall compares (and hence CPU) or, if not on 
compares, overhead from the CSLM itself shows up as a pretty big CPU user too? 
What you reckon?
{quote}
Indeed (as Eshcar said) the only way to do more in-memory compaction and 
transfer more cells to immutable segments (out of CSLM) is to use 
hbase.memstore.inmemoryflush.threshold.factor to be as small percentage as 
possible. But if you also keep hbase.hregion.compacting.pipeline.segments.limit 
equals 1, it might end with too much segment merges. But we can check this 
option as well.

 

Bottom line, when I was trying write-only scenario on both SSD and HDD I saw it 
consistently hitting "global pressure" mark. Is the Eshcar's fix to HBASE-18294 
included?

> [TESTING] Performance
> ---------------------
>
>                 Key: HBASE-20188
>                 URL: https://issues.apache.org/jira/browse/HBASE-20188
>             Project: HBase
>          Issue Type: Umbrella
>          Components: Performance
>            Reporter: stack
>            Priority: Critical
>             Fix For: 2.0.0
>
>         Attachments: flamegraph-1072.1.svg, flamegraph-1072.2.svg, tree.txt
>
>
> How does 2.0.0 compare to old versions? Is it faster, slower? There is rumor 
> that it is much slower, that the problem is the asyncwal writing. Does 
> in-memory compaction slow us down or speed us up? What happens when you 
> enable offheaping?
> Keep notes here in this umbrella issue. Need to be able to say something 
> about perf when 2.0.0 ships.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

[jira] [Commented] (HBASE-20188) [TESTING] Performance

Reply via email to