[jira] [Commented] (CASSANDRA-15389) Minimize BTree iterator allocations

Benedict Elliott Smith (Jira) Fri, 10 Jan 2020 17:37:55 -0800


    [ 
https://issues.apache.org/jira/browse/CASSANDRA-15389?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17013314#comment-17013314
 ]


Benedict Elliott Smith commented on CASSANDRA-15389:
----------------------------------------------------

Sure, it's probably easier to show, so take a look at [this 
branch|https://github.com/belliottsmith/cassandra/commits/15389-suggest].  It's 
late and my OCD got the better of me, so I also tweaked the method parameter 
order so that parameters are grouped by recipient, with each function followed 
by its own argument, i.e. they go {{accumulator, accumulatorArg, Comparator, 
comparatorArg, initialValue}}.  Arguably {{initialValue}} should go after 
{{accumulatorArg}}, but it felt right to keep it consistently at the end.  Feel 
free to use or discard whatever you like.
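
To make the parameter order above concrete, here is a rough sketch. It is not code from the branch: the class name, the {{BiLongAccumulator}} interface shape and the sorted array standing in for a btree are all assumptions made for illustration.

```java
import java.util.Arrays;
import java.util.Comparator;

// Hypothetical sketch: none of these names are taken from the patch or the branch.
final class AccumulateSketch
{
    /** Folds one element into a running long; the extra argument avoids a capturing lambda. */
    interface BiLongAccumulator<A, V>
    {
        long apply(A arg, V value, long current);
    }

    /**
     * Walks a sorted array (standing in for a btree) from the first element >= from,
     * or from the beginning when from is null, threading a long through the accumulator.
     * Parameter order: accumulator, accumulatorArg, comparator, comparatorArg, initialValue.
     * Assumes the array holds distinct elements sorted by the given comparator.
     */
    static <A, V> long accumulate(V[] sorted,
                                  BiLongAccumulator<A, V> accumulator, A accumulatorArg,
                                  Comparator<? super V> comparator, V from,
                                  long initialValue)
    {
        int start = 0;
        if (from != null)
        {
            int idx = Arrays.binarySearch(sorted, from, comparator);
            start = idx >= 0 ? idx : -(idx + 1); // insertion point when from is absent
        }
        long value = initialValue;
        for (int i = start; i < sorted.length; i++)
            value = accumulator.apply(accumulatorArg, sorted[i], value);
        return value;
    }

    public static void main(String[] args)
    {
        String[] names = { "a", "bb", "ccc", "dddd" };
        // Non-capturing lambda: the weight arrives through accumulatorArg.
        long total = accumulate(names,
                                (Integer weight, String s, long v) -> v + weight * s.length(), 2,
                                Comparator.<String>naturalOrder(), "bb",
                                0L);
        System.out.println(total);
    }
}
```

Each function is immediately followed by the argument handed to it, and the running value comes last, which is the grouping described above.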

> Minimize BTree iterator allocations
> -----------------------------------
>
>                 Key: CASSANDRA-15389
>                 URL: https://issues.apache.org/jira/browse/CASSANDRA-15389
>             Project: Cassandra
>          Issue Type: Sub-task
>          Components: Local/Compaction
>            Reporter: Blake Eggleston
>            Assignee: Blake Eggleston
>            Priority: Normal
>             Fix For: 4.0
>
>
> Allocations of BTree iterators contribute a large amount of garbage to the 
> compaction and read paths.
> This patch removes most btree iterator allocations on hot paths by:
>  • using Row#apply where appropriate on frequently called methods 
> (Row#digest, Row#validateData)
>  • adding a BTree accumulate method. Like the apply method, this method walks 
> the btree with a function, but one that takes and returns a long argument. 
> This eliminates iterator allocations without adding helper object allocations 
> (BTreeRow#hasComplex, BTreeRow#hasInvalidDeletions, BTreeRow#dataSize, 
> BTreeRow#unsharedHeapSizeExcludingData, Rows#collectStats, 
> UnfilteredSerializer#serializedRowBodySize), and it also eliminates the 
> allocation of helper objects in places where apply was used previously^[1]^.
>  • creating a map of columns in SerializationHeader, which lets us avoid 
> allocating a btree search iterator for each row we serialize.
> These optimizations reduce garbage created during compaction by up to 13.5%.
>  
> [1] the memory test does measure memory allocated by lambdas capturing objects
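
The column map idea in the description can be sketched roughly as follows. This is only an illustration of the general technique (build a lookup once, probe it per row); the class name, the use of {{HashMap}} and the string column names are assumptions, not the patch's actual SerializationHeader code.

```java
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Hypothetical sketch: class and method names are illustrative, not Cassandra's.
final class HeaderSketch
{
    private final Map<String, Integer> positionByColumn = new HashMap<>();

    /** Builds the lookup once, when the header is created. */
    HeaderSketch(List<String> sortedColumns)
    {
        for (int i = 0; i < sortedColumns.size(); i++)
            positionByColumn.put(sortedColumns.get(i), i);
    }

    /** Per-row lookup: a hash probe, with no search iterator created; -1 if absent. */
    int positionOf(String column)
    {
        Integer position = positionByColumn.get(column);
        return position == null ? -1 : position;
    }
}
```

The one-time cost of building the map is paid per header rather than per serialized row.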



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

