T Jake Luciani created CASSANDRA-11561:
------------------------------------------

             Summary: Improve Streaming Performance
                 Key: CASSANDRA-11561
                 URL: https://issues.apache.org/jira/browse/CASSANDRA-11561
             Project: Cassandra
          Issue Type: Bug
          Components: Streaming and Messaging
            Reporter: T Jake Luciani
            Assignee: T Jake Luciani
             Fix For: 3.x


Inspired by CASSANDRA-9766 I've taken a look at the streaming path and have 
been able to improve Streaming performance 25% with the following:

* CompressedStreamReader had only implemented reading a single byte at a time. 
I added the read(byte[], int, int) method.

* When writing to sstable, rather than calculate size of row, write the size 
then write the row (which causes 2x the cpu). Serialize the row to memory then 
write size of memory buffer and copy buffer to disk.

* Added object recycling of the largest garbage sources.  Namely, 
BTreeSearchIterator, and DataOutputBuffer (for above fix). There are still a 
few more places recycling would help/ like the Object[] in BTree.Builder

* Changed all ThreadLocals to use FastThreadLocal from netty, and subsequently 
adding FastThreadLocalThreads for all internal threads.

There are still more things todo here, like we generate tons of garbage boxing 
types for StreamingHistogram.

I've added a long test to stream and compact a large sstable.







--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to