[ 
https://issues.apache.org/jira/browse/CASSANDRA-11561?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15239320#comment-15239320
 ] 

T Jake Luciani commented on CASSANDRA-11561:
--------------------------------------------

Branch is https://github.com/tjake/cassandra/tree/faster-streaming

3.5 test:
Finished Streaming in 25 seconds: 18.92 Mb/sec

branch test:
Finished Streaming in 19 seconds: 25.04 Mb/sec

> Improve Streaming Performance
> -----------------------------
>
>                 Key: CASSANDRA-11561
>                 URL: https://issues.apache.org/jira/browse/CASSANDRA-11561
>             Project: Cassandra
>          Issue Type: Bug
>          Components: Streaming and Messaging
>            Reporter: T Jake Luciani
>            Assignee: T Jake Luciani
>             Fix For: 3.x
>
>
> Inspired by CASSANDRA-9766 I've taken a look at the streaming path and have 
> been able to improve Streaming performance 25% with the following:
> * CompressedStreamReader had only implemented reading a single byte at a 
> time. I added the read(byte[], int, int) method.
> * When writing to sstable, rather than calculate size of row, write the size 
> then write the row (which causes 2x the cpu). Serialize the row to memory 
> then write size of memory buffer and copy buffer to disk.
> * Added object recycling of the largest garbage sources.  Namely, 
> BTreeSearchIterator, and DataOutputBuffer (for above fix). There are still a 
> few more places recycling would help/ like the Object[] in BTree.Builder
> * Changed all ThreadLocals to use FastThreadLocal from netty, and 
> subsequently adding FastThreadLocalThreads for all internal threads.
> There are still more things todo here, like we generate tons of garbage 
> boxing types for StreamingHistogram.
> I've added a long test to stream and compact a large sstable.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to