T Jake Luciani created CASSANDRA-11561:
------------------------------------------
Summary: Improve Streaming Performance
Key: CASSANDRA-11561
URL: https://issues.apache.org/jira/browse/CASSANDRA-11561
Project: Cassandra
Issue Type: Bug
Components: Streaming and Messaging
Reporter: T Jake Luciani
Assignee: T Jake Luciani
Fix For: 3.x
Inspired by CASSANDRA-9766 I've taken a look at the streaming path and have
been able to improve Streaming performance 25% with the following:
* CompressedStreamReader had only implemented reading a single byte at a time.
I added the read(byte[], int, int) method.
* When writing to sstable, rather than calculate size of row, write the size
then write the row (which causes 2x the cpu). Serialize the row to memory then
write size of memory buffer and copy buffer to disk.
* Added object recycling of the largest garbage sources. Namely,
BTreeSearchIterator, and DataOutputBuffer (for above fix). There are still a
few more places recycling would help/ like the Object[] in BTree.Builder
* Changed all ThreadLocals to use FastThreadLocal from netty, and subsequently
adding FastThreadLocalThreads for all internal threads.
There are still more things todo here, like we generate tons of garbage boxing
types for StreamingHistogram.
I've added a long test to stream and compact a large sstable.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)