Sam Lightfoot created CASSANDRA-21094:
-----------------------------------------

             Summary: Use POSIX_FADV_SEQUENTIAL for SSTable reads during 
compaction and streaming
                 Key: CASSANDRA-21094
                 URL: https://issues.apache.org/jira/browse/CASSANDRA-21094
             Project: Apache Cassandra
          Issue Type: Improvement
          Components: Local/Compaction
            Reporter: Sam Lightfoot
            Assignee: Sam Lightfoot
             Fix For: 5.x


If we use direct io to read SSTables during compaction, we can avoid polluting 
the page cache with data we're about to delete.  As another side effect, we 
also evict pages to make room for whatever we're putting in.  This unnecessary 
churn leads to higher CPU overhead and can cause dips in client read latency, 
as we're going to be evicting pages that could be used to serve those reads.

This is most notable with STCS as the SSTables get larger, potentially evicting 
the entire hot dataset out of cache, but is affected by every compaction 
strategy.

This is a follow up to be done after CASSANDRA-15452 since we will have an 
internal buffer.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to