Jacek Lewandowski created CASSANDRA-18201:
---------------------------------------------

             Summary: Store min and max partition key in sstable stats metadata 
rather than in some index component
                 Key: CASSANDRA-18201
                 URL: https://issues.apache.org/jira/browse/CASSANDRA-18201
             Project: Cassandra
          Issue Type: Improvement
          Components: Local/SSTable
            Reporter: Jacek Lewandowski
            Assignee: Jacek Lewandowski


Currently min and max partition key is stored in the index summary.

Firstly, that informat better fits stats metadata as there are other similar 
statistics (like min and max clusterings)

Secondly, opening index summary is costly. Though, index summary and the index 
itself are loaded upon opening an sstable just because we need to read min and 
max partition keys. Min and max partition keys need to be know for an sstable 
so that when some data are queried, we can select the sstable which may contain 
that data - that is, whether the queried partition is included in the min/max 
key range of the sstable.

With the proposed solution, we could postpone loading index components to the 
time when the data from such sstable is really requested. It will be enough to 
read lightweight stats metadata as it will be sufficient to know everything 
about data ranges included in that sstable. It will also let to save memory 
used by those components until data from those sstables are requested.

 



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to