[
https://issues.apache.org/jira/browse/CASSANDRA-1155?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Jonathan Ellis updated CASSANDRA-1155:
--------------------------------------
Attachment: 1155-v2.txt
v2 does some minor cleanup.
I started replacing CFS min/max w/ aggregated data from histograms (min/max is
not persistent, so the latter is more useful) but the histograms are part of
Writer rather than base SSTable. They need to be available to the Reader to be
useful for CASSANDRA-749. (Although possibly loading them as a simple long[]
in Reader would be better, to avoid unnecessary synchronization overhead on a
read-only structure.)
> keep persistent row statistics
> ------------------------------
>
> Key: CASSANDRA-1155
> URL: https://issues.apache.org/jira/browse/CASSANDRA-1155
> Project: Cassandra
> Issue Type: Sub-task
> Components: Core
> Reporter: Jonathan Ellis
> Assignee: Brandon Williams
> Fix For: 0.7
>
> Attachments: 1155-v2.txt, 1155.txt
>
>
> during flush and compaction we should keep row size statistics using
> EstimatedHistogram (column count, and row size), replacing min/max/total
> sizes in CFS.
> having this detail will let us estimate, given an index CF, how many nodes we
> need to query to get the number of matching rows requested by the client.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.