[ 
https://issues.apache.org/jira/browse/CASSANDRA-1155?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jonathan Ellis updated CASSANDRA-1155:
--------------------------------------

    Attachment: 1155-v2.txt

v2 does some minor cleanup.

I started replacing the CFS min/max with data aggregated from the histograms 
(min/max is not persistent, so the histograms are more useful), but the 
histograms are part of Writer rather than the base SSTable class.  They need to 
be available to the Reader to be useful for CASSANDRA-749.  (Although loading 
them as a simple long[] in Reader might be better, to avoid unnecessary 
synchronization overhead on a read-only structure.)
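
To make the long[] idea above concrete, here is a rough sketch of what a 
read-only histogram on the Reader side could look like, with min/max derived 
from the buckets and aggregated across sstables by merging.  The class name 
HistogramSnapshot, its methods, and the bucket layout are all hypothetical; 
this is not the EstimatedHistogram API and not what the attached patch does.

```java
import java.util.Arrays;

/**
 * Hypothetical read-only snapshot of a bucketed histogram (illustration only).
 * The arrays are copied once in the constructor and never mutated afterwards,
 * so readers need no locks or atomics.
 */
final class HistogramSnapshot
{
    private final long[] offsets; // upper bound of each bucket, ascending (assumed layout)
    private final long[] counts;  // counts[i] = number of values that fell into bucket i

    HistogramSnapshot(long[] offsets, long[] counts)
    {
        if (offsets.length != counts.length)
            throw new IllegalArgumentException("offsets and counts must have the same length");
        this.offsets = offsets.clone();
        this.counts = counts.clone();
    }

    /** Upper bound of the first non-empty bucket, or 0 if the histogram is empty. */
    long min()
    {
        for (int i = 0; i < counts.length; i++)
            if (counts[i] > 0)
                return offsets[i];
        return 0;
    }

    /** Upper bound of the last non-empty bucket, or 0 if the histogram is empty. */
    long max()
    {
        for (int i = counts.length - 1; i >= 0; i--)
            if (counts[i] > 0)
                return offsets[i];
        return 0;
    }

    /** Total number of recorded values. */
    long total()
    {
        long sum = 0;
        for (long c : counts)
            sum += c;
        return sum;
    }

    /** Combine two snapshots with identical buckets, e.g. to aggregate across sstables. */
    static HistogramSnapshot merge(HistogramSnapshot a, HistogramSnapshot b)
    {
        if (!Arrays.equals(a.offsets, b.offsets))
            throw new IllegalArgumentException("bucket offsets differ");
        long[] merged = new long[a.counts.length];
        for (int i = 0; i < merged.length; i++)
            merged[i] = a.counts[i] + b.counts[i];
        return new HistogramSnapshot(a.offsets, merged);
    }

    public static void main(String[] args)
    {
        long[] offsets = { 1, 2, 3, 4, 5, 6, 7, 8, 10, 12 };
        HistogramSnapshot first = new HistogramSnapshot(offsets, new long[]{ 0, 3, 0, 1, 0, 0, 0, 0, 0, 0 });
        HistogramSnapshot second = new HistogramSnapshot(offsets, new long[]{ 0, 0, 0, 0, 2, 0, 0, 0, 1, 0 });
        HistogramSnapshot all = merge(first, second);
        // prints "min=2 max=10 total=7"
        System.out.println("min=" + all.min() + " max=" + all.max() + " total=" + all.total());
    }
}
```

Note that min/max computed this way are only as precise as the bucket 
boundaries, which is the price of getting them from persisted histograms 
instead of tracking exact values in memory.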


> keep persistent row statistics
> ------------------------------
>
>                 Key: CASSANDRA-1155
>                 URL: https://issues.apache.org/jira/browse/CASSANDRA-1155
>             Project: Cassandra
>          Issue Type: Sub-task
>          Components: Core
>            Reporter: Jonathan Ellis
>            Assignee: Brandon Williams
>             Fix For: 0.7
>
>         Attachments: 1155-v2.txt, 1155.txt
>
>
> During flush and compaction we should keep row statistics (column count and 
> row size) using EstimatedHistogram, replacing the min/max/total sizes in CFS.
> Having this detail will let us estimate, given an index CF, how many nodes we 
> need to query to get the number of matching rows requested by the client.
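
One way the estimate described above might work, as a sketch only: take the 
mean column count of the index CF as the expected number of matches per node, 
then divide the client's requested row count by it.  The class 
IndexQueryEstimate, its method names, and the assumption that the mean column 
count approximates matches per node are all hypothetical and not taken from 
the ticket or the patch.

```java
/** Hypothetical helper (illustration only): estimate how many nodes an index query must touch. */
final class IndexQueryEstimate
{
    /**
     * Mean of a bucketed histogram, using each bucket's upper bound as its
     * representative value (so the result slightly overestimates).
     */
    static double mean(long[] offsets, long[] counts)
    {
        long total = 0;
        double sum = 0;
        for (int i = 0; i < counts.length; i++)
        {
            total += counts[i];
            sum += (double) offsets[i] * counts[i];
        }
        return total == 0 ? 0 : sum / total;
    }

    /**
     * Number of nodes to query so that the expected number of matches reaches
     * requestedRows, clamped to the range [1, clusterSize].
     */
    static int nodesToQuery(int requestedRows, double meanMatchesPerNode, int clusterSize)
    {
        if (meanMatchesPerNode <= 0)
            return clusterSize; // no statistics yet: fall back to asking every node
        long needed = (long) Math.ceil(requestedRows / meanMatchesPerNode);
        return (int) Math.max(1, Math.min(clusterSize, needed));
    }

    public static void main(String[] args)
    {
        // Assumed column-count histogram for an index CF: bucket upper bounds and counts.
        long[] offsets = { 10, 20, 40 };
        long[] counts = { 1, 2, 1 };
        double matchesPerNode = mean(offsets, counts);
        System.out.println("mean matches per node: " + matchesPerNode);
        System.out.println("nodes to query for 100 rows: " + nodesToQuery(100, matchesPerNode, 10));
    }
}
```

With the sample numbers the mean is 22.5 columns per index row, so a request 
for 100 rows would start with 5 of the 10 nodes rather than all of them.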

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
