[
https://issues.apache.org/jira/browse/CASSANDRA-1155?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Brandon Williams updated CASSANDRA-1155:
----------------------------------------
Attachment: 1155-v5.txt
v5 moves the stats write after the descriptor is renamed, because
persistSSTableStatistics takes care not to store stats for temporary files.
Also adds StatisticsTable back which was accidentally removed in v4, fixes some
bugs in EH/CFS min/mean/max calculation, and has SSTW pass its EHs directly to
SSTR when calling internalOpen. Adds a test for SSTR providing stats, and more
tests for EH.
> keep persistent row statistics
> ------------------------------
>
> Key: CASSANDRA-1155
> URL: https://issues.apache.org/jira/browse/CASSANDRA-1155
> Project: Cassandra
> Issue Type: Sub-task
> Components: Core
> Reporter: Jonathan Ellis
> Assignee: Brandon Williams
> Fix For: 0.7 beta 1
>
> Attachments: 1155-v2.txt, 1155-v3.txt, 1155-v4.txt, 1155-v5.txt,
> 1155.txt
>
>
> during flush and compaction we should keep row size statistics using
> EstimatedHistogram (column count, and row size), replacing min/max/total
> sizes in CFS.
> having this detail will let us estimate, given an index CF, how many nodes we
> need to query to get the number of matching rows requested by the client.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.