[
https://issues.apache.org/jira/browse/CASSANDRA-3141?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13100449#comment-13100449
]
Benoit Perroud commented on CASSANDRA-3141:
-------------------------------------------
Iterating through a list of 1'000'000 elements obviously takes time.
But I agree with both of you:
- it's a premature optimization, I will try with CASSANDRA-2843 first
- the way I use SSTSUW (SSTableSimpleUnsortedWriter) is not completely
appropriate; I get much better results doing one key after the other.
> SSTableSimpleUnsortedWriter call to ColumnFamily.serializedSize iterate
> through the whole columns
> -------------------------------------------------------------------------------------------------
>
> Key: CASSANDRA-3141
> URL: https://issues.apache.org/jira/browse/CASSANDRA-3141
> Project: Cassandra
> Issue Type: Improvement
> Components: Core
> Affects Versions: 0.8.3
> Reporter: Benoit Perroud
> Priority: Minor
> Fix For: 0.8.6
>
> Attachments: CachedSizeCF.patch
>
>
> Every time newRow is called, serializedSize iterates through all the columns
> to compute the size.
> Once 1'000'000 columns exist in the CF, it becomes painful to repeat the same
> computation at every iteration. Caching the size and incrementing it when a
> Column is added could be an option.
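
For illustration, the caching idea from the description could be sketched
roughly as below. This is not the attached CachedSizeCF.patch and not
Cassandra's actual ColumnFamily code: the class SizeCachingRow, its methods,
and the per-column byte layout are all hypothetical, chosen only to show the
size being kept up to date on each add instead of recomputed by a full scan.

```java
import java.util.ArrayList;
import java.util.List;

/** Hypothetical stand-in for a column container; NOT Cassandra's ColumnFamily. */
class SizeCachingRow {
    // Each entry holds {name, value}; a real column family stores richer objects.
    private final List<byte[][]> columns = new ArrayList<>();

    // Running total, updated on every add so that reads are O(1).
    private long cachedSize = 0;

    /** Assumed layout: 2-byte name length + name + 4-byte value length + value. */
    static long columnSize(byte[] name, byte[] value) {
        return 2 + name.length + 4 + value.length;
    }

    /** Appends a column and increments the cached size by that column's size. */
    void addColumn(byte[] name, byte[] value) {
        columns.add(new byte[][] { name, value });
        cachedSize += columnSize(name, value);
    }

    /** Returns the cached size without touching the column list. */
    long serializedSize() {
        return cachedSize;
    }

    /** The full scan the issue describes, kept here only for comparison. */
    long recomputedSize() {
        long size = 0;
        for (byte[][] column : columns) {
            size += columnSize(column[0], column[1]);
        }
        return size;
    }
}
```

Note that this sketch only appends. If adding a column can replace an existing
one with the same name, the old column's size would have to be subtracted as
well, otherwise the cached value drifts from the real one.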