[ 
https://issues.apache.org/jira/browse/CASSANDRA-3141?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13100449#comment-13100449
 ] 

Benoit Perroud commented on CASSANDRA-3141:
-------------------------------------------

Iterating through a list of 1'000'000 elements obviously takes time.

But I agree with both of you:
- it's a premature optimization; I will try with CASSANDRA-2843 first
- the way I use SSTableSimpleUnsortedWriter (SSTSUW) is not really appropriate;
I get much better results processing one key after the other.


> SSTableSimpleUnsortedWriter call to ColumnFamily.serializedSize iterate 
> through the whole columns
> -------------------------------------------------------------------------------------------------
>
>                 Key: CASSANDRA-3141
>                 URL: https://issues.apache.org/jira/browse/CASSANDRA-3141
>             Project: Cassandra
>          Issue Type: Improvement
>          Components: Core
>    Affects Versions: 0.8.3
>            Reporter: Benoit Perroud
>            Priority: Minor
>             Fix For: 0.8.6
>
>         Attachments: CachedSizeCF.patch
>
>
> Every time newRow is called, serializedSize iterates through all the columns 
> to compute the size.
> Once 1'000'000 columns exist in the CF, it becomes painful to repeat the same 
> computation at every iteration. Caching the size and incrementing it when a 
> Column is added could be an option.
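
The caching idea in the description can be illustrated with a minimal sketch. It is not the attached CachedSizeCF.patch and does not use Cassandra's real classes: Column, SizedColumnFamily, and the size model (name bytes plus value bytes) are hypothetical stand-ins chosen only to show the difference between recomputing the size and maintaining a running total.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical model, not Cassandra code: shows incremental size tracking.
final class CachedSizeSketch {

    /** Stand-in for a column; the size model (name + value bytes) is an assumption. */
    static final class Column {
        final byte[] name;
        final byte[] value;

        Column(byte[] name, byte[] value) {
            this.name = name;
            this.value = value;
        }

        long size() {
            return name.length + value.length;
        }
    }

    /** Stand-in for a column family that keeps a running total of its size. */
    static final class SizedColumnFamily {
        private final List<Column> columns = new ArrayList<>();
        private long cachedSize = 0;

        /** Appends a column and updates the cached size in O(1). */
        void addColumn(Column c) {
            columns.add(c);
            cachedSize += c.size();
        }

        /** O(1): returns the running total instead of walking the columns. */
        long serializedSize() {
            return cachedSize;
        }

        /** O(n): the recompute-on-every-call behaviour the issue describes. */
        long recomputeSize() {
            long total = 0;
            for (Column c : columns) {
                total += c.size();
            }
            return total;
        }
    }

    public static void main(String[] args) {
        SizedColumnFamily cf = new SizedColumnFamily();
        for (int i = 0; i < 1000; i++) {
            cf.addColumn(new Column(new byte[4], new byte[8]));
        }
        // Both methods agree; only their cost per call differs.
        System.out.println(cf.serializedSize() == cf.recomputeSize());
    }
}
```

This sketch only appends columns. If a column with the same name can be overwritten, the cached total would also need the old column's size subtracted before the new one is added.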

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

        
