[ 
https://issues.apache.org/jira/browse/CASSANDRA-5555?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Yuki Morishita reopened CASSANDRA-5555:
---------------------------------------


This fix can send wrong "estimated number of keys" for creating BF on the 
streamed node, since calculating estimate uses index summary.

My proposed fix is to make index summary completely optional. That is, when 
Summary.db file is present, load that and use it. We also add an option not to 
load Summary.db. And when the file is not present nor the user choose not to 
load the summary, we just scan sequentially on index file(Index.db) for 
"estimated number of keys".


                
> Allow sstableloader to handle a larger number of files
> ------------------------------------------------------
>
>                 Key: CASSANDRA-5555
>                 URL: https://issues.apache.org/jira/browse/CASSANDRA-5555
>             Project: Cassandra
>          Issue Type: Improvement
>          Components: Core, Tools
>            Reporter: Tyler Hobbs
>            Assignee: Jonathan Ellis
>             Fix For: 1.2.6
>
>         Attachments: 5555-01.txt, 5555-02.txt, CASSANDRA-5555.txt, 
> CASSANDRA-5555.txt
>
>
> With the default heap size, sstableloader will OOM when there are roughly 25k 
> files in the directory to load.  It's easy to reach this number of files in a 
> single LCS column family.
> By avoiding creating all SSTableReaders up front in SSTableLoader, we should 
> be able to increase the number of files that sstableloader can handle 
> considerably.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to