[ 
https://issues.apache.org/jira/browse/CASSANDRA-792?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12833657#action_12833657
 ] 

Stu Hood commented on CASSANDRA-792:
------------------------------------

> how is this different from existing min/max compaction files? sounds like 
> just a rename to me from that quote. 
There is one other difference: mergeFactor also determines the size of the 
buckets that sstables are compacted in.

> My initial reaction is that this is not something we want to do.
I think we may be forced to. Without throttling writes to a level that 
compactions can sustain, you can run into situations where you have many 
thousands of sstables on disk, which leads to abysmal read performance. If 
people _want_ to be in this situation, then we can allow them to disable 
compaction, but otherwise, compaction needs to be able to keep up.

> Configurable 'merge factor' for Column Families
> -----------------------------------------------
>
>                 Key: CASSANDRA-792
>                 URL: https://issues.apache.org/jira/browse/CASSANDRA-792
>             Project: Cassandra
>          Issue Type: New Feature
>            Reporter: Stu Hood
>             Fix For: 0.6
>
>         Attachments: 
> 0001-Use-logarithmic-bucket-sizes-and-always-compact-the-.patch, 
> 0002-Cap-memory-usage-for-all-SSTables-being-compacted.patch, 
> 0003-Hardcoded-per-cf-merge-factor.patch, 0004-Configurable-MergeFactor.patch
>
>
> Apache Lucene flushes and merges 'segments' in a manner that is quite similar 
> to the treatment of Cassandra's sstables.
> One interesting and powerful tunable that Lucene provides is 'mergeFactor':
> EDIT: better link: 
> http://hudson.zones.apache.org/hudson/job/Lucene-trunk/javadoc/core/org/apache/lucene/index/IndexWriter.html#setMergeFactor(int)

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to