[jira] [Commented] (CASSANDRA-4310) Multiple independent Level Compactions in Parallel(Useful for SSD).

sankalp kohli (JIRA) Mon, 18 Jun 2012 22:42:46 -0700

    [ 
https://issues.apache.org/jira/browse/CASSANDRA-4310?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13396529#comment-13396529
 ]


sankalp kohli commented on CASSANDRA-4310:
------------------------------------------

What you are saying is true. But the improvement I am saying has more than 
this. It also does compactions in parallel between different levels and also 
multiple compactions per level. So it will definitely speed things up. It is 
quite frustrating to see Disk not being fully used when you are using SSD.

Also like you said L0->L1 is the biggest bottleneck. This will help it in a 
way. So when L0(32 stable) gets merged with L1, then L1 will merge with L2 and 
so on. But with this, you will be doing L0-L1 compactions almost every cycle 
unless L1-L2 is happening. So when say L3 -> L4, L0->L1 compaction won't happen 
when it can. 

So this solution cannot help parallelize L0->L1, but it will help since it runs 
L0->L1 almost every time. It does not get blocked by compactions in higher 
levels. 
                
> Multiple independent Level Compactions in Parallel(Useful for SSD). 
> --------------------------------------------------------------------
>
>                 Key: CASSANDRA-4310
>                 URL: https://issues.apache.org/jira/browse/CASSANDRA-4310
>             Project: Cassandra
>          Issue Type: New Feature
>          Components: Core
>    Affects Versions: 1.1.1, 1.1.2
>            Reporter: sankalp kohli
>              Labels: compaction, features, leveled, performance, ssd
>
> Problem: If you are inserting data into cassandra and level compaction cannot 
> catchup, you will create lot of files in L0.  
> Here is a solution which will help here and also increase the performance of 
> level compaction.
> We can do many compactions in parallel for unrelated data.
> 1) For no over lapping levels. Ex: when L0 stable is compacting with L1, we 
> can do compactions in other levels like L2 and L3 if they are eligible.
> 2) We can also do compactions with files in L1 which are not participating in 
> L0 compactions.
> This is specially useful if you are using SSD and is not bottlenecked by IO. 
> I am seeing this issue in my cluster. The compactions pending are more than 
> 50k and the disk usage is not that much(I am using SSD).
> I am doing multithreaded to true and also not throttling the IO by putting 
> the value as 0. 
>  

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Commented] (CASSANDRA-4310) Multiple independent Level Compactions in Parallel(Useful for SSD).

Reply via email to