There is some point where you simply need more machines.

On Mon, Jan 7, 2013 at 5:02 PM, Michael Kjellman <mkjell...@barracuda.com>wrote:

> Right, I guess I'm saying that you should try loading your data with
> leveled compaction and see how your compaction load is.
>
> Your work load sounds like leveled will fit much better than size tiered.
>
> From: Brian Tarbox <tar...@cabotresearch.com>
> Reply-To: "user@cassandra.apache.org" <user@cassandra.apache.org>
> Date: Monday, January 7, 2013 1:58 PM
> To: "user@cassandra.apache.org" <user@cassandra.apache.org>
> Subject: Re: help turning compaction..hours of run to get 0%
> compaction....
>
> The problem I see is that it already takes me more than 24 hours just to
> load my data...during which time the logs say I'm spending tons of time
> doing compaction.  For example in the last 72 hours I'm consumed* 20 hours
> * per machine on compaction.
>
> Can I conclude from that than I should be (perhaps drastically) increasing
> my compaction_mb_per_sec on the theory that I'm getting behind?
>
> The fact that it takes me 3 days or more to run a test means its hard to
> just play with values and see what works best, so I'm trying to understand
> the behavior in detail.
>
> Thanks.
>
> Brain
>
>
> On Mon, Jan 7, 2013 at 4:13 PM, Michael Kjellman 
> <mkjell...@barracuda.com>wrote:
>
>> http://www.datastax.com/dev/blog/when-to-use-leveled-compaction
>>
>> "If you perform at least twice as many reads as you do writes, leveled
>> compaction may actually save you disk I/O, despite consuming more I/O for
>> compaction. This is especially true if your reads are fairly random and
>> don’t focus on a single, hot dataset."
>>
>> From: Brian Tarbox <tar...@cabotresearch.com>
>> Reply-To: "user@cassandra.apache.org" <user@cassandra.apache.org>
>> Date: Monday, January 7, 2013 12:56 PM
>> To: "user@cassandra.apache.org" <user@cassandra.apache.org>
>> Subject: Re: help turning compaction..hours of run to get 0%
>> compaction....
>>
>> I have not specified leveled compaction so I guess I'm defaulting to size
>> tiered?  My data (in the column family causing the trouble) insert once,
>> ready many, update-never.
>>
>> Brian
>>
>>
>> On Mon, Jan 7, 2013 at 3:13 PM, Michael Kjellman <mkjell...@barracuda.com
>> > wrote:
>>
>>> Size tiered or leveled compaction?
>>>
>>> From: Brian Tarbox <tar...@cabotresearch.com>
>>> Reply-To: "user@cassandra.apache.org" <user@cassandra.apache.org>
>>> Date: Monday, January 7, 2013 12:03 PM
>>> To: "user@cassandra.apache.org" <user@cassandra.apache.org>
>>> Subject: help turning compaction..hours of run to get 0% compaction....
>>>
>>> I have a column family where I'm doing 500 inserts/sec for 12 hours or
>>> so at time.  At some point my performance falls off a cliff due to time
>>> spent doing compactions.
>>>
>>> I'm seeing row after row of logs saying that after 1 or 2 hours of
>>> compactiing it reduced to 100% of 99% of the original.
>>>
>>> I'm trying to understand what direction this data points me to in term
>>> of configuration change.
>>>
>>>    a) increase my compaction_throughput_mb_per_sec because I'm falling
>>> behind (am I falling behind?)
>>>
>>>    b) enable multi-threaded compaction?
>>>
>>> Any help is appreciated.
>>>
>>> Brian
>>>
>>> ----------------------------------
>>> Join Barracuda Networks in the fight against hunger.
>>> To learn how you can help in your community, please visit:
>>> http://on.fb.me/UAdL4f
>>>   ­­
>>>
>>
>>
>> ----------------------------------
>> Join Barracuda Networks in the fight against hunger.
>> To learn how you can help in your community, please visit:
>> http://on.fb.me/UAdL4f
>>   ­­
>>
>
>
> ----------------------------------
> Join Barracuda Networks in the fight against hunger.
> To learn how you can help in your community, please visit:
> http://on.fb.me/UAdL4f
>   ­­
>

Reply via email to