I'll email the docs people. 

I believe they are saying "use compaction throttling rather than this" not 
"this does nothing"

Although I used this in the last month on a machine with very little ram to 
limit compaction memory use.

Cheers

-----------------
Aaron Morton
Freelance Cassandra Developer
New Zealand

@aaronmorton
http://www.thelastpickle.com

On 17/02/2013, at 7:05 AM, Alain RODRIGUEZ <arodr...@gmail.com> wrote:

> "Can you point to the docs."
> 
> http://www.datastax.com/docs/1.1/configuration/storage_configuration#max-compaction-threshold
> 
> And thanks about the rest of your answers, once again ;-).
> 
> Alain
> 
> 
> 2013/2/16 aaron morton <aa...@thelastpickle.com>
>>  Is that a feature that could possibly be developed one day ?
> No. 
> Timestamps are essentially internal implementation used to resolve different 
> values for the same column. 
> 
>> With "min_compaction_level_threshold" did you mean 
>> "min_compaction_threshold"  ? If so, why should I do that, what are the 
>> advantage/inconvenient of reducing this value ?
> 
> Yes, min_compaction_threshold, my bad. 
> If you have a wide row and delete a lot of values you will end up with a lot 
> of tombstones. These may dramatically reduce the read performance until they 
> are purged. Reducing the compaction threshold makes compaction happen more 
> frequently. 
> 
>> Looking at the doc I saw that: "max_compaction_threshold: Ignored in 
>> Cassandra 1.1 and later.". How to ensure that I'll always keep a small 
>> amount of SSTables then ?
> AFAIK it's not. 
> There may be some confusion about the location of the settings in CLI vs CQL. 
> Can you point to the docs. 
> 
> Cheers
> 
> -----------------
> Aaron Morton
> Freelance Cassandra Developer
> New Zealand
> 
> @aaronmorton
> http://www.thelastpickle.com
> 
> On 13/02/2013, at 10:14 PM, Alain RODRIGUEZ <arodr...@gmail.com> wrote:
> 
>> Hi Aaron, once again thanks for this answer.
>>> "So is it possible to delete all the data inserted in some CF between 2 
>>> dates or data older than 1 month ?"
>> "No. "
>> 
>> Why is there no way of deleting or getting data using the internal timestamp 
>> stored alongside of any inserted column (as described here: 
>> http://www.datastax.com/docs/1.1/ddl/column_family#standard-columns) ? Is 
>> that a feature that could possibly be developed one day ? It could be useful 
>> to perform delete of old data or to bring to a dev cluster just the last 
>> week of data for example.
>> 
>> With "min_compaction_level_threshold" did you mean 
>> "min_compaction_threshold"  ? If so, why should I do that, what are the 
>> advantage/inconvenient of reducing this value ?
>> 
>> Looking at the doc I saw that: "max_compaction_threshold: Ignored in 
>> Cassandra 1.1 and later.". How to ensure that I'll always keep a small 
>> amount of SSTables then ? Why is this deprecated ?
>> 
>> Alain
>> 
>> 
>> 2013/2/12 aaron morton <aa...@thelastpickle.com>
>>> So is it possible to delete all the data inserted in some CF between 2 
>>> dates or data older than 1 month ?
>> No. 
>> 
>> You need to issue row level deletes. If you don't know the row key you'll 
>> need to do range scans to locate them. 
>> 
>> If you are deleting parts of wide rows consider reducing the 
>> min_compaction_level_threshold on the CF to 2
>> 
>> Cheers
>> 
>> 
>> -----------------
>> Aaron Morton
>> Freelance Cassandra Developer
>> New Zealand
>> 
>> @aaronmorton
>> http://www.thelastpickle.com
>> 
>> On 12/02/2013, at 4:21 AM, Alain RODRIGUEZ <arodr...@gmail.com> wrote:
>> 
>>> Hi,
>>> 
>>> I would like to know if there is a way to delete old/unused data easily ?
>>> 
>>> I know about TTL but there are 2 limitations of TTL:
>>> 
>>> - AFAIK, there is no TTL on counter columns
>>> - TTL need to be defined at write time, so it's too late for data already 
>>> inserted.
>>> 
>>> I also could use a standard "delete" but it seems inappropriate for such a 
>>> massive.
>>> 
>>> In some cases, I don't know the row key and would like to delete all the 
>>> rows starting by, let's say, "1050#..." 
>>> 
>>> Even better, I understood that columns are always inserted in C* with 
>>> (name, value, timestamp). So is it possible to delete all the data inserted 
>>> in some CF between 2 dates or data older than 1 month ?
>>> 
>>> Alain
>> 
>> 
> 
> 

Reply via email to