You can replace the node directly why to add a node and decommission the
another node. Just replace the node with the new node and your topology
remains the same so no need to run the cleanup .

On Fri, May 5, 2023 at 10:26 AM Jaydeep Chovatia <chovatia.jayd...@gmail.com>
wrote:

> We use STCS, and our experience with *cleanup* is that it takes a long
> time to run in a 100-node cluster. We would like to replace one node every
> day for various purposes in our fleet.
>
> If we run *cleanup* after each node replacement, then it might take, say,
> 15 days to complete, and that hinders our node replacement frequency.
>
> Do you see any other options?
>
> Jaydeep
>
> On Thu, May 4, 2023 at 9:47 PM Jeff Jirsa <jji...@gmail.com> wrote:
>
>> You should 100% trigger cleanup each time or you’ll almost certainly
>> resurrect data sooner or later
>>
>> If you’re using leveled compaction it’s especially cheap. Stcs and twcs
>> are worse, but if you’re really scaling that often, I’d be considering lcs
>> and running cleanup just before or just after each scaling
>>
>> On May 4, 2023, at 9:25 PM, Jaydeep Chovatia <chovatia.jayd...@gmail.com>
>> wrote:
>>
>> 
>> Thanks, Jeff!
>> But in our environment we replace nodes quite often for various
>> optimization purposes, etc. say, almost 1 node per day (node *addition*
>> followed by node *decommission*, which of course changes the topology),
>> and we have a cluster of size 100 nodes with 300GB per node. If we have to
>> run cleanup on 100 nodes after every replacement, then it could take
>> forever.
>> What is the recommendation until we get this fixed in Cassandra itself as
>> part of compaction (w/o externally triggering *cleanup*)?
>>
>> Jaydeep
>>
>> On Thu, May 4, 2023 at 8:14 PM Jeff Jirsa <jji...@gmail.com> wrote:
>>
>>> Cleanup is fast and cheap and basically a no-op if you haven’t changed
>>> the ring
>>>
>>> After cassandra has transactional cluster metadata to make ring changes
>>> strongly consistent, cassandra should do this in every compaction. But
>>> until then it’s left for operators to run when they’re sure the state of
>>> the ring is correct .
>>>
>>>
>>>
>>> On May 4, 2023, at 7:41 PM, Jaydeep Chovatia <chovatia.jayd...@gmail.com>
>>> wrote:
>>>
>>> 
>>> Isn't this considered a kind of *bug* in Cassandra because as we know
>>> *cleanup* is a lengthy and unreliable operation, so relying on the
>>> *cleanup* means higher chances of data resurrection?
>>> Do you think we should discard the unowned token-ranges as part of the
>>> regular compaction itself? What are the pitfalls of doing this as part of
>>> compaction itself?
>>>
>>> Jaydeep
>>>
>>> On Thu, May 4, 2023 at 7:25 PM guo Maxwell <cclive1...@gmail.com> wrote:
>>>
>>>> compact ion will just merge duplicate data and remove delete data in
>>>> this node .if you add or remove one node for the cluster, I think clean up
>>>> is needed. if clean up failed, I think we should come to see the reason.
>>>>
>>>> Runtian Liu <curly...@gmail.com> 于2023年5月5日周五 06:37写道:
>>>>
>>>>> Hi all,
>>>>>
>>>>> Is cleanup the sole method to remove data that does not belong to a
>>>>> specific node? In a cluster, where nodes are added or decommissioned from
>>>>> time to time, failure to run cleanup may lead to data resurrection issues,
>>>>> as deleted data may remain on the node that lost ownership of certain
>>>>> partitions. Or is it true that normal compactions can also handle data
>>>>> removal for nodes that no longer have ownership of certain data?
>>>>>
>>>>> Thanks,
>>>>> Runtian
>>>>>
>>>>
>>>>
>>>> --
>>>> you are the apple of my eye !
>>>>
>>>

Reply via email to