Re: How to make Cassandra flush CommitLog files more frequently?

2021-05-06 Thread Kane Wilson
> > Actually, the design and initiation of our service is based on the fact > that the CDC feature in Cassandra is used for streaming data changes in > Cassandra with low latency. If this is not the case, may I understand > what's the purpose and the intended use case for the CDC feature in >

Re: How to make Cassandra flush CommitLog files more frequently?

2021-05-05 Thread vytenis silgalis
I believe you could set your tables to flush to disk at specific intervals (memtable_flush_period_in_ms), note that you'd have to set this for all tables (not just the CDC enabled tables) to ensure that commitlog files are flushed to the cdc_raw directory. Or as Dhanunjaya noted you could just

Re: How to make Cassandra flush CommitLog files more frequently?

2021-05-04 Thread Dhanunjaya Tokala
One way to flush commitlog is nodetool flush On Cassandra nodes . On Tue, May 4, 2021 at 3:58 PM Bingqin Zhou wrote: > Hi Kane, > > Thank you for the insights! > > Reducing the total space on its own will help, however definitely test >> this as such a large drop could result in a massive

Re: How to make Cassandra flush CommitLog files more frequently?

2021-05-04 Thread Bingqin Zhou
Hi Kane, Thank you for the insights! Reducing the total space on its own will help, however definitely test this > as such a large drop could result in a massive increase in SSTables and > thus compaction overhead. You'll in general want to look into any property > that makes memtables flush

Re: How to make Cassandra flush CommitLog files more frequently?

2021-05-03 Thread Kane Wilson
(removing dev) commitlog_segment_size_in_mb isn't going to help, in fact you probably don't want to modify this as it'll reduce the maximum size of your mutations. Reducing the total space on its own will help, however definitely test this as such a large drop could result in a massive increase

How to make Cassandra flush CommitLog files more frequently?

2021-05-03 Thread Bingqin Zhou
Hi, We're working with the CDC feature to develop an agent to stream changes in Cassandra DB into Kafka. However, the CDC feature doesn't work well for us so far because CommitLog files are rarely flushed into cdc_raw directory, and the frequency can be as low as a few months. Is there any