I am running a 3-node Cassandra cluster, version 3.0.9. About 200~300 million records are inserted into the cluster each day. As the data accumulates, it occupies more and more disk space. Currently, the data distribution across the nodes is as follows:

UN  172.20.5.4  2.5 TiB   256  66.3%  c5271e74-19a1-4cee-98d7-dc169cf87e95  rack1
UN  172.20.5.2  1.73 TiB  256  67.0%  c623bbc0-9839-4d2d-8ff3-db7115719d59  rack1
UN  172.20.5.3  1.86 TiB  256  66.7%  c555e44c-9590-4f45-aea4-f5eca68180b2  rack1

There is only one datacenter. The compaction strategy and related table settings are:

compaction = {'class': 'org.apache.cassandra.db.compaction.SizeTieredCompactionStrategy', 'max_threshold': '32', 'min_threshold': '12', 'tombstone_threshold': '0.1', 'unchecked_tombstone_compaction': 'true'}
AND compression = {'chunk_length_in_kb': '64', 'class': 'org.apache.cassandra.io.compress.LZ4Compressor'}
AND crc_check_chance = 1.0
AND dclocal_read_repair_chance = 0.1
AND default_time_to_live = 8640000
AND gc_grace_seconds = 432000

I would really like to know how to do routine Cassandra maintenance. The data seems to be growing faster, and the disks are under heavy load. Sometimes I also see inconsistent data: the same query returns two different results.

What should I do to keep the cluster healthy, and how should I maintain it? Any help would be much appreciated.

Thanks a lot!
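
For reference, the timing settings and insert rate quoted above can be put into more readable units with a small sketch. This is only illustrative arithmetic: the variable names are hypothetical, and the steady-state estimate assumes that every record is written with the table's default TTL and that the daily insert rate stays constant, neither of which is confirmed above.

```python
# Convert the table's timing settings (given in seconds) into days.
SECONDS_PER_DAY = 24 * 60 * 60

default_time_to_live = 8_640_000  # seconds, from the table definition
gc_grace_seconds = 432_000        # seconds, from the table definition

ttl_days = default_time_to_live / SECONDS_PER_DAY
gc_grace_days = gc_grace_seconds / SECONDS_PER_DAY

# Daily insert range quoted in the post (200~300 million records).
daily_low, daily_high = 200_000_000, 300_000_000

# Assumption: all records use the default TTL and the rate is constant,
# so the number of unexpired records is roughly rate * TTL in days.
live_low = int(daily_low * ttl_days)
live_high = int(daily_high * ttl_days)

# Total load reported across the three nodes, in TiB.
loads_tib = [2.5, 1.73, 1.86]
total_tib = round(sum(loads_tib), 2)

print(f"TTL: {ttl_days:g} days, gc_grace: {gc_grace_days:g} days")
print(f"Unexpired records at steady state: {live_low:,} to {live_high:,}")
print(f"Total reported load: {total_tib} TiB")
```

Under those assumptions, the TTL works out to 100 days and gc_grace_seconds to 5 days, so the cluster would hold roughly 100 days of inserts before expiry begins to offset new writes.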