I suspect that running cluster-wide repair interferes with TTL-based
expiration. I run repair every 7 days and use a TTL of 7 days as well.
We never delete data explicitly; everything is expected to expire via TTL.
The amount of data stored in Cassandra keeps growing (I have been
watching it for 3 months), even though it should stay roughly constant.
If I run a manual cleanup, some data are removed, but only about 5%.
Currently there are about 3-5 times more rows than I estimate there
should be.
I suspect that running repair on data with TTL can cause one of the
following:
1. The expiration check is skipped, so already-expired records are
streamed to other nodes and become live again.
or
2. Streamed data are propagated with the full original TTL. Say the TTL
is 7 days and the data have been stored for 5 days when repair runs;
they should be sent to the other node with a remaining TTL of 2 days,
not 7.
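To make hypothesis 2 concrete, here is a minimal sketch of the arithmetic
I would expect correct streaming to follow. This is purely illustrative:
the function name is mine, not a Cassandra API.

```python
# Sketch of the remaining-TTL arithmetic from hypothesis 2.
# Hypothetical helper, not part of Cassandra's codebase.
from datetime import timedelta

def remaining_ttl(original_ttl: timedelta, age: timedelta) -> timedelta:
    """TTL a cell should carry when streamed after `age` of its lifetime."""
    remaining = original_ttl - age
    # A non-positive remainder means the cell has already expired and
    # should not be streamed at all.
    return max(remaining, timedelta(0))

# TTL of 7 days, data written 5 days before repair: the receiving node
# should see a remaining TTL of 2 days, not a fresh 7 days.
print(remaining_ttl(timedelta(days=7), timedelta(days=5)))
```

If repair instead re-applies the original 7-day TTL on the receiving
node, each repair cycle would push expiry out again, which would match
the unbounded growth I am seeing.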
Could someone test this case? I cannot experiment much on the production
cluster.