subject:"Repair of 5GB data vs. disk throughput does not make sense"

Re: Repair of 5GB data vs. disk throughput does not make sense

2018-04-26 Thread horschi

Hi Thomas, I don't think I have seen compaction ever being faster. For me, tables with small values usually are around 5 MB/s with a single compaction. With larger blobs (few KB per blob) I have seen 16MB/s. Both with "nodetool setcompactionthroughput 0". I don't think its disk related either.

Re: Repair of 5GB data vs. disk throughput does not make sense

2018-04-26 Thread Jonathan Haddad

I can't say for sure, because I haven't measured it, but I've seen a combination of readahead + large chunk size with compression cause serious issues with read amplification, although I'm not sure if or how it would apply here. Likely depends on the size of your partitions and the fragmentation

Repair of 5GB data vs. disk throughput does not make sense

2018-04-26 Thread Steinmaurer, Thomas

Hello, yet another question/issue with repair. Cassandra 2.1.18, 3 nodes, RF=3, vnode=256, data volume ~ 5G per node only. A repair (nodetool repair -par) issued on a single node at this data volume takes around 36min with an AVG of ~ 15MByte/s disk throughput (read+write) for the entire