Expensive metrics?

Michael Burman Thu, 22 Feb 2018 06:11:01 -0800

Hi,

I wanted to get some input from the mailing list before making a JIRA and potential fixes. I'll touch on the performance more in the latter part, but there's one important question regarding where the write latency metric is recorded. Currently we measure the writeLatency (and metric write sampler..) in ColumnFamilyStore.apply() and this is also the metric we then replicate to Keyspace metrics etc.

This is an odd place for writeLatency. Not only is it in a hot path of Memtable modifications, it also does not measure the real write latency, since it completely ignores the CommitLog latency in that same process. Is the intention really to measure Memtable-modification latency only, or the actual write latencies?

Then the real issue.. this single metric is a cause of huge overhead in Memtable processing. There are several metrics / events in the CFS apply method, including metric sampler, storageHook reportWrite, colUpdateTimeDeltaHistogram and metric.writeLatency. These are not free at all when it comes to the processing. I made a small JMH benchmark here: https://gist.github.com/burmanm/b5b284bc9f1d410b1d635f6d3dac3ade that I'll be referring to.

The most offending of all these metrics is the writeLatency metric. What it does is update the latency in codahale's timer, doing a histogram update, and then go through all the parent metrics as well, which updates the keyspace writeLatency and globalWriteLatency. When measuring the performance of Memtable.put with a parameter of 1 partition (to reduce the ConcurrentSkipListMap search speed impact - that's a separate issue and takes a little bit longer to solve, although I've started to prototype something..) on my machine I see 1.3M/s performance with the metric, and when it is disabled the performance climbs to 4M/s. So the overhead for this single metric is ~2/3 of total performance. That's insane. My perf stats indicate that the CPU is starved as it can't get enough data in.
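To make the write-time propagation described above concrete, here is a minimal sketch. The class LatencyMetric and its fields are hypothetical stand-ins, not the actual Cassandra or codahale classes, and the real metric also updates a histogram, which is left out here. The point is only that one table-level update turns into one update per parent as well.

```java
import java.util.concurrent.atomic.LongAdder;

public class WriteTimePropagation {
    // Hypothetical stand-in for a latency metric that forwards
    // every update to its parent metrics at write time.
    static final class LatencyMetric {
        final LongAdder count = new LongAdder();
        final LongAdder totalNanos = new LongAdder();
        final LatencyMetric[] parents;

        LatencyMetric(LatencyMetric... parents) {
            this.parents = parents;
        }

        void addNano(long nanos) {
            count.increment();
            totalNanos.add(nanos);
            // Each parent pays the same update cost on the hot path.
            for (LatencyMetric parent : parents) {
                parent.addNano(nanos);
            }
        }
    }

    // Records three writes on a table metric and returns the update
    // counts seen by the table, keyspace and global metrics.
    static long[] demo() {
        LatencyMetric global = new LatencyMetric();
        LatencyMetric keyspace = new LatencyMetric();
        LatencyMetric table = new LatencyMetric(keyspace, global);
        for (int i = 0; i < 3; i++) {
            table.addNano(100);
        }
        return new long[] { table.count.sum(), keyspace.count.sum(), global.count.sum() };
    }
}
```

In this sketch three writes cause nine counter updates in total, three per level.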

Removing the replication from TableMetrics to the Keyspace & global latencies at write time (and doing this when metrics are requested instead) improves the performance to 2.1M/s on my machine. It's an improvement, but the remaining overhead is still huge. Even when we pressure the ConcurrentSkipListMap with 100 000 partitions in one active Memtable, the performance drops by about 40% due to this metric, so it's never free.
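The read-time alternative could look roughly like the sketch below. All names here (TableLatency, AggregateLatency, register) are hypothetical and only illustrate the idea: the write path touches the table-level counters only, and the keyspace or global view is computed by summing over the registered tables when somebody asks for it. Percentiles cannot be obtained by summing like this, so a real version would also have to merge the per-table histograms at read time.

```java
import java.util.List;
import java.util.concurrent.CopyOnWriteArrayList;
import java.util.concurrent.atomic.LongAdder;

public class ReadTimeAggregation {
    // Hypothetical per-table metric: the only thing updated on the write path.
    static final class TableLatency {
        final LongAdder count = new LongAdder();
        final LongAdder totalNanos = new LongAdder();

        void addNano(long nanos) {
            count.increment();
            totalNanos.add(nanos);
        }
    }

    // Hypothetical keyspace/global view: does no work on writes,
    // sums its children only when the value is requested.
    static final class AggregateLatency {
        private final List<TableLatency> children = new CopyOnWriteArrayList<>();

        void register(TableLatency table) {
            children.add(table);
        }

        long count() {
            long sum = 0;
            for (TableLatency t : children) {
                sum += t.count.sum();
            }
            return sum;
        }

        double meanNanos() {
            long total = 0;
            for (TableLatency t : children) {
                total += t.totalNanos.sum();
            }
            long n = count();
            return n == 0 ? 0.0 : (double) total / n;
        }
    }

    // Two tables, three writes in total; returns the aggregate view.
    static AggregateLatency demo() {
        TableLatency t1 = new TableLatency();
        TableLatency t2 = new TableLatency();
        AggregateLatency keyspace = new AggregateLatency();
        keyspace.register(t1);
        keyspace.register(t2);
        t1.addNano(100);
        t1.addNano(200);
        t2.addNano(300);
        return keyspace;
    }
}
```

The trade-off is that reads become more expensive and slightly less consistent (the children keep changing while they are summed), which seems acceptable if metrics are read much less often than data is written.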

I did not find any discussion about replacing the metric processing with something faster, so has this been considered before? At least for these performance sensitive ones. The other issue is obviously the use of System.nanoTime(), which by itself is very slow (two System.nanoTime() calls eat another ~1M/s from the performance).
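For a rough feel of the clock cost on a given machine, something like the following sketch can be used. It is not a substitute for a JMH benchmark like the one in the gist: there is no warmup control and the result varies a lot between operating systems and clock sources. The class and method names are made up for this example.

```java
public class ClockCost {
    // Returns a crude estimate of the average cost of one
    // System.nanoTime() call, in nanoseconds.
    static double nanosPerCall(int iterations) {
        long sink = 0;
        long start = System.nanoTime();
        for (int i = 0; i < iterations; i++) {
            sink += System.nanoTime();
        }
        long elapsed = System.nanoTime() - start;
        // Use the accumulated value so the loop cannot be optimized away.
        if (sink == 42) {
            System.out.println("unlikely");
        }
        return (double) elapsed / iterations;
    }

    public static void main(String[] args) {
        nanosPerCall(1_000_000); // crude warmup
        System.out.println(nanosPerCall(10_000_000) + " ns per call");
    }
}
```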

My personal quick fix would be to move writeLatency to Keyspace.apply, change write time aggregates to read time processing (metrics are read less often than we write data) and maybe even replace nanoTime with currentTimeMillis (even given its relative lack of precision). That is - if these metrics make any sense at all at CFS level? Maybe these should be measured from the network processing time (including all the deserializations and such)? Especially if at some point the smarter threading / eventlooping changes go forward (in which case they might sleep at some "queue" for a while).
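The difference between the two measurement points can be sketched as follows. The two Runnable stages are hypothetical placeholders for the CommitLog append and the Memtable update, and the clock is injected so the example is deterministic; none of this is the actual Keyspace.apply code.

```java
import java.util.function.LongSupplier;

public class WriteTimingScope {
    // Runs the two stages of a write and returns
    // { memtable-only latency, whole-write latency } in clock units.
    static long[] timeWrite(LongSupplier clock, Runnable commitLog, Runnable memtable) {
        long writeStart = clock.getAsLong();    // where a Keyspace-level timer would start
        commitLog.run();
        long memtableStart = clock.getAsLong(); // where the CFS-level timer starts today
        memtable.run();
        long end = clock.getAsLong();
        return new long[] { end - memtableStart, end - writeStart };
    }

    // Fake clock: the commit log stage "takes" 700 units, the memtable stage 300.
    static long[] demo() {
        long[] fakeNow = { 0 };
        LongSupplier clock = () -> fakeNow[0];
        return timeWrite(clock, () -> fakeNow[0] += 700, () -> fakeNow[0] += 300);
    }
}
```

With these made-up numbers the memtable-only timer reports 300 while the write actually took 1000, which is the gap the question about CommitLog latency is pointing at.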


  - Micke

