Hi there,

As noted in the 0.10.0.0-RC4 release thread, we (Heroku Kafka) have been
doing extensive benchmarking of Kafka. In our case this is to help give
customers a good idea of the performance of our various configurations. For
this we orchestrate the Kafka `producer-perf.sh` and `consumer-perf.sh`
across multiple machines, which was relatively easy to do and very
successful (recently leading to a doc change and a good lesson about 0.10).

However, we're finding one thing missing from the current producer/consumer
perf tests, which is that there's no good perf testing on compacted topics.
Some folk will undoubtedly use compacted topics, so it would be extremely
helpful (I think) for the community to have benchmarks that test
performance on compacted topics. We're interested in working on this and
contributing it upstream, but are pretty unsure what such a test should
look like. One straw proposal is to adapt the existing producer/consumer
perf tests to work on a compacted topic, likely with an additional flag on
the producer that lets you choose how wide a key range to emit, if it
should emit deletes (and how often to do so) and so on. Is there anything
more we could or should do there?

We're happy writing the code here, and want to continue contributing back,
I'd just love a hand thinking about what perf tests for compacted topics
should look like.

Thanks

Tom Crayford
Heroku Kafka

Reply via email to