I'm a fan of Kafka as well. We've been using 0.7.2 for about a year.
I recommend it strongly.
But I will point out one thing. It is not an issue with Kafka itself, but
when the client side has failed, choosing what offset to reset to is not an exact
science. You will have to decide how much data loss or dat
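
To make the reset decision above a bit more concrete, here is a minimal sketch in plain Python. It does not use any Kafka client API. The names `PartitionOffsets` and `choose_reset_offset` and the three policy strings are hypothetical, made up for illustration, and offsets are modelled as plain integers. The assumption is that the client can find out the earliest retained offset, the latest offset, and possibly the last offset it recorded before it failed.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class PartitionOffsets:
    """Hypothetical view of one partition as seen by a recovering client."""
    earliest: int               # oldest offset still retained
    latest: int                 # end of the log at recovery time
    last_known: Optional[int]   # last offset the client recorded, if any


def choose_reset_offset(offsets: PartitionOffsets, policy: str) -> int:
    """Pick the offset to resume from under an illustrative policy.

    "earliest":   re-read everything still retained (nothing skipped,
                  but already-processed messages are seen again).
    "latest":     jump to the end of the log (nothing re-read, but
                  anything not yet consumed is skipped).
    "last_known": resume from the recorded offset, clamped to the range
                  that still exists; fall back to earliest if none.
    """
    if policy == "earliest":
        return offsets.earliest
    if policy == "latest":
        return offsets.latest
    if policy == "last_known":
        if offsets.last_known is None:
            return offsets.earliest
        # Clamp: the recorded offset may have aged out of retention.
        return max(offsets.earliest, min(offsets.last_known, offsets.latest))
    raise ValueError(f"unknown policy: {policy!r}")
```

None of the three policies is exact: each one either re-reads some messages or skips some, so the choice depends on which of the two the pipeline tolerates better.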
Richard - I can't speak about 0.8.x, but we've been using Kafka in a highly
critical capacity in production since the early days @ Datadog.
And...
- It never failed (!)
- It scaled predictably
- Its performance was consistent
- The simplicity of its design made it really easy to reason about
We
Richard, KAFKA-156 is something that the client could take care of. The ticket is
2 years old and a bit of a hacky suggestion for what replication in 0.8.0
has since become, and I would argue we close it (we should go through all the
tickets prior to 0.9 and chat about them; I will start a separate thread on
dev).
I'm excited about Kafka but want to be sure it is ready (or will be soon)
for a critical data pipeline.
What are the showstoppers, if any? e.g.
https://issues.apache.org/jira/browse/KAFKA-156
mentioned here:
http://stackoverflow.com/questions/12130481/is-kafka-ready-for-production-use/12764663#1