James Thornton created KAFKA-1436:
-------------------------------------

             Summary: Idempotent Producer / Duplicate Detection
                 Key: KAFKA-1436
                 URL: https://issues.apache.org/jira/browse/KAFKA-1436
             Project: Kafka
          Issue Type: Improvement
          Components: consumer, producer 
    Affects Versions: 0.8.0, 0.8.1, 0.8.2, 0.9.0, 0.8.1.1
            Reporter: James Thornton
            Assignee: Neha Narkhede


Dealing with duplicate messages is one of the major issues for teams using 
Kafka, and Jay Kreps posted a page on implementing an Idempotent Producer to 
address this issue:

https://cwiki.apache.org/confluence/display/KAFKA/Idempotent+Producer

MapDB 1.0 (https://github.com/jankotek/MapDB) was just released, and either it 
or Java Chronicle (https://github.com/OpenHFT/Java-Chronicle/) could be 
embedded within each broker to provide a high-performance, random-access, 
off-heap store for request IDs.

As Jay points out in his post, global unique request IDs probably aren't 
needed, but if that need should arise, Twitter's Snowflake service 
(https://github.com/twitter/snowflake/) might be useful.





--
This message was sent by Atlassian JIRA
(v6.2#6252)

Reply via email to