[ https://issues.apache.org/jira/browse/APEXMALHAR-2283?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15557187#comment-15557187 ]
Sandesh commented on APEXMALHAR-2283: ------------------------------------- During recovery - 0.9 Kafka Output operator. appId + operatorId -> used to distinguish tuples from other writers Extra window state -> tracks Kafka offsets, tuples between that Kafka offsets are read during recovery. To construct partial window. Hashcode -> replayed tuples are dropped rather than writing to Kafka It is important to make sure that logic works, as we have few customers using this operator in production. > Refactor kafka output operator > ------------------------------ > > Key: APEXMALHAR-2283 > URL: https://issues.apache.org/jira/browse/APEXMALHAR-2283 > Project: Apache Apex Malhar > Issue Type: Improvement > Reporter: Siyuan Hua > Assignee: Siyuan Hua > > The abstract kafka output operator needs to be refactored > 1. Needs to set some mandatory properties on operator level instead of kafka > property level. > 2. More document and examples > 3. Find a standard way to achieve exactly once in both 0.8 and 0.9 > More will be added when working on the ticket -- This message was sent by Atlassian JIRA (v6.3.4#6332)