[ 
https://issues.apache.org/jira/browse/APEXMALHAR-2283?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15557187#comment-15557187
 ] 

Sandesh commented on APEXMALHAR-2283:
-------------------------------------

How the 0.9 Kafka output operator behaves during recovery:

appId + operatorId -> used to distinguish this operator's tuples from those of 
other writers
Extra window state -> tracks Kafka offsets; the tuples between those offsets 
are read back during recovery to reconstruct the partial window
Hashcode -> replayed tuples are dropped rather than written to Kafka
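
The three points above can be sketched roughly as follows. This is a simplified, 
in-memory illustration, not the operator's actual code: the class 
RecoveryDedupSketch, the LogRecord type, the "#" separator in the writer key and 
the list standing in for a Kafka partition are all hypothetical, and no Kafka or 
Malhar API is used. It assumes tuples are matched through hashCode()/equals() 
via a HashMap.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Hypothetical in-memory sketch; no Kafka client or Malhar classes are involved.
final class RecoveryDedupSketch {

  /** One record in the simulated Kafka partition log. */
  static final class LogRecord {
    final String writerKey; // assumed format: appId + "#" + operatorId
    final String value;

    LogRecord(String writerKey, String value) {
      this.writerKey = writerKey;
      this.value = value;
    }
  }

  private final String writerKey;
  private final List<LogRecord> log; // stands in for a Kafka partition
  // tuple -> number of copies still to drop; lookup relies on hashCode()/equals()
  private final Map<String, Integer> partialWindow = new HashMap<>();

  RecoveryDedupSketch(String appId, int operatorId, List<LogRecord> log) {
    this.writerKey = appId + "#" + operatorId;
    this.log = log;
  }

  /** Rebuilds the partial window from records at or after the checkpointed offset. */
  void recover(int checkpointedOffset) {
    partialWindow.clear();
    for (int i = checkpointedOffset; i < log.size(); i++) {
      LogRecord r = log.get(i);
      if (writerKey.equals(r.writerKey)) { // skip tuples from other writers
        partialWindow.merge(r.value, 1, Integer::sum);
      }
    }
  }

  /** Writes the tuple unless it was already written before the failure. */
  boolean process(String tuple) {
    Integer remaining = partialWindow.get(tuple);
    if (remaining != null) {
      // Replayed tuple: drop it and decrement its outstanding count.
      if (remaining == 1) {
        partialWindow.remove(tuple);
      } else {
        partialWindow.put(tuple, remaining - 1);
      }
      return false;
    }
    log.add(new LogRecord(writerKey, tuple));
    return true;
  }
}
```

In this sketch, recover() is called once with the offset saved at the last 
checkpoint, and process() is then called for every replayed tuple. Only tuples 
that did not reach the log before the failure are appended.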

It is important to make sure this logic still works, as we have a few customers 
using this operator in production.

> Refactor kafka output operator
> ------------------------------
>
>                 Key: APEXMALHAR-2283
>                 URL: https://issues.apache.org/jira/browse/APEXMALHAR-2283
>             Project: Apache Apex Malhar
>          Issue Type: Improvement
>            Reporter: Siyuan Hua
>            Assignee: Siyuan Hua
>
> The abstract Kafka output operator needs to be refactored:
> 1. Some mandatory properties need to be set at the operator level instead of 
> the Kafka property level.
> 2. More documentation and examples
> 3. Find a standard way to achieve exactly-once in both 0.8 and 0.9
> More will be added while working on the ticket



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
