[ 
https://issues.apache.org/jira/browse/FLINK-5697?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16672285#comment-16672285
 ] 

ASF GitHub Bot commented on FLINK-5697:
---------------------------------------

EronWright commented on issue #6980: [FLINK-5697] [kinesis] Add periodic 
per-shard watermark support
URL: https://github.com/apache/flink/pull/6980#issuecomment-435202809
 
 
   There is a caveat with this implementation that the docs should perhaps 
mention.  The caveat is that it may produce spurious late events when 
processing a backlog of data.
   
   Here's an example of when that may occur.  Imagine that subtask 1 is 
processing shard A and subtask 2 is processing shard B.  Shard A has reached 
6:00 in event time (as per the assigner), and so the subtask emits the 
corresponding watermark.  At this point, the subtask has made the irrevocable 
assertion that subsequent events will be past 6:00.   Meanwhile, Shard B is at 
5:30 and undergoes a split into C/D.  If either shard is subsequently assigned 
to subtask 1, the events will be considered late due to the assertion made 
earlier.
   
   

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
[email protected]


> Add per-shard watermarks for FlinkKinesisConsumer
> -------------------------------------------------
>
>                 Key: FLINK-5697
>                 URL: https://issues.apache.org/jira/browse/FLINK-5697
>             Project: Flink
>          Issue Type: New Feature
>          Components: Kinesis Connector, Streaming Connectors
>            Reporter: Tzu-Li (Gordon) Tai
>            Assignee: Thomas Weise
>            Priority: Major
>              Labels: pull-request-available
>
> It would be nice to let the Kinesis consumer be on-par in functionality with 
> the Kafka consumer, since they share very similar abstractions. Per-partition 
> / shard watermarks is something we can add also to the Kinesis consumer.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Reply via email to