[ 
https://issues.apache.org/jira/browse/FLINK-1967?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14626042#comment-14626042
 ] 

ASF GitHub Bot commented on FLINK-1967:
---------------------------------------

Github user aljoscha commented on the pull request:

    https://github.com/apache/flink/pull/906#issuecomment-121168918
  
    The job I ran was source -> repartition -> sink, where the source
    generates tuples in an infinite loop.
    
    The gist of the data is this (I'm rerunning some of the tests with longer
    runtimes to remove outliers):
     - Introducing a timestamp for each record, set at the source, incurs
       about a 15% performance hit in these tests.
     - Watermarks have no additional overhead once the interval is higher than
       100 sec (I didn't test lower intervals, such as 50 sec, but I will run
       those tests today).


> Introduce (Event)time in Streaming
> ----------------------------------
>
>                 Key: FLINK-1967
>                 URL: https://issues.apache.org/jira/browse/FLINK-1967
>             Project: Flink
>          Issue Type: Improvement
>            Reporter: Aljoscha Krettek
>            Assignee: Aljoscha Krettek
>
> This requires introducing a timestamp in each streaming record and changing 
> the sources to add timestamps to records. This will also introduce 
> punctuations (or low watermarks) to allow windows to work correctly on 
> unordered, timestamped input data. As part of this, the windowing subsystem 
> also needs to be adapted to use the punctuations. Furthermore, all operators 
> need to be made aware of punctuations and forward them correctly. Finally, a 
> new operator must be introduced to allow modification of timestamps.

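To make the issue description concrete, here is a minimal sketch of the two mechanisms it mentions: an operator forwarding the low watermark across several inputs, and a window firing on unordered, timestamped records once the watermark passes its end. This is not Flink's implementation. The class names `WatermarkTracker` and `TumblingWindowOperator` are hypothetical, and the sketch assumes that a watermark with timestamp t promises that no later record has a timestamp below t.

```python
from collections import defaultdict


class WatermarkTracker:
    """Hypothetical helper: low watermark of an operator with several inputs."""

    def __init__(self, num_inputs):
        # No watermark has been seen on any input yet.
        self.per_input = [float("-inf")] * num_inputs
        self.current = float("-inf")

    def on_watermark(self, input_index, timestamp):
        """Record a watermark from one input.

        Returns the watermark to forward downstream, or None if the
        operator's own watermark did not advance.
        """
        # A watermark never moves backwards on a single input.
        self.per_input[input_index] = max(self.per_input[input_index], timestamp)
        # The operator can only promise what its slowest input promises.
        candidate = min(self.per_input)
        if candidate > self.current:
            self.current = candidate
            return candidate
        return None


class TumblingWindowOperator:
    """Hypothetical event-time window over [start, start + size)."""

    def __init__(self, size):
        self.size = size
        self.windows = defaultdict(list)  # window start -> buffered values

    def on_record(self, timestamp, value):
        # Assign by the record's own timestamp, so arrival order is irrelevant.
        start = timestamp - (timestamp % self.size)
        self.windows[start].append(value)

    def on_watermark(self, watermark):
        """Emit every window whose end is at or before the watermark."""
        ready = sorted(s for s in self.windows if s + self.size <= watermark)
        return [(s, self.windows.pop(s)) for s in ready]
```

With two inputs, the tracker forwards nothing until both inputs have reported a watermark, and then forwards the minimum of the two. The window operator buffers records that arrive out of order and emits a window only after a watermark at or beyond the window's end has arrived.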


--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
