[ 
https://issues.apache.org/jira/browse/BEAM-9308?focusedWorklogId=391041&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-391041
 ]

ASF GitHub Bot logged work on BEAM-9308:
----------------------------------------

                Author: ASF GitHub Bot
            Created on: 22/Feb/20 03:36
            Start Date: 22/Feb/20 03:36
    Worklog Time Spent: 10m 
      Work Description: steveniemitz commented on issue #10852: [BEAM-9308] 
Decorrelate state cleanup timers
URL: https://github.com/apache/beam/pull/10852#issuecomment-589913494
 
 
   > Why is this problem specific to the GC timer? How about the normal 
end-of-window timer that is used to fire windowed aggregations. For fixed 
windows there is one per key and those also fire all at the same time.
   
   heh, we already work around that on our own by using state + timers instead 
of the built-in combine transform.  We already decorrelate our end-of-window 
triggering (and we're now using the watermark hold feature for timers which 
simplified things a lot), but can't work around the state GC w/o changing the 
worker itself.
 
----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
-------------------

    Worklog Id:     (was: 391041)
    Time Spent: 1h 20m  (was: 1h 10m)

> Optimize state cleanup at end-of-window
> ---------------------------------------
>
>                 Key: BEAM-9308
>                 URL: https://issues.apache.org/jira/browse/BEAM-9308
>             Project: Beam
>          Issue Type: Improvement
>          Components: runner-dataflow
>            Reporter: Steve Niemitz
>            Assignee: Steve Niemitz
>            Priority: Major
>          Time Spent: 1h 20m
>  Remaining Estimate: 0h
>
> When using state with a large keyspace, you can end up with a large amount of 
> state cleanup timers set to fire all 1ms after the end of a window.  This can 
> cause a momentary (I've observed 1-3 minute) lag in processing while windmill 
> and the java harness fire and process these cleanup timers.
> By spreading the firing over a short period after the end of the window, we 
> can decorrelate the firing of the timers and smooth the load out, resulting 
> in much less impact from state cleanup.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to