Another thing that came up after I started working on an implementation is
that in addition to deprecating "retention" from the Windows interface, we
also need to deprecate "segmentInterval", for the same reasons. I simply
overlooked it previously. I've updated the KIP accordingly.

Hopefully, this doesn't change anyone's vote.

Thanks,
-John

On Mon, Jul 30, 2018 at 5:31 PM John Roesler <j...@confluent.io> wrote:

> Thanks Guozhang,
>
> Thanks for that catch. to clarify, currently, events are "late" only when
> they are older than the retention period. Currently, we detect this in the
> processor and record it as a "skipped-record". We then do not attempt to
> store the event in the window store. If a user provided a pre-configured
> window store with a retention period smaller than the one they specify via
> Windows#until, the segmented store will drop the update with no metric and
> record a debug-level log.
>
> With KIP-328, with the introduction of "grace period" and moving retention
> fully into the state store, we need to have metrics for both "late events"
> (new records older than the grace period) and "expired window events" (new
> records for windows that are no longer retained in the state store). I
> already proposed metrics for the late events, and I've just updated the KIP
> with metrics for the expired window events. I also updated the KIP to make
> it clear that neither late nor expired events will count as
> "skipped-records" any more.
>
> -John
>
> On Mon, Jul 30, 2018 at 4:22 PM Guozhang Wang <wangg...@gmail.com> wrote:
>
>> Hi John,
>>
>> Thanks for the updated KIP, +1 from me, and one minor suggestion:
>>
>> Following your suggestion of the differentiation of `skipped-records` v.s.
>> `late-event-drop`, we should probably consider moving the scenarios where
>> records got ignored due the window not being available any more in
>> windowed
>> aggregation operators from the `skipped-records` metrics recording to the
>> `late-event-drop` metrics recording.
>>
>>
>>
>> Guozhang
>>
>>
>> On Mon, Jul 30, 2018 at 1:36 PM, Bill Bejeck <bbej...@gmail.com> wrote:
>>
>> > Thanks for the KIP!
>> >
>> > +1
>> >
>> > -Bill
>> >
>> > On Mon, Jul 30, 2018 at 3:42 PM Ted Yu <yuzhih...@gmail.com> wrote:
>> >
>> > > +1
>> > >
>> > > On Mon, Jul 30, 2018 at 11:46 AM John Roesler <j...@confluent.io>
>> wrote:
>> > >
>> > > > Hello devs,
>> > > >
>> > > > The discussion of KIP-328 has gone some time with no new comments,
>> so I
>> > > am
>> > > > calling for a vote!
>> > > >
>> > > > Here's the KIP: https://cwiki.apache.org/confluence/x/sQU0BQ
>> > > >
>> > > > The basic idea is to provide:
>> > > > * more usable control over update rate (vs the current state store
>> > > caches)
>> > > > * the final-result-for-windowed-computations feature which several
>> > people
>> > > > have requested
>> > > >
>> > > > Thanks,
>> > > > -John
>> > > >
>> > >
>> >
>>
>>
>>
>> --
>> -- Guozhang
>>
>

Reply via email to