Hi,

If you are just de-duplicating based on a key and have a limited batch of
transactions, then you should go with BoundedDedup.

TimeBasedDedup is for cases where you want to dedup within a stream with
expiry based on the time in your tuples.

~ Bhupesh


_______________________________________________________

Bhupesh Chawda

E: bhup...@datatorrent.com | Twitter: @bhupeshsc

www.datatorrent.com  |  apex.apache.org



On Thu, May 25, 2017 at 7:39 PM, Guilherme Hott <guilhermeh...@gmail.com>
wrote:

> Hi everyone,
>
> I have in my kafka operator messages coming and in my input port and
> I have to process and emit a batch of transactions to a Dedup operator.
> Should I use BoundedDedup or TimeBasedDedup?
>
> Thanks
>
> --
> *Guilherme Hott*
> *Software Engineer*
> Skype: guilhermehott
> @guilhermehott
> https://www.linkedin.com/in/guilhermehott
>
>

Reply via email to