Thank you Bhupesh. I think this is the best to do. On Thu, May 25, 2017 at 7:54 PM, Bhupesh Chawda <bhup...@datatorrent.com> wrote:
> Hi, > > If you are just de-duplicating based on a key and have a limited batch of > transactions, then you should go with BoundedDedup. > > TimeBasedDedup is for cases where you want to dedup within a stream with > expiry based on the time in your tuples. > > ~ Bhupesh > > > _______________________________________________________ > > Bhupesh Chawda > > E: bhup...@datatorrent.com | Twitter: @bhupeshsc > > www.datatorrent.com | apex.apache.org > > > > On Thu, May 25, 2017 at 7:39 PM, Guilherme Hott <guilhermeh...@gmail.com> > wrote: > >> Hi everyone, >> >> I have in my kafka operator messages coming and in my input port and >> I have to process and emit a batch of transactions to a Dedup operator. >> Should I use BoundedDedup or TimeBasedDedup? >> >> Thanks >> >> -- >> *Guilherme Hott* >> *Software Engineer* >> Skype: guilhermehott >> @guilhermehott >> https://www.linkedin.com/in/guilhermehott >> >> > -- *Guilherme Hott* *Software Engineer* Skype: guilhermehott @guilhermehott https://www.linkedin.com/in/guilhermehott