In my experience on a 1 Gb network localOrShuffleGrouping was a clear winner in terms of performance. But I haven't tested with 10 Gb, and if you have substantial business logic then that becomes a bigger factor than serializing/transferring data on the network. I think the performance of any given grouping is too dependent on your business logic; it will be difficult to quantify how well it performs in a canned benchmark. And sometimes your business logic will define a grouping for you (e.g. fields grouping) whether it's the best performer or not.
On Wed, Mar 5, 2014 at 1:05 PM, Roberto Coluccio <[email protected] > wrote: > Hello Michael, thanks for your feedback. > > I'm looking for a performance comparison. I know that not all the policies > are "really comparable", but even obvious comparisons all listed together > could be a useful reference. > > Roberto > > > On Wed, Mar 5, 2014 at 6:58 PM, Michael Rose <[email protected]>wrote: > >> What kind of comparisons are you looking for? How they functionally work? >> >> Michael Rose (@Xorlev <https://twitter.com/xorlev>) >> Senior Platform Engineer, FullContact <http://www.fullcontact.com/> >> [email protected] >> >> >> On Wed, Mar 5, 2014 at 9:52 AM, Roberto Coluccio < >> [email protected]> wrote: >> >>> Hello folks, >>> >>> I was unable to find any complete example (or, better, related work in >>> the scientific literature) in which (almost) all the *stream grouping >>> policies* have been used and compared. Do you have any reference you >>> could please share with me? >>> >>> Thank you and best regards, >>> >>> Roberto Coluccio >>> >> >> >
