On Fri, Aug 14, 2015 at 1:51 PM, Dan Smith <[email protected]> wrote:
> The java 8 reduce() method returns a scalar. So my .map().reduce() example
> didn't really have a shuffle phase. We haven't implemented any sort of
> shuffle, but our reduce is processed on the servers first and then
> aggregated on the client. I'm not quite sure what the best way to work a
> shuffle into this stream API would be, actually. I suppose using a map
> followed by a sort(). We didn't do anything clever with sort either :)

Isn't what you're looking for analogous to reduce() versus reduceByKey()
in Spark terminology?

Thanks,
Roman.
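
To make the reduce() versus reduceByKey() distinction concrete, here is a minimal sketch using only the plain Java 8 stream API, not the distributed stream API discussed in this thread. The class name, method names, and sample data are made up for illustration. The first method folds everything into one scalar, so no grouping is needed; the second groups values by key before reducing each group, which is roughly the step a shuffle would perform in a distributed setting.

```java
import java.util.Arrays;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;
import java.util.stream.Collectors;

// Hypothetical example class; none of these names come from the thread.
public class ReduceVsReduceByKey {

    // Scalar reduce: every element folds into a single value,
    // so no grouping (and no shuffle) is required.
    static int totalLength(List<String> words) {
        return words.stream()
                .map(String::length)
                .reduce(0, Integer::sum);
    }

    // Keyed reduce: elements are grouped by key (first letter here),
    // then each group is reduced on its own. TreeMap keeps keys sorted.
    static Map<Character, Integer> lengthByFirstLetter(List<String> words) {
        return words.stream()
                .collect(Collectors.groupingBy(
                        w -> w.charAt(0),
                        TreeMap::new,
                        Collectors.summingInt(String::length)));
    }

    public static void main(String[] args) {
        List<String> words =
                Arrays.asList("apple", "avocado", "banana", "cherry", "blueberry");
        System.out.println(totalLength(words));          // one scalar
        System.out.println(lengthByFirstLetter(words));  // one value per key
    }
}
```

On a single JVM, Collectors.groupingBy does the grouping in memory. A distributed version would presumably have to move values with the same key to the same server before the per-key reduce, which is the part that has not been implemented.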
