On Fri, Aug 14, 2015 at 1:51 PM, Dan Smith <[email protected]> wrote:
> The java 8 reduce() method returns a scalar. So my .map().reduce() example
> didn't really have a shuffle phase. We haven't implemented any sort of
> shuffle, but our reduce is processed on the servers first and then
> aggregated on the client. I'm not quite sure what the best way to work a
> shuffle into this stream API would be, actually. I suppose using a map
> followed by a sort(). We didn't do anything clever with sort either :)

Isn't what you're looking for analogous to  reduce() versus reduceByKey()
in Spark terminology

Thanks,
Roman.

Reply via email to