GitHub user nickwallen commented on the issue:
https://github.com/apache/metron/pull/1150
> @simonellistonball: Do we have to use groupByKey in the spark
> implementation, is it not possible to use reduceByKey to build the profiles...
I had in the back of my mind that groupByKey might not be the most
performant option, but I just didn't focus any energy on that for the first
pass.
I will take a look and see if we can't use your advice. Thanks for the
pointer, @simonellistonball!
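
For readers unfamiliar with the trade-off discussed above, here is a minimal sketch of why reduceByKey is usually cheaper than groupByKey. It is plain Python that imitates the two strategies, not Spark code and not the Profiler's implementation; the partition data, the key names, and the functions group_by_key and reduce_by_key are all hypothetical. The point it illustrates is that a reduce-style aggregation can combine values inside each partition before the shuffle, so fewer records have to move.

```python
from collections import defaultdict

# Hypothetical data: two "partitions" of (profile_key, value) pairs.
partitions = [
    [("host-a", 1), ("host-b", 2), ("host-a", 3)],
    [("host-a", 4), ("host-b", 5)],
]

def group_by_key(parts):
    """groupByKey-style: every record is shuffled, then values are grouped."""
    shuffled = [pair for part in parts for pair in part]  # all records move
    groups = defaultdict(list)
    for key, value in shuffled:
        groups[key].append(value)
    return dict(groups), len(shuffled)

def reduce_by_key(parts, combine):
    """reduceByKey-style: combine inside each partition, then merge partials."""
    shuffled = []
    for part in parts:
        partial = {}
        for key, value in part:
            partial[key] = combine(partial[key], value) if key in partial else value
        # At most one record per key leaves each partition.
        shuffled.extend(partial.items())
    merged = {}
    for key, value in shuffled:
        merged[key] = combine(merged[key], value) if key in merged else value
    return merged, len(shuffled)

grouped, grouped_shuffled = group_by_key(partitions)
sums_from_groups = {key: sum(values) for key, values in grouped.items()}
reduced, reduced_shuffled = reduce_by_key(partitions, lambda a, b: a + b)

print(sums_from_groups == reduced)         # both strategies agree on the result
print(grouped_shuffled, reduced_shuffled)  # records shuffled by each strategy
```

This only works when the combine function is associative and commutative, as a sum is. Whether building a profile can be expressed that way is the open question in this thread, and the sketch does not settle it.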
