Great article, especially the use of a custom partitioner. Also, sorting by multiple fields by creating a tuple out of them is an awesome, easy to miss, Scala feature.
Sent from my iPhone On 04 Oct 2015, at 21:41, Bill Bejeck <bbej...@gmail.com<mailto:bbej...@gmail.com>> wrote: I've written blog post on secondary sorting in Spark and I'd thought I'd share it with the group http://codingjunkie.net/spark-secondary-sort/ Thanks, Bill