Spark friends, I recently wrote up a blog post with examples of some of the standard techniques for improving Spark application performance:
http://chapeau.freevariable.com/2014/09/improving-spark-application-performance.html The idea is that we start with readable but poorly-performing code and iteratively refine it, looking at the performance and operational consequences of a series of simple changes. My intention is that this would provide some context for folks who are new to Spark and looking to improve their prototype applications. If you have additional suggestions for techniques to address, let me know! I'd be happy to write a follow-up post. best, wb --------------------------------------------------------------------- To unsubscribe, e-mail: user-unsubscr...@spark.apache.org For additional commands, e-mail: user-h...@spark.apache.org