I was wondering less partitioning rdds could help the Spark performance and reduce shuffling? is it true?
-- View this message in context: http://apache-spark-user-list.1001560.n3.nabble.com/shuffle-vs-performance-tp4255.html Sent from the Apache Spark User List mailing list archive at Nabble.com.