Hi, I’m currently pursuing a master’s degree in CS and am looking for a topic for my year project (in the distributed systems field), and Spark seems very interesting to me.
Can you suggest some problems or ideas to work on? By the way, what is the status of external sorting (https://spark-project.atlassian.net/browse/SPARK-983)?