When you partition via groupByKey, tuples (elements of the RDD) are moved from 
one node to another based on their key (hash partitioning).
Do the tuples remain part of a single RDD as before, just moved to different 
nodes, or does this shuffling create, say, several RDDs, each holding part of 
the original RDD?
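
To make the question concrete, here is a minimal sketch of what I understand
by hash partitioning. It is plain Python rather than Spark code, and
hash_partition and the sample data are made up for illustration; Spark's own
partitioner may differ in detail.

```python
from collections import defaultdict

def hash_partition(pairs, num_partitions):
    """Assign each (key, value) tuple to a partition index by hashing its key.

    Hypothetical helper for illustration only, not a Spark API.
    """
    partitions = defaultdict(list)
    for key, value in pairs:
        # Tuples with the same key always land in the same partition.
        partitions[hash(key) % num_partitions].append((key, value))
    return dict(partitions)

# Integer keys are used so the hash is stable across runs.
pairs = [(1, "a"), (2, "b"), (1, "c"), (3, "d"), (2, "e")]
parts = hash_partition(pairs, 2)
```

In this toy version all partitions live in one dict. What I am unsure about is
whether Spark likewise keeps one RDD whose partitions sit on different nodes.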

Thanks
-Adrian
