Apologies for asking yet again about Spark memory assumptions, but I can't seem to keep them in my head.
If I use PairRDDFunctions.cogroup, it returns two Iterables for every key. Do the contents of these Iterables have to fit in memory, or is the data streamed?
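
To make the question concrete, here is a minimal sketch of the result shape I mean. It is a local model in plain Scala collections with no Spark dependency; `cogroupLocal`, `CogroupShape`, and the sample data are hypothetical names made up for illustration. This model builds both Iterables eagerly in memory, which says nothing about what Spark itself does; that is exactly what I am asking.

```scala
// Local model of the shape cogroup returns; plain Scala collections, no Spark.
// For reference, the Spark method on an RDD[(K, V)] has roughly this signature:
//   def cogroup[W](other: RDD[(K, W)]): RDD[(K, (Iterable[V], Iterable[W]))]
object CogroupShape {

  // Hypothetical helper: for every key seen on either side, pair up all
  // left values and all right values. Everything here is held in memory.
  def cogroupLocal[K, V, W](
      left: Seq[(K, V)],
      right: Seq[(K, W)]): Map[K, (Iterable[V], Iterable[W])] = {
    val keys = (left.map(_._1) ++ right.map(_._1)).distinct
    keys.map { k =>
      val vs: Iterable[V] = left.filter(_._1 == k).map(_._2)
      val ws: Iterable[W] = right.filter(_._1 == k).map(_._2)
      k -> ((vs, ws))
    }.toMap
  }

  def main(args: Array[String]): Unit = {
    // Made-up sample data, only to show the two Iterables per key.
    val left = Seq(("a", 1), ("a", 2), ("b", 3))
    val right = Seq(("a", "x"), ("c", "y"))
    cogroupLocal(left, right).foreach { case (k, (vs, ws)) =>
      println(s"$k -> left=${vs.toList}, right=${ws.toList}")
    }
  }
}
```

With Spark the call would be `leftRdd.cogroup(rightRdd)`, and the concern is a key with a very large number of values on one or both sides.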