apologies for asking yet again about spark memory assumptions, but i cant
seem to keep it in my head.

if i use PairRDDFunctions.cogroup, it returns for every key 2 iterables. do
the contents of these iterables have to fit in memory? or is the data
streamed?

Reply via email to