Hey,

Here is my need: program A processes one set of data and produces results, program B does the same on another set, and finally program C combines the results of A and B. Of course, the easy way is to dump everything to disk after A and B are done, but I would like to avoid this.
I was thinking of creating a temp view, but I do not really like the temp aspect of it ;).

Any ideas? (They are all worth sharing.)

jg