It appears that RDDs can do a cartesian join, but not DataFrames. Is there a fundamental reason why not, or is this just waiting for someone to implement?
I know you can get the RDDs underlying the DataFrames and do the cartesian join that way, but you lose the schema of course. Nick