Re: Multiple joins in Spark

2015-10-20 Thread Shyam Parimal Katti
hurt your performance. The actual physical > plan to execute your SQL query is generated by the result of Catalyst > optimizer. > > Good luck, > > Xiao Li > > > > 2015-10-16 20:53 GMT-07:00 Shyam Parimal Katti <spk...@nyu.edu>: > >> Thanks Xiao! Question abo

Multiple joins in Spark

2015-10-16 Thread Shyam Parimal Katti
Hello All, I have a following SQL query like this: select a.a_id, b.b_id, c.c_id from table_a a join table_b b on a.a_id = b.a_id join table_c c on b.b_id = c.b_id In scala i have done this so far: table_a_rdd = sc.textFile(...) table_b_rdd = sc.textFile(...) table_c_rdd = sc.textFile(...)