RE: question about sparksql caching

2015-05-15 Thread Cheng, Hao
: Friday, May 15, 2015 11:02 AM To: user Subject: question about sparksql caching Hi all, We are planing to use SparkSQL in a DW system. There's a question about the caching mechanism of SparkSQL. For example, if I have a SQL like sqlContext.sql(select c1, sum(c2) from T1, T2 where T1.key=T2.key

question about sparksql caching

2015-05-14 Thread sequoiadb
Hi all, We are planing to use SparkSQL in a DW system. There’s a question about the caching mechanism of SparkSQL. For example, if I have a SQL like sqlContext.sql(“select c1, sum(c2) from T1, T2 where T1.key=T2.key group by c1”).cache() Is it going to cache the final result or the raw data

question about sparksql caching

2015-05-14 Thread sequoiadb
Hi all, We are planing to use SparkSQL in a DW system. There’s a question about the caching mechanism of SparkSQL. For example, if I have a SQL like sqlContext.sql(“select c1, sum(c2) from T1, T2 where T1.key=T2.key group by c1”).cache() Is it going to cache the final result or the raw data