: Friday, May 15, 2015 11:02 AM
To: user
Subject: question about sparksql caching
Hi all,
We are planing to use SparkSQL in a DW system. There's a question about the
caching mechanism of SparkSQL.
For example, if I have a SQL like sqlContext.sql(select c1, sum(c2) from T1,
T2 where T1.key=T2.key
Hi all,
We are planing to use SparkSQL in a DW system. There’s a question about the
caching mechanism of SparkSQL.
For example, if I have a SQL like sqlContext.sql(“select c1, sum(c2) from T1,
T2 where T1.key=T2.key group by c1”).cache()
Is it going to cache the final result or the raw data
Hi all,
We are planing to use SparkSQL in a DW system. There’s a question about the
caching mechanism of SparkSQL.
For example, if I have a SQL like sqlContext.sql(“select c1, sum(c2) from T1,
T2 where T1.key=T2.key group by c1”).cache()
Is it going to cache the final result or the raw data