[GitHub] [spark] imback82 commented on a change in pull request #30598: [SPARK-33654][SQL] Migrate CACHE TABLE to use UnresolvedRelation to resolve identifier

GitBox Fri, 04 Dec 2020 13:20:53 -0800


imback82 commented on a change in pull request #30598:
URL: https://github.com/apache/spark/pull/30598#discussion_r536384428




##########
File path: 
sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/plans/logical/v2Commands.scala
##########
@@ -706,3 +705,13 @@ case class ShowPartitions(
   override val output: Seq[Attribute] = Seq(
     AttributeReference("partition", StringType, nullable = false)())
 }
+
+/**
+ * The logical plan of the CACHE TABLE command.
+ */
+case class CacheTable(
+    table: LogicalPlan,
+    multipartIdentifier: Seq[String],
+    query: Option[LogicalPlan],

Review comment:
       > hmm, for the `query`, it should apply filter pushdown and contain 
`DataSourceV2ScanRelation`, right?
   
   We create a temp view and cache it as follow:
   ```scala
   Dataset.ofRows(sparkSession, query.get).createTempView(relationName)
   val df = sparkSession.table(relationName)
   sparkSession.sharedState.cacheManager.cacheQuery(df, Some(relationName))
   ```
   `cacheQuery` uses the "analyzed" plan of `df` to cache. The analyzed plan 
will have `DataSourceV2Relation` (but the optimized plan will have 
`DataSourceV2ScanRelation`).
   
   If we make the `query` as a child of the plan, `query` will be have an 
optimized plan with `DataSourceV2ScanRelation`.
   




----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
[email protected]



---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

[GitHub] [spark] imback82 commented on a change in pull request #30598: [SPARK-33654][SQL] Migrate CACHE TABLE to use UnresolvedRelation to resolve identifier

Reply via email to