Github user erfangc commented on a diff in the pull request:
https://github.com/apache/spark/pull/13818#discussion_r73807925
--- Diff: sql/hive/src/main/scala/org/apache/spark/sql/hive/execution/InsertIntoHiveTable.scala ---
@@ -298,6 +298,7 @@ case class InsertIntoHiveTable(
     // Invalidate the cache.
     sqlContext.sharedState.cacheManager.invalidateCache(table)
+    sqlContext.sessionState.catalog.invalidateTable(table.catalogTable.identifier)
--- End diff --
Hi @mallman, does this change cause new sessions (e.g. an external
application connecting to the Thrift Server over JDBC) to not see cached
tables? I noticed this in the released version 2.0.0, where `CACHE TABLE` in
one session has no effect on new sessions: subsequent SQL statements still
read the underlying Parquet files from disk (as evidenced by tasks running
`NODE_LOCAL` and `RACK_LOCAL` instead of `PROCESS_LOCAL`). Sorry if this
question is unrelated to your patch, but this became a major issue for us in
2.0.0, whereas in 1.6.2 we do not see it.
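
For reference, here is a minimal sketch of how we reproduce this, assuming a
Thrift Server on localhost:10000 and the Hive JDBC driver on the classpath;
the table name `my_parquet_table` and the connection details are
hypothetical:

```scala
import java.sql.DriverManager

object CacheVisibilityRepro {
  def main(args: Array[String]): Unit = {
    // Hypothetical Thrift Server endpoint; adjust host/port/database as needed.
    val url = "jdbc:hive2://localhost:10000/default"
    Class.forName("org.apache.hive.jdbc.HiveDriver")

    // Session 1: cache the table.
    val conn1 = DriverManager.getConnection(url, "", "")
    conn1.createStatement().execute("CACHE TABLE my_parquet_table")

    // Session 2: a fresh JDBC connection gets its own session.
    // On 1.6.2 this scan ran against the in-memory cache (PROCESS_LOCAL
    // tasks); on 2.0.0 we see it read the Parquet files from disk
    // (NODE_LOCAL / RACK_LOCAL tasks) instead.
    val conn2 = DriverManager.getConnection(url, "", "")
    val rs = conn2.createStatement().executeQuery(
      "SELECT COUNT(*) FROM my_parquet_table")
    while (rs.next()) println(rs.getLong(1))

    rs.close(); conn1.close(); conn2.close()
  }
}
```

Each JDBC connection to the Thrift Server gets its own session state, which
is why the `sharedState` vs. `sessionState` distinction in the diff above
caught my eye.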