[
https://issues.apache.org/jira/browse/SPARK-15968?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Michael Allman updated SPARK-15968:
-----------------------------------
Description: The {{getCached}} method of {{HiveMetastoreCatalog}} computes
{{pathsInMetastore}} from the metastore relation's catalog table. This only
returns the table base path, which is not correct for non-empty partitioned
tables. As a result, cache lookups on non-empty partitioned tables always miss
and these relations are always recomputed. (was: The {{getCached}} method of
{{HiveMetastoreCatalog}} computes {{pathsInMetastore}} from the metastore
relation's catalog table. This only returns the table base path, which is not
correct for partitioned tables. As a result, cache lookups on partitioned
tables always miss and these relations are always recomputed.)
> HiveMetastoreCatalog does not correctly validate partitioned metastore
> relation when searching the internal table cache
> -----------------------------------------------------------------------------------------------------------------------
>
> Key: SPARK-15968
> URL: https://issues.apache.org/jira/browse/SPARK-15968
> Project: Spark
> Issue Type: Bug
> Components: SQL
> Affects Versions: 2.0.0
> Reporter: Michael Allman
> Labels: hive, metastore
>
> The {{getCached}} method of {{HiveMetastoreCatalog}} computes
> {{pathsInMetastore}} from the metastore relation's catalog table. This only
> returns the table base path, which is not correct for non-empty partitioned
> tables. As a result, cache lookups on non-empty partitioned tables always
> miss and these relations are always recomputed.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]