[jira] [Assigned] (SPARK-17603) Utilize Hive-generated Statistics For Partitioned Tables

2016-09-19 Thread Apache Spark (JIRA)

 [ 
https://issues.apache.org/jira/browse/SPARK-17603?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Apache Spark reassigned SPARK-17603:


Assignee: Apache Spark

> Utilize Hive-generated Statistics For Partitioned Tables
> 
>
> Key: SPARK-17603
> URL: https://issues.apache.org/jira/browse/SPARK-17603
> Project: Spark
> Issue Type: Bug
> Components: SQL
> Affects Versions: 2.0.0, 2.1.0
> Reporter: Xiao Li
> Assignee: Apache Spark
>
> For non-partitioned tables, Hive stores its generated statistics in the
> table properties. For partitioned tables, it stores them in the properties
> of each partition instead, so we currently cannot use Hive-generated
> statistics for partitioned tables.
> Hive might not have gathered statistics for every partition. When the
> collection is partial, we will not use the Hive-generated statistics.
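
The all-or-nothing rule described in the issue can be illustrated with a minimal sketch. This is not Spark's implementation: the function names are hypothetical, each partition is modeled as a plain dictionary of its properties, and treating a negative value as "not collected" is an assumption. `totalSize` and `numRows` are the property keys Hive uses for its basic statistics.

```python
from typing import Dict, Mapping, Optional, Sequence

# Property keys Hive uses for its basic statistics.
TOTAL_SIZE = "totalSize"
NUM_ROWS = "numRows"


def _parse_stat(props: Mapping[str, str], key: str) -> Optional[int]:
    """Return the statistic as a non-negative int, or None if absent or invalid."""
    raw = props.get(key)
    if raw is None:
        return None
    try:
        value = int(raw)
    except ValueError:
        return None
    # Assumption for this sketch: a negative value means "not collected".
    return value if value >= 0 else None


def aggregate_partition_stats(
    partitions: Sequence[Mapping[str, str]],
) -> Optional[Dict[str, int]]:
    """Sum per-partition statistics into table-level totals.

    Returns None when there are no partitions or when any partition lacks a
    usable statistic, mirroring the rule that partially collected statistics
    are not used at all.
    """
    if not partitions:
        return None
    totals = {TOTAL_SIZE: 0, NUM_ROWS: 0}
    for props in partitions:
        for key in totals:
            value = _parse_stat(props, key)
            if value is None:
                # Partial collection: a sum over the rest would understate the table.
                return None
            totals[key] += value
    return totals
```

Returning `None` instead of a partial sum is the point of the design: an understated size estimate could mislead a planner more than having no estimate, so a caller would fall back to whatever default it uses when statistics are absent.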



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org



[jira] [Assigned] (SPARK-17603) Utilize Hive-generated Statistics For Partitioned Tables

2016-09-19 Thread Apache Spark (JIRA)

 [ 
https://issues.apache.org/jira/browse/SPARK-17603?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Apache Spark reassigned SPARK-17603:


Assignee: (was: Apache Spark)
