[
https://issues.apache.org/jira/browse/SPARK-27602?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16833209#comment-16833209
]
angerszhu commented on SPARK-27602:
-----------------------------------
[~hyukjin.kwon]
The first step result is just like this. The implementation is not very
elegant since for multi partition hive scan, we must re-calculate the column
stats
!image-2019-05-05-11-46-41-240.png!
> SparkSQL CBO can't get true size of partition table after partition pruning
> ---------------------------------------------------------------------------
>
> Key: SPARK-27602
> URL: https://issues.apache.org/jira/browse/SPARK-27602
> Project: Spark
> Issue Type: Improvement
> Components: SQL
> Affects Versions: 2.2.0, 2.3.0, 2.4.0
> Reporter: angerszhu
> Priority: Major
> Attachments: image-2019-05-05-11-46-41-240.png
>
>
> When I want to do extract a cost of one sql for myself's cost framework, I
> found that CBO can't get true size of partition table since when partition
> pruning is true. we just need corresponding partition's size. It just use the
> tables's statistic.
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]