[
https://issues.apache.org/jira/browse/SPARK-35437?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17347017#comment-17347017
]
Apache Spark commented on SPARK-35437:
--------------------------------------
User 'cxzl25' has created a pull request for this issue:
https://github.com/apache/spark/pull/32583
> Hive partition filtering client optimization
> --------------------------------------------
>
> Key: SPARK-35437
> URL: https://issues.apache.org/jira/browse/SPARK-35437
> Project: Spark
> Issue Type: Sub-task
> Components: SQL
> Affects Versions: 3.1.1
> Reporter: dzcxzl
> Priority: Minor
>
> When we have a table with a lot of partitions and there is no way to filter
> it on the MetaStore Server, we will get all the partition details and filter
> it on the client side. This is slow and puts a lot of pressure on the
> MetaStore Server.
> We can first pull all the partition names, filter by expressions, and then
> obtain detailed information about the corresponding partitions from the
> MetaStore Server.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]