wecharyu opened a new pull request, #39115:
URL: https://github.com/apache/spark/pull/39115

   ### What changes were proposed in this pull request?
   
   This PR extends the `MSCK REPAIR TABLE` statement with an optional partition spec, so that a repair can be filtered to specific partitions:
    `MSCK REPAIR TABLE table_identifier [{ADD|DROP|SYNC} PARTITIONS [partition_spec]]`
   **partition_spec**:  ` ( partition_col_name = partition_col_val [ , ... ] )`
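
As an illustration of the grammar above, here is a minimal Python sketch that renders statements in the proposed form. It only builds strings and does not talk to Spark. The helper name `build_msck_repair`, the table name `logs`, and the partition columns `dt` and `hour` are hypothetical and not part of this PR; the value quoting is a simplifying assumption.

```python
from typing import Mapping, Optional, Union

PartitionValue = Union[str, int]

# Actions listed in the proposed grammar: {ADD|DROP|SYNC}
_ACTIONS = ("ADD", "DROP", "SYNC")


def _render_value(value: PartitionValue) -> str:
    """Render a partition value as a SQL literal (illustration only)."""
    if isinstance(value, str):
        # Assumption: refuse values containing quotes instead of guessing
        # at an escaping rule.
        if "'" in value:
            raise ValueError("quotes in partition values are not handled here")
        return f"'{value}'"
    return str(value)


def build_msck_repair(
    table: str,
    action: Optional[str] = None,
    partition_spec: Optional[Mapping[str, PartitionValue]] = None,
) -> str:
    """Build a statement following the syntax proposed in this PR."""
    stmt = f"MSCK REPAIR TABLE {table}"
    if action is None:
        # In the grammar, partition_spec may only follow "... PARTITIONS".
        if partition_spec:
            raise ValueError("partition_spec requires ADD, DROP or SYNC")
        return stmt
    action = action.upper()
    if action not in _ACTIONS:
        raise ValueError(f"unknown action: {action}")
    stmt += f" {action} PARTITIONS"
    if partition_spec:
        pairs = ", ".join(
            f"{col} = {_render_value(val)}" for col, val in partition_spec.items()
        )
        stmt += f" ({pairs})"
    return stmt


if __name__ == "__main__":
    print(build_msck_repair("logs"))
    print(build_msck_repair("logs", "ADD"))
    print(build_msck_repair("logs", "SYNC", {"dt": "2022-12-01", "hour": 3}))
```

The last call renders a statement with a two-column partition spec, which is the case the new filter is meant to cover: only partitions matching `dt` and `hour` would be considered for repair.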
   
   ### Why are the changes needed?
   1. Listing all partitions may hit the request limit in some metastores, such as the Hive metastore.
   2. A partition filter can improve performance when only specific partitions need to be repaired.
   
   
   ### Does this PR introduce _any_ user-facing change?
   Yes. The documentation of `MSCK REPAIR TABLE` is updated to describe the new partition filter options.
   
   
   ### How was this patch tested?
   Unit tests.
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

