[
https://issues.apache.org/jira/browse/HIVE-22964?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17052059#comment-17052059
]
Aditya Shah edited comment on HIVE-22964 at 3/5/20, 11:59 AM:
--------------------------------------------------------------
Hi [~pvary],
* I've propagated ugi considering HIVE-13120. Hence a separate class too.
* MMPathInfo is required as we'll populate two lists shared across threads.
I will correct considering the rest of your comments and upload a patch again.
Thanks!
was (Author: aditya-shah):
Hi [~pvary],
* I've propagated ugi considering HIVE-13120. Hence a separate class too.
* MMPathInfo is required as we'll populate two lists shared across threads.
I cill correct considering the rest of your comments and upload a patch again.
Thanks!
> MM table split computation is very slow
> ---------------------------------------
>
> Key: HIVE-22964
> URL: https://issues.apache.org/jira/browse/HIVE-22964
> Project: Hive
> Issue Type: Improvement
> Reporter: Aditya Shah
> Assignee: Aditya Shah
> Priority: Major
> Attachments: HIVE-22964.patch
>
>
> Since for MM table we process the paths prior to inputFormat.getSplits() we
> end up doing listing on the whole table at once. This could be optimized.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)