[
https://issues.apache.org/jira/browse/HIVE-24380?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Mustafa İman updated HIVE-24380:
--------------------------------
Description: NullScanTaskDispatcher does listStatus for hundreds of
partition directories in case of external tables. This is big problem in cloud
installations where directory listings are in object store like S3. We can do
this in parallel. (was: NullScanTaskDispatcher should query listStatus in
parallel as it might take a long time to go through hundreds of partitions
serially. This is big problem in cloud installations where directory listings
are in object store like S3.)
> NullScanTaskDispatcher should liststatus in parallel
> ----------------------------------------------------
>
> Key: HIVE-24380
> URL: https://issues.apache.org/jira/browse/HIVE-24380
> Project: Hive
> Issue Type: Sub-task
> Reporter: Mustafa İman
> Assignee: Mustafa İman
> Priority: Major
>
> NullScanTaskDispatcher does listStatus for hundreds of partition directories
> in case of external tables. This is big problem in cloud installations where
> directory listings are in object store like S3. We can do this in parallel.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)