[
https://issues.apache.org/jira/browse/HIVE-24717?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17277866#comment-17277866
]
Steve Loughran commented on HIVE-24717:
---------------------------------------
Happy to review a Hadoop PR with the relevant fix backported.
> Migrate to listStatusIterator in moving files
> ---------------------------------------------
>
> Key: HIVE-24717
> URL: https://issues.apache.org/jira/browse/HIVE-24717
> Project: Hive
> Issue Type: Improvement
> Reporter: Mustafa İman
> Assignee: Mustafa İman
> Priority: Major
> Labels: pull-request-available
> Time Spent: 40m
> Remaining Estimate: 0h
>
> Hive.java makes various calls to the HDFS listStatus API when moving
> files/directories around. These code paths are used for insert overwrite
> table/partition queries.
> listStatus is a blocking call, whereas listStatusIterator is backed by a
> RemoteIterator and fetches pages in the background. Hive should take
> advantage of that, since Hadoop recently implemented listStatusIterator for S3:
> https://issues.apache.org/jira/browse/HADOOP-17074
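
To make the difference described above concrete, here is a minimal, self-contained sketch. It does not use the Hadoop API: FakeStore, fetchPage, listAll and listIterator are hypothetical names made up for illustration, and a plain java.util.Iterator stands in for Hadoop's RemoteIterator (whose hasNext()/next() can additionally throw IOException). The sketch fetches each page lazily on demand; it does not model the background prefetching mentioned in the description.

```java
import java.util.ArrayList;
import java.util.Iterator;
import java.util.List;
import java.util.NoSuchElementException;

final class PagedListingSketch {

    /** Hypothetical store that serves directory entries one page at a time. */
    static class FakeStore {
        final List<String> entries = new ArrayList<>();
        final int pageSize;
        int pagesFetched = 0; // counts simulated remote round trips

        FakeStore(int count, int pageSize) {
            this.pageSize = pageSize;
            for (int i = 0; i < count; i++) {
                entries.add("part-" + i);
            }
        }

        /** Simulates one remote call returning up to pageSize entries. */
        List<String> fetchPage(int offset) {
            pagesFetched++;
            int end = Math.min(offset + pageSize, entries.size());
            return new ArrayList<>(entries.subList(offset, end));
        }

        /** listStatus-style: every page is fetched before the caller sees anything. */
        List<String> listAll() {
            List<String> all = new ArrayList<>();
            for (int off = 0; off < entries.size(); off += pageSize) {
                all.addAll(fetchPage(off));
            }
            return all;
        }

        /** listStatusIterator-style: a page is fetched only when the caller needs it. */
        Iterator<String> listIterator() {
            return new Iterator<String>() {
                private List<String> page = new ArrayList<>();
                private int posInPage = 0;
                private int nextOffset = 0;

                @Override
                public boolean hasNext() {
                    if (posInPage < page.size()) {
                        return true;
                    }
                    if (nextOffset >= entries.size()) {
                        return false;
                    }
                    page = fetchPage(nextOffset);
                    nextOffset += page.size();
                    posInPage = 0;
                    return !page.isEmpty();
                }

                @Override
                public String next() {
                    if (!hasNext()) {
                        throw new NoSuchElementException();
                    }
                    return page.get(posInPage++);
                }
            };
        }
    }

    public static void main(String[] args) {
        FakeStore eager = new FakeStore(10, 3);
        eager.listAll();
        System.out.println("eager pages fetched up front: " + eager.pagesFetched);

        FakeStore lazy = new FakeStore(10, 3);
        Iterator<String> it = lazy.listIterator();
        String first = it.next();
        System.out.println("lazy pages fetched after " + first + ": " + lazy.pagesFetched);
    }
}
```

With the eager variant the caller waits for the whole listing before moving the first file; with the iterator it can start on the first page while later pages are still outstanding, which is the property the description wants Hive to exploit.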
--
This message was sent by Atlassian Jira
(v8.3.4#803005)