[
https://issues.apache.org/jira/browse/HADOOP-17491?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Steve Loughran updated HADOOP-17491:
------------------------------------
Description:
Azure storage and the abfs client offer recursive listings, but currently
there's no listFiles() method to connect to it instead a recursive treewalk is
used.
Implementing a listFiles(path, recursive) could offer significant speedups to
applications which use it, for example HIVE-24669, and again, async prefetching
could provide even more benefits
Would need changes in
{{org.apache.hadoop.fs.azurebfs.AzureBlobFileSystemStore}} to join everything up
was:
Azure storage and the abfs client offer recursive listings, but currently
there's no listFiles() method to connect to it; instead the base FileSystem
implementation is used, which uses listStatus and treewalks.
Implementing a listFiles(path, recursive) could offer significant speedups to
applications which use it, for example HIVE-24669.
> Implement listFiles(path, recursive=true)
> -----------------------------------------
>
> Key: HADOOP-17491
> URL: https://issues.apache.org/jira/browse/HADOOP-17491
> Project: Hadoop Common
> Issue Type: Sub-task
> Components: fs/azure
> Affects Versions: 3.3.0
> Reporter: Steve Loughran
> Priority: Major
>
> Azure storage and the abfs client offer recursive listings, but currently
> there's no listFiles() method to connect to it instead a recursive treewalk
> is used.
> Implementing a listFiles(path, recursive) could offer significant speedups to
> applications which use it, for example HIVE-24669, and again, async
> prefetching could provide even more benefits
> Would need changes in
> {{org.apache.hadoop.fs.azurebfs.AzureBlobFileSystemStore}} to join everything
> up
--
This message was sent by Atlassian Jira
(v8.3.4#803005)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]