[ https://issues.apache.org/jira/browse/HADOOP-4747?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12652225#action_12652225 ]
electrum edited comment on HADOOP-4747 at 12/1/08 5:04 PM:
-----------------------------------------------------------------
Change the main ls() function and shellListStatus() to take a FileStatus
rather than a Path. shellListStatus() can then return immediately if the
FileStatus is not a directory.
This significantly speeds up -ls when globbing:
hadoop fs -ls /foo/ (fast)
hadoop fs -ls /foo/*.dat (previously made a status call for every file)
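The difference between the two cases above can be sketched roughly as follows. This is a self-contained model, not the attached patch and not Hadoop's API: Status, CountingFs, LsSketch, listByPath() and listByStatus() are hypothetical stand-ins, the "glob" is a plain suffix match, and only non-directory matches are covered. It is meant only to show why carrying the status returned by the glob avoids one backend call per matched file.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical stand-in for a file status entry; not Hadoop's FileStatus.
final class Status {
    final String path;
    final boolean isDir;

    Status(String path, boolean isDir) {
        this.path = path;
        this.isDir = isDir;
    }
}

// Hypothetical backend that counts how many per-path status lookups it serves.
final class CountingFs {
    int statusCalls = 0;
    private final List<Status> entries = new ArrayList<>();

    void add(String path, boolean isDir) {
        entries.add(new Status(path, isDir));
    }

    // Models one backend round trip per call.
    Status getStatus(String path) {
        statusCalls++;
        for (Status s : entries) {
            if (s.path.equals(path)) {
                return s;
            }
        }
        throw new IllegalArgumentException("no such path: " + path);
    }

    // Simplified glob (assumption): suffix match that already returns statuses.
    List<Status> glob(String suffix) {
        List<Status> out = new ArrayList<>();
        for (Status s : entries) {
            if (s.path.endsWith(suffix)) {
                out.add(s);
            }
        }
        return out;
    }
}

final class LsSketch {
    // Path-based variant: keeps only the path and asks the backend again.
    static int listByPath(CountingFs fs, String suffix) {
        int printed = 0;
        for (Status globbed : fs.glob(suffix)) {
            Status again = fs.getStatus(globbed.path); // extra call per match
            if (!again.isDir) {
                printed++;
            }
        }
        return printed;
    }

    // Status-based variant: reuses the status the glob already returned.
    static int listByStatus(CountingFs fs, String suffix) {
        int printed = 0;
        for (Status globbed : fs.glob(suffix)) {
            if (!globbed.isDir) {
                printed++; // not a directory: nothing more to ask the backend
            }
        }
        return printed;
    }

    public static void main(String[] args) {
        CountingFs fs = new CountingFs();
        fs.add("/foo/a.dat", false);
        fs.add("/foo/b.dat", false);
        fs.add("/foo/c.dat", false);

        listByPath(fs, ".dat");
        int pathCalls = fs.statusCalls;

        fs.statusCalls = 0;
        listByStatus(fs, ".dat");

        // prints "3 vs 0"
        System.out.println(pathCalls + " vs " + fs.statusCalls);
    }
}
```

In this model the path-based variant costs one lookup per matched file, while the status-based variant costs none beyond the glob itself. A directory match would still need a listing call in either variant.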
was (Author: electrum):
Change main ls() function and shellListStatus() to take a FileStatus rather
than a Path. shellListStatus() can return immediately if the FileStatus is a
directory.
This has significant speedups for -ls when globbing:
hadoop fs -ls /foo/ (fast)
hadoop fs -ls /foo/*.dat (previously made a status call for every file)
> Reuse FileStatus in FsShell where possible
> ------------------------------------------
>
> Key: HADOOP-4747
> URL: https://issues.apache.org/jira/browse/HADOOP-4747
> Project: Hadoop Core
> Issue Type: Bug
> Components: fs
> Affects Versions: 0.19.0
> Reporter: David Phillips
> Priority: Minor
> Attachments: hadoop-fsshell-reuse-status.patch
>
>
> FsShell should reuse FileStatus objects instead of converting to a Path and
> making extra calls to the backend FS (which can be slow and expensive).