[
https://issues.apache.org/jira/browse/HADOOP-2566?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12559760#action_12559760
]
Hairong Kuang commented on HADOOP-2566:
---------------------------------------
Thanks Raghu! Your comment relieved my mind. I don't want to have a wrong
algorithm right before the feature freeze.
Regarding the regression, yes, I removed what HADOOP-2151 did because I think
it is not efficient to call exists for each component when there is a glob in
the path. My algorithm depends on getFileStatus throwing an exception to
indicate a non-existent path. It works on dfs, but
LocalFileSystem.getFileStatus returns a valid FileStatus object even when the
path does not exist. I will fix this. I'd like to change the semantics of
getFileStatus to return null on a non-existent path. Thanks for helping me
test this feature.
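To make the two "missing path" contracts discussed above concrete, here is a minimal sketch. It does not use Hadoop classes: ToyFs, statusOrThrow, statusOrNull and the resolve helpers are hypothetical names invented for illustration, and a plain String stands in for a FileStatus. It only shows how a glob-style resolver might filter candidate paths under each contract, not how the patch actually does it.

```java
import java.io.FileNotFoundException;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Set;

public class GlobProbeSketch {
    /** Hypothetical stand-in for a file system; not Hadoop's FileSystem. */
    static class ToyFs {
        private final Set<String> paths = new HashSet<>();

        ToyFs(String... existing) {
            paths.addAll(Arrays.asList(existing));
        }

        /** Contract A: a missing path is reported with an exception. */
        String statusOrThrow(String path) throws FileNotFoundException {
            if (!paths.contains(path)) {
                throw new FileNotFoundException(path);
            }
            return "status:" + path;
        }

        /** Contract B: a missing path is reported by returning null. */
        String statusOrNull(String path) {
            return paths.contains(path) ? "status:" + path : null;
        }
    }

    /** Keeps only candidates that exist, relying on the exception. */
    static List<String> resolveWithThrow(ToyFs fs, List<String> candidates) {
        List<String> found = new ArrayList<>();
        for (String c : candidates) {
            try {
                found.add(fs.statusOrThrow(c));
            } catch (FileNotFoundException e) {
                // Missing candidate: skip it, no separate existence check.
            }
        }
        return found;
    }

    /** Keeps only candidates that exist, relying on the null result. */
    static List<String> resolveWithNull(ToyFs fs, List<String> candidates) {
        List<String> found = new ArrayList<>();
        for (String c : candidates) {
            String status = fs.statusOrNull(c);
            if (status != null) {
                found.add(status);
            }
        }
        return found;
    }

    public static void main(String[] args) {
        ToyFs fs = new ToyFs("/a/x", "/b/x");
        List<String> candidates = Arrays.asList("/a/x", "/c/x", "/b/x");
        System.out.println(resolveWithThrow(fs, candidates));
        System.out.println(resolveWithNull(fs, candidates));
    }
}
```

Either contract lets the resolver make one status call per candidate instead of an existence check followed by a status call. The failure mode described above corresponds to a third case the sketch does not model: a file system that returns a non-null object for a missing path, which neither loop can detect.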
> need FileSystem#globStatus method
> ---------------------------------
>
> Key: HADOOP-2566
> URL: https://issues.apache.org/jira/browse/HADOOP-2566
> Project: Hadoop
> Issue Type: Improvement
> Components: fs
> Reporter: Doug Cutting
> Assignee: Hairong Kuang
> Fix For: 0.16.0
>
> Attachments: globStatus.patch, globStatus1.patch
>
>
> To remove the cache of FileStatus in DFSPath (HADOOP-2565) without hurting
> performance, we must use file enumeration APIs that return FileStatus[]
> rather than Path[]. Currently we have FileSystem#globPaths(), but that
> method should be deprecated and replaced with a FileSystem#globStatus().
> We need to deprecate FileSystem#globPaths() in 0.16 in order to remove the
> cache in 0.17.
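The performance argument in the description can be sketched as follows. This is a toy model, not Hadoop code: ToyStore, ToyStatus, listPaths, listStatus and lookup are hypothetical names, a simple prefix match stands in for glob matching, and the lookups counter stands in for whatever per-path cost (for example a remote call) the cache was hiding.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

public class StatusListingSketch {
    /** Hypothetical metadata record; stands in for a FileStatus-like object. */
    static class ToyStatus {
        final String path;
        final long length;

        ToyStatus(String path, long length) {
            this.path = path;
            this.length = length;
        }
    }

    /** Hypothetical store that counts individual metadata lookups. */
    static class ToyStore {
        private final Map<String, Long> lengths = new LinkedHashMap<>();
        int lookups = 0;

        void add(String path, long length) {
            lengths.put(path, length);
        }

        /** Path-only listing: callers must fetch each status afterwards. */
        List<String> listPaths(String prefix) {
            List<String> out = new ArrayList<>();
            for (String p : lengths.keySet()) {
                if (p.startsWith(prefix)) {
                    out.add(p);
                }
            }
            return out;
        }

        /** One metadata lookup for a single, known-to-exist path. */
        ToyStatus lookup(String path) {
            lookups++;
            return new ToyStatus(path, lengths.get(path));
        }

        /** Status listing: metadata is returned with the listing itself. */
        List<ToyStatus> listStatus(String prefix) {
            List<ToyStatus> out = new ArrayList<>();
            for (Map.Entry<String, Long> e : lengths.entrySet()) {
                if (e.getKey().startsWith(prefix)) {
                    out.add(new ToyStatus(e.getKey(), e.getValue()));
                }
            }
            return out;
        }
    }

    /** Sums lengths via a path listing plus one lookup per path. */
    static long totalViaPaths(ToyStore store, String prefix) {
        long total = 0;
        for (String p : store.listPaths(prefix)) {
            total += store.lookup(p).length;
        }
        return total;
    }

    /** Sums lengths directly from the status listing, with no extra lookups. */
    static long totalViaStatus(ToyStore store, String prefix) {
        long total = 0;
        for (ToyStatus s : store.listStatus(prefix)) {
            total += s.length;
        }
        return total;
    }

    public static void main(String[] args) {
        ToyStore store = new ToyStore();
        store.add("/logs/a", 10);
        store.add("/logs/b", 20);
        store.add("/tmp/c", 5);
        System.out.println(totalViaPaths(store, "/logs/") + " lookups=" + store.lookups);
        System.out.println(totalViaStatus(store, "/logs/") + " lookups=" + store.lookups);
    }
}
```

In this model the path-returning listing costs one extra lookup per matched file, while the status-returning listing costs none, which is the reason given above for preferring a FileStatus[] result once the cache is gone.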
--
This message is automatically generated by JIRA.