[ https://issues.apache.org/jira/browse/HADOOP-2566?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12559723#action_12559723 ]
Doug Cutting commented on HADOOP-2566:
--------------------------------------

> Is this what we wanted? I thought we wanted other way around.

I don't think it does that in all cases, but it does still appear to call getStatus() in places. I've not yet examined the logic to see if that's easily avoidable or not. But it's not a fatal problem at this point. For this release the important thing is to have globStatus() as the preferred, non-deprecated method. Once we remove the status cache, during 0.17 development, we'll soon find out whether the globStatus() implementation needs more work to perform well without a cache, and fix that before 0.17 is released. But that aspect shouldn't block this for 0.16, since we still have the cache in 0.16.

> need FileSystem#globStatus method
> ---------------------------------
>
>                 Key: HADOOP-2566
>                 URL: https://issues.apache.org/jira/browse/HADOOP-2566
>             Project: Hadoop
>          Issue Type: Improvement
>          Components: fs
>            Reporter: Doug Cutting
>            Assignee: Hairong Kuang
>             Fix For: 0.16.0
>
>         Attachments: globStatus.patch, globStatus1.patch
>
>
> To remove the cache of FileStatus in DFSPath (HADOOP-2565) without hurting
> performance, we must use file enumeration APIs that return FileStatus[]
> rather than Path[]. Currently we have FileSystem#globPaths(), but that
> method should be deprecated and replaced with a FileSystem#globStatus().
> We need to deprecate FileSystem#globPaths() in 0.16 in order to remove the
> cache in 0.17.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
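
For illustration only, here is a minimal Java sketch of the point the issue description makes about returning FileStatus[] rather than Path[]. It does not use the real Hadoop classes: `ToyFileSystem`, `Status`, the `statusLookups` counter, and the simplified `*`-only glob matching are all hypothetical stand-ins, assumed here just to show why a path-only enumeration forces one extra status lookup per match once no cache is there to absorb it.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.regex.Pattern;

public class GlobStatusSketch {

    /** Hypothetical stand-in for FileStatus: a path plus one piece of metadata. */
    static final class Status {
        final String path;
        final long length;

        Status(String path, long length) {
            this.path = path;
            this.length = length;
        }
    }

    /** Hypothetical in-memory file system that counts single-file status lookups. */
    static final class ToyFileSystem {
        private final Map<String, Long> files = new LinkedHashMap<>();
        int statusLookups = 0; // simulated per-file round trips

        void create(String path, long length) {
            files.put(path, length);
        }

        /** Simplified glob (an assumption): '*' matches any run of non-'/' characters. */
        private static boolean matches(String glob, String path) {
            StringBuilder regex = new StringBuilder();
            boolean first = true;
            for (String literal : glob.split("\\*", -1)) {
                if (!first) {
                    regex.append("[^/]*");
                }
                regex.append(Pattern.quote(literal));
                first = false;
            }
            return path.matches(regex.toString());
        }

        /** Path-only enumeration: callers must fetch each status separately. */
        List<String> globPaths(String glob) {
            List<String> out = new ArrayList<>();
            for (String p : files.keySet()) {
                if (matches(glob, p)) {
                    out.add(p);
                }
            }
            return out;
        }

        /** Single-file status lookup; each call counts as one simulated round trip. */
        Status getStatus(String path) {
            statusLookups++;
            return new Status(path, files.get(path));
        }

        /** Status enumeration: path and metadata come back together. */
        List<Status> globStatus(String glob) {
            List<Status> out = new ArrayList<>();
            for (Map.Entry<String, Long> e : files.entrySet()) {
                if (matches(glob, e.getKey())) {
                    out.add(new Status(e.getKey(), e.getValue()));
                }
            }
            return out;
        }
    }

    /** Builds a small sample tree with made-up paths and sizes. */
    static ToyFileSystem sample() {
        ToyFileSystem fs = new ToyFileSystem();
        fs.create("/logs/a.txt", 10);
        fs.create("/logs/b.txt", 20);
        fs.create("/logs/c.dat", 5);
        fs.create("/logs/sub/d.txt", 7);
        return fs;
    }

    /** Sums lengths using the path-only API plus one status lookup per match. */
    static long totalViaPaths(ToyFileSystem fs, String glob) {
        long total = 0;
        for (String p : fs.globPaths(glob)) {
            total += fs.getStatus(p).length;
        }
        return total;
    }

    /** Sums lengths using the status-returning API, with no extra lookups. */
    static long totalViaStatus(ToyFileSystem fs, String glob) {
        long total = 0;
        for (Status s : fs.globStatus(glob)) {
            total += s.length;
        }
        return total;
    }

    public static void main(String[] args) {
        ToyFileSystem viaPaths = sample();
        long a = totalViaPaths(viaPaths, "/logs/*.txt");
        System.out.println("globPaths:  total=" + a + " lookups=" + viaPaths.statusLookups);

        ToyFileSystem viaStatus = sample();
        long b = totalViaStatus(viaStatus, "/logs/*.txt");
        System.out.println("globStatus: total=" + b + " lookups=" + viaStatus.statusLookups);
    }
}
```

In this toy model both calls compute the same total, but the path-only route performs one additional lookup per matched file, which is the cost the status cache currently hides. Whether the real globStatus() implementation avoids such lookups internally is exactly the open question raised in the comment above.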