[ 
https://issues.apache.org/jira/browse/HADOOP-2151?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

lohit vijayarenu updated HADOOP-2151:
-------------------------------------

    Attachment: HADOOP-2151.patch

Attached is a patch which addresses this problem. globPaths now calls exists() 
to check a path, but only for those paths that were expanded via listPaths in 
the previous iteration. This is done by passing a new flag to the recursive 
function indicating whether the previous component was a glob.
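
To illustrate the idea, here is a minimal sketch and not the code in the attached patch. It assumes a hypothetical in-memory class, GlobSketch, whose exists() and listPaths() only stand in for the real FileSystem calls, and it supports only the '*' wildcard. The parentWasGlob parameter plays the role of the new flag: a literal component is checked with exists() only when the component before it was expanded from a glob.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Set;
import java.util.TreeSet;
import java.util.regex.Pattern;

/** Hypothetical in-memory stand-in for a file system; NOT the Hadoop API. */
public class GlobSketch {
    private final Set<String> paths = new TreeSet<>();

    /** Registers an absolute path together with all of its ancestors. */
    public void add(String path) {
        String current = "";
        for (String part : path.substring(1).split("/")) {
            current = current + "/" + part;
            paths.add(current);
        }
    }

    public boolean exists(String path) {
        return paths.contains(path);
    }

    /** Returns the direct children of dir ("" denotes the root). */
    public List<String> listPaths(String dir) {
        String prefix = dir + "/";
        List<String> children = new ArrayList<>();
        for (String p : paths) {
            if (p.startsWith(prefix) && p.indexOf('/', prefix.length()) < 0) {
                children.add(p);
            }
        }
        return children;
    }

    /** Expands an absolute pattern such as "/user/foo/*" + "/FILE1". */
    public List<String> globPaths(String pattern) {
        List<String> results = new ArrayList<>();
        expand("", pattern.substring(1).split("/"), 0, false, results);
        return results;
    }

    private void expand(String parent, String[] components, int level,
                        boolean parentWasGlob, List<String> results) {
        if (level == components.length) {
            results.add(parent);
            return;
        }
        String component = components[level];
        if (component.contains("*")) {
            // Glob component: children come from listPaths, so they exist.
            for (String child : listPaths(parent)) {
                String name = child.substring(child.lastIndexOf('/') + 1);
                if (matches(component, name)) {
                    expand(child, components, level + 1, true, results);
                }
            }
        } else {
            String child = parent + "/" + component;
            // Validate only below a glob-expanded parent, as the comment describes.
            if (parentWasGlob && !exists(child)) {
                return;
            }
            expand(child, components, level + 1, false, results);
        }
    }

    /** Matches a name against a glob that may contain '*' only. */
    private static boolean matches(String glob, String name) {
        String regex = Pattern.quote(glob).replace("*", "\\E.*\\Q");
        return name.matches(regex);
    }

    public static void main(String[] args) {
        GlobSketch fs = new GlobSketch();
        fs.add("/user/foo/DIR1/FILE1");
        fs.add("/user/foo/DIR2");
        // prints [/user/foo/DIR1/FILE1]
        System.out.println(fs.globPaths("/user/foo/*/FILE1"));
    }
}
```

With the directory structure from the issue description, the sketch drops /user/foo/DIR2/FILE1 because DIR2 came from a glob expansion and FILE1 does not exist under it. A literal component that follows another literal is not checked in this sketch, which keeps the number of exists() calls small.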

> FileSystem.globPaths does not validate the return list of Paths
> ---------------------------------------------------------------
>
>                 Key: HADOOP-2151
>                 URL: https://issues.apache.org/jira/browse/HADOOP-2151
>             Project: Hadoop
>          Issue Type: Bug
>          Components: dfs
>    Affects Versions: 0.14.3, 0.15.0
>            Reporter: lohit vijayarenu
>             Fix For: 0.16.0
>
>         Attachments: HADOOP-2151.patch
>
>
> FileSystem.globPaths does not validate the return list of Paths.
> Here is an example. 
> Consider a directory structure like
> /user/foo/DIR1/FILE1
> /user/foo/DIR2
> Now, if we pass an input path like "/user/foo/*/FILE1" to 
> FileSystem.globPaths(),
> it returns 2 entries, as shown below:
> /user/foo/DIR1/FILE1
> /user/foo/DIR2/FILE1
> Should globPaths validate this and return only valid Paths? This behavior was 
> caught in FileSystem.validateInput() where an IOException is thrown while 
> processing such a directory structure.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
