[ 
https://issues.apache.org/jira/browse/HADOOP-8906?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13473802#comment-13473802
 ] 

Jason Lowe commented on HADOOP-8906:
------------------------------------

Thanks for the updates, Daryn.  I'm eager to see the Jenkins results, although 
it seems the Jenkins build is stuck right now.

After a closer look, I'm wondering if there's one more subtle difference 
between the old and new versions, this time when the user specifies a filter.  
I think the old version returns null for a non-globbed path that finds a file 
which then fails the specified filter, because it applies the filter as it 
searches, before it checks for the empty-array-should-return-null case.  The 
new version applies the specified filter *after* it checks whether there are 
matches, and in this scenario there will be a match since the filter hasn't 
been applied yet, so it would presumably return an empty array instead of 
null.  I'm wondering if we should apply the filter before checking for an 
empty match array to align with the old behavior.  Granted, it's odd to 
provide a non-globbed path that wouldn't pass the filter, so it's a corner 
case.
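
To make the ordering concern concrete, here is a minimal sketch.  It is not 
the actual FileSystem code from either version of the patch: the class and 
method names are hypothetical, paths are modeled as plain strings, and the 
filter is a simple predicate.  It only shows how moving the filter relative 
to the empty check changes the result for a single non-globbed match.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.function.Predicate;

// Hypothetical model of the two orderings; not the real Hadoop implementation.
public class GlobFilterOrder {

    // Old-style ordering (as described above): filter while collecting,
    // then map an empty result to null.
    static String[] filterThenCheck(List<String> found, Predicate<String> filter) {
        List<String> kept = new ArrayList<>();
        for (String path : found) {
            if (filter.test(path)) {
                kept.add(path);
            }
        }
        return kept.isEmpty() ? null : kept.toArray(new String[0]);
    }

    // New-style ordering (as described above): decide null vs. array from the
    // unfiltered matches first, then apply the filter.
    static String[] checkThenFilter(List<String> found, Predicate<String> filter) {
        if (found.isEmpty()) {
            return null;
        }
        List<String> kept = new ArrayList<>();
        for (String path : found) {
            if (filter.test(path)) {
                kept.add(path);
            }
        }
        return kept.toArray(new String[0]);
    }

    public static void main(String[] args) {
        // One non-globbed path that exists but is rejected by the filter.
        List<String> found = Arrays.asList("/data/file.tmp");
        Predicate<String> filter = p -> !p.endsWith(".tmp");

        System.out.println(filterThenCheck(found, filter) == null); // true
        System.out.println(checkThenFilter(found, filter).length);  // 0
    }
}
```

In this model the first ordering yields null and the second yields an empty 
array, which is the mismatch I'm asking about.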

Otherwise everything looks great, pending Jenkins since it found some things in 
the last run that we missed.
                
> paths with multiple globs are unreliable
> ----------------------------------------
>
>                 Key: HADOOP-8906
>                 URL: https://issues.apache.org/jira/browse/HADOOP-8906
>             Project: Hadoop Common
>          Issue Type: Bug
>          Components: fs
>    Affects Versions: 0.23.0, 2.0.0-alpha, 3.0.0
>            Reporter: Daryn Sharp
>            Assignee: Daryn Sharp
>            Priority: Critical
>         Attachments: HADOOP-8906.patch, HADOOP-8906.patch, HADOOP-8906.patch, 
> HADOOP-8906.patch
>
>
> Let's say we have a structure of "$date/$user/stuff/file".  Multiple 
> globs are unreliable unless every directory in the structure exists.
> These work:
> date*/user
> date*/user/stuff
> date*/user/stuff/file
> These fail:
> date*/user/*
> date*/user/*/*
> date*/user/stu*
> date*/user/stu*/*
> date*/user/stu*/file
> date*/user/stuff/*
> date*/user/stuff/f*
