[ 
https://issues.apache.org/jira/browse/HADOOP-3498?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Tom White updated HADOOP-3498:
------------------------------

    Attachment: hadoop-3498-v2.patch

Hairong, thanks for the feedback.

I wanted to avoid expanding every alternation, so that we don't get an 
explosion of paths for patterns like:

{a,b,c,d,e}{f,g,h,i,j}{k,l,m,n,o}

A pattern like this, where no alternation spans a path component, would 
continue to be processed as a single path and matched using regular 
expressions.
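
To illustrate the idea (this is only a rough sketch, not the code in the 
patch): fully expanding the pattern above would give 5 x 5 x 5 = 125 paths. 
The sketch below expands a {..} group only when its body contains a '/', and 
leaves every other group untouched. The class and method names are 
hypothetical, and escaped braces are assumed not to occur.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper for illustration; not the class added by the patch.
class GlobExpansionSketch {

    // Expands only {..} groups whose body contains '/'. Other groups stay
    // in the pattern so they can still be matched as a regular expression.
    static List<String> expand(String pattern) {
        List<String> result = new ArrayList<>();
        int open = findSlashGroup(pattern);
        if (open < 0) {
            result.add(pattern);              // nothing left to expand
            return result;
        }
        int close = matchingClose(pattern, open);
        String prefix = pattern.substring(0, open);
        String suffix = pattern.substring(close + 1);
        for (String alt : splitTopLevel(pattern.substring(open + 1, close))) {
            result.addAll(expand(prefix + alt + suffix));
        }
        return result;
    }

    // Index of the first outermost '{' whose group contains '/', or -1.
    static int findSlashGroup(String p) {
        for (int i = 0; i < p.length(); i++) {
            if (p.charAt(i) != '{') continue;
            int close = matchingClose(p, i);
            if (close < 0) return -1;         // unbalanced: treat as literal
            if (p.substring(i + 1, close).indexOf('/') >= 0) return i;
            i = close;                        // skip a group without '/'
        }
        return -1;
    }

    // Index of the '}' that closes the '{' at position open, or -1.
    static int matchingClose(String p, int open) {
        int depth = 0;
        for (int i = open; i < p.length(); i++) {
            char c = p.charAt(i);
            if (c == '{') depth++;
            else if (c == '}' && --depth == 0) return i;
        }
        return -1;
    }

    // Splits a group body on commas that are not inside a nested group.
    static List<String> splitTopLevel(String body) {
        List<String> parts = new ArrayList<>();
        int depth = 0, start = 0;
        for (int i = 0; i < body.length(); i++) {
            char c = body.charAt(i);
            if (c == '{') depth++;
            else if (c == '}') depth--;
            else if (c == ',' && depth == 0) {
                parts.add(body.substring(start, i));
                start = i + 1;
            }
        }
        parts.add(body.substring(start));
        return parts;
    }

    public static void main(String[] args) {
        System.out.println(expand("{/a/b,/c/d}"));
        System.out.println(expand("{a,b,c,d,e}{f,g,h,i,j}{k,l,m,n,o}"));
    }
}
```

With this sketch, {/a/b,/c/d} becomes the two paths /a/b and /c/d, while the 
three-group pattern above comes back unchanged as a single path.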

The patch did scan filePattern multiple times, so I've changed it to skip 
the parts of the path that have already been expanded. This greatly reduces 
the number of character scans.
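
As a rough sketch of why resuming helps (again, not the patch itself): once 
a group at some offset has been expanded, everything before that offset is 
already known to contain no expandable group, so the next scan can start 
there rather than at position 0. The class name, the resume flag and the 
scanned counter below are hypothetical and exist only for the comparison. 
Escaped braces are assumed not to occur.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical illustration; names and the counter are not from the patch.
class OffsetScanSketch {

    // Positions visited by the outer scan during the last expand() call.
    static long scanned = 0;

    static List<String> expand(String pattern, boolean resume) {
        scanned = 0;
        List<String> out = new ArrayList<>();
        expandFrom(pattern, 0, resume, out);
        return out;
    }

    private static void expandFrom(String p, int from, boolean resume,
                                   List<String> out) {
        int open = -1, close = -1;
        for (int i = from; i < p.length() && open < 0; i++) {
            scanned++;
            if (p.charAt(i) != '{') continue;
            int end = matchingClose(p, i);
            if (end < 0) break;               // unbalanced: treat as literal
            if (p.substring(i + 1, end).indexOf('/') >= 0) {
                open = i;
                close = end;
            } else {
                i = end;                      // skip a group without '/'
            }
        }
        if (open < 0) {
            out.add(p);
            return;
        }
        String prefix = p.substring(0, open);
        String suffix = p.substring(close + 1);
        for (String alt : splitTopLevel(p.substring(open + 1, close))) {
            // The prefix holds no expandable group, so resume at 'open'.
            expandFrom(prefix + alt + suffix, resume ? open : 0, resume, out);
        }
    }

    private static int matchingClose(String p, int open) {
        int depth = 0;
        for (int i = open; i < p.length(); i++) {
            char c = p.charAt(i);
            if (c == '{') depth++;
            else if (c == '}' && --depth == 0) return i;
        }
        return -1;
    }

    private static List<String> splitTopLevel(String body) {
        List<String> parts = new ArrayList<>();
        int depth = 0, start = 0;
        for (int i = 0; i < body.length(); i++) {
            char c = body.charAt(i);
            if (c == '{') depth++;
            else if (c == '}') depth--;
            else if (c == ',' && depth == 0) {
                parts.add(body.substring(start, i));
                start = i + 1;
            }
        }
        parts.add(body.substring(start));
        return parts;
    }
}
```

Calling expand(pattern, false) and then expand(pattern, true) on a pattern 
with several slash-containing groups gives the same list of paths both 
times, and comparing the scanned counter after each call shows how much 
rescanning the offset avoids.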

> File globbing alternation should be able to span path components
> ----------------------------------------------------------------
>
>                 Key: HADOOP-3498
>                 URL: https://issues.apache.org/jira/browse/HADOOP-3498
>             Project: Hadoop Core
>          Issue Type: Improvement
>          Components: fs
>            Reporter: Tom White
>            Assignee: Tom White
>         Attachments: hadoop-3498-v2.patch, hadoop-3498.patch
>
>
> For example, {/a/b,/c/d} should expand to /a/b and /c/d. This change would 
> also permit a consistent syntax for specifying multiple input paths to 
> MapReduce, streaming and Pig by specification of a single glob path with 
> alternation {/a/b,/c/d}, rather than a collection of comma separated glob 
> paths /a/b,/c/d.
> This change would also make globbing more consistent with bash, which 
> supports this feature.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
