[ 
https://issues.apache.org/jira/browse/NIFI-4524?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jeff Storck reassigned NIFI-4524:
---------------------------------

    Assignee: Jeff Storck

> ListHDFS should allow for glob patterns
> ---------------------------------------
>
>                 Key: NIFI-4524
>                 URL: https://issues.apache.org/jira/browse/NIFI-4524
>             Project: Apache NiFi
>          Issue Type: Improvement
>          Components: Core Framework
>    Affects Versions: 1.4.0
>            Reporter: Benjamin Garrett
>            Assignee: Jeff Storck
>            Priority: Minor
>
> ListHDFS does not support glob patterns.  If it did, then I could consolidate 
> a bunch of ListHDFS instances in my flow down to a single ListHDFS instance.  
> This is because there are some directory structures that are easier to deal 
> with as a glob:
> /data1/avro
> /data1/orc
> /data2/avro
> /data2/orc
> If I wanted to process all 'avro' dirs, then I could use a glob like this:  
> /*/avro
> But since ListHDFS doesn't support globs, I need a separate ListHDFS instance 
> for each directory, which leaves more processors to maintain.
> Fortunately, org.apache.hadoop.fs.FileSystem provides globStatus() as a 
> glob-aware alternative to listStatus().
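
For illustration only, here is a minimal sketch of how the glob /*/avro 
selects directories from the layout described above. It uses the JDK's 
java.nio PathMatcher as a stand-in so that it runs without a Hadoop cluster; 
it is not the ListHDFS implementation, and the class and method names 
(GlobSketch, matching) are hypothetical. Against HDFS the analogous call 
would presumably be FileSystem.globStatus(new Path("/*/avro")). Hadoop's glob 
syntax is not identical to java.nio's, but in both a single * does not cross 
a directory separator. The sketch assumes a Unix-like default file system.

```java
import java.nio.file.FileSystems;
import java.nio.file.PathMatcher;
import java.nio.file.Paths;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical helper: filters directory names with a glob, standing in for
// what a glob-aware listing would return. Not part of NiFi or Hadoop.
public class GlobSketch {

    // Returns the entries of dirs that match the glob, in their original order.
    static List<String> matching(String glob, List<String> dirs) {
        PathMatcher matcher =
                FileSystems.getDefault().getPathMatcher("glob:" + glob);
        List<String> matched = new ArrayList<>();
        for (String dir : dirs) {
            // '*' matches within one path component only, so "/*/avro"
            // accepts "/data1/avro" but not "/data1/orc".
            if (matcher.matches(Paths.get(dir))) {
                matched.add(dir);
            }
        }
        return matched;
    }

    public static void main(String[] args) {
        // The four directories from the issue description.
        List<String> dirs = Arrays.asList(
                "/data1/avro", "/data1/orc", "/data2/avro", "/data2/orc");
        System.out.println(matching("/*/avro", dirs));
    }
}
```

With a single pattern covering both avro directories, one ListHDFS instance 
configured with that glob could replace the per-directory instances.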



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
