[ https://issues.apache.org/jira/browse/NIFI-4524?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Jeff Storck reassigned NIFI-4524:
---------------------------------
Assignee: (was: Jeff Storck)
> ListHDFS should allow for glob patterns
> ---------------------------------------
>
> Key: NIFI-4524
> URL: https://issues.apache.org/jira/browse/NIFI-4524
> Project: Apache NiFi
> Issue Type: Improvement
> Components: Core Framework
> Affects Versions: 1.4.0
> Reporter: Benjamin Garrett
> Priority: Minor
>
> ListHDFS does not support glob patterns. If it did, then I could consolidate
> a bunch of ListHDFS instances in my flow down to a single ListHDFS instance.
> This is because there are some directory structures that are easier to deal
> with as a glob:
> /data1/avro
> /data1/orc
> /data2/avro
> /data2/orc
> If I wanted to process all 'avro' dirs, then I could use a glob like this:
> /*/avro
> But since ListHDFS doesn't support globs, I need multiple ListHDFS instances
> for this situation, which means more to maintain.
> Fortunately, org.apache.hadoop.fs.FileSystem provides globStatus(), which
> accepts a path pattern, as an alternative to listStatus().
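
To illustrate how the glob from the description would narrow the four example directories, here is a minimal sketch. It deliberately uses the JDK's java.nio glob matcher as a stand-in so it runs without a Hadoop cluster; it is not Hadoop's globStatus() and not ListHDFS code, and the class name GlobSketch and method select are hypothetical names chosen for this example. The sketch assumes a Unix-style default file system.

```java
import java.nio.file.FileSystems;
import java.nio.file.PathMatcher;
import java.nio.file.Paths;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical helper for illustration only; not part of NiFi or Hadoop.
final class GlobSketch {

    // Returns the entries of dirs that match the glob, in their original order.
    // Uses the JDK glob syntax, where '*' does not cross a '/' boundary.
    static List<String> select(String glob, List<String> dirs) {
        PathMatcher matcher = FileSystems.getDefault().getPathMatcher("glob:" + glob);
        List<String> hits = new ArrayList<>();
        for (String dir : dirs) {
            if (matcher.matches(Paths.get(dir))) {
                hits.add(dir);
            }
        }
        return hits;
    }

    public static void main(String[] args) {
        // The directory layout from the issue description.
        List<String> dirs = Arrays.asList(
                "/data1/avro", "/data1/orc", "/data2/avro", "/data2/orc");
        // One pattern stands in for what would otherwise be one listing per directory.
        System.out.println(select("/*/avro", dirs));
    }
}
```

With the pattern /*/avro, only the two avro directories are selected, which is the consolidation the request is after. In an actual implementation the matching would presumably be delegated to FileSystem.globStatus(Path) rather than done client-side as in this sketch.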