Holger Frydrych created NIFI-4434:
-------------------------------------

             Summary: ListHDFS applies File Filter also to subdirectory names 
in recursive search
                 Key: NIFI-4434
                 URL: https://issues.apache.org/jira/browse/NIFI-4434
             Project: Apache NiFi
          Issue Type: Bug
    Affects Versions: 1.3.0
            Reporter: Holger Frydrych


The File Filter regex configured in the ListHDFS processor is applied not just 
to files found, but also to subdirectories. 

If you set up a recursive search to list, e.g., all CSV files in a 
directory hierarchy with a regex like ".*\.csv", it only picks up CSV files 
in the base directory, not in any subdirectory. This is because subdirectory 
names don't typically match that pattern, so the subdirectories are filtered 
out before their contents are listed.

To fix this, either subdirectories should not be matched against the file 
filter, or the file filter should be applied to the full path of all files 
(relative to the base directory). The GetHDFS processor offers both options via 
a switch.
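
The behavior and the two proposed fixes can be illustrated with a small, 
self-contained sketch. This is not the ListHDFS source: the class 
ListingFilterSketch, the Entry type, the Mode enum, and the sample tree are 
all hypothetical names made up for illustration, and an in-memory tree stands 
in for HDFS.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.regex.Pattern;

public class ListingFilterSketch {

    /** Hypothetical stand-in for an HDFS entry; children is null for plain files. */
    static final class Entry {
        final String name;
        final List<Entry> children;

        Entry(String name, List<Entry> children) {
            this.name = name;
            this.children = children;
        }

        boolean isDirectory() { return children != null; }
    }

    static Entry file(String name) { return new Entry(name, null); }

    static Entry dir(String name, Entry... children) {
        return new Entry(name, Arrays.asList(children));
    }

    /** Hypothetical switch between the reported behavior and the two proposed fixes. */
    enum Mode { NAMES_INCLUDING_DIRS, FILE_NAMES_ONLY, FULL_PATH }

    /** Recursively lists matching files as paths relative to the base directory. */
    static List<String> list(Entry dir, String prefix, Pattern filter, Mode mode) {
        List<String> out = new ArrayList<>();
        for (Entry e : dir.children) {
            String relPath = prefix + e.name;
            if (e.isDirectory()) {
                // Reported behavior: the directory name must match the filter too,
                // otherwise the whole subtree is skipped.
                if (mode == Mode.NAMES_INCLUDING_DIRS && !filter.matcher(e.name).matches()) {
                    continue;
                }
                out.addAll(list(e, relPath + "/", filter, mode));
            } else {
                // Fix 1 matches the bare file name; fix 2 matches the relative path.
                String candidate = (mode == Mode.FULL_PATH) ? relPath : e.name;
                if (filter.matcher(candidate).matches()) {
                    out.add(relPath);
                }
            }
        }
        return out;
    }

    /** Sample tree: base/{a.csv, notes.txt, sub/{b.csv, deep/{c.csv}}}. */
    static Entry sampleTree() {
        return dir("base",
                file("a.csv"),
                file("notes.txt"),
                dir("sub", file("b.csv"), dir("deep", file("c.csv"))));
    }

    public static void main(String[] args) {
        Pattern csv = Pattern.compile(".*\\.csv");
        Entry base = sampleTree();
        // prints [a.csv]
        System.out.println(list(base, "", csv, Mode.NAMES_INCLUDING_DIRS));
        // prints [a.csv, sub/b.csv, sub/deep/c.csv]
        System.out.println(list(base, "", csv, Mode.FILE_NAMES_ONLY));
        // prints [sub/b.csv, sub/deep/c.csv]
        System.out.println(list(base, "", Pattern.compile("sub/.*\\.csv"), Mode.FULL_PATH));
    }
}
```

With the name-only filter, "sub" fails ".*\.csv" and its subtree is dropped, 
which matches the symptom described above. Matching against the relative path 
additionally lets a pattern select files by location, at the cost of requiring 
patterns to account for the "/" separators.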



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)
