Jasper Knulst created NIFI-6286:
-----------------------------------

             Summary: Make listHDFS work as INPUT_ALLOWED processor
                 Key: NIFI-6286
                 URL: https://issues.apache.org/jira/browse/NIFI-6286
             Project: Apache NiFi
          Issue Type: Improvement
          Components: Core Framework
    Affects Versions: 1.9.2
            Reporter: Jasper Knulst


Currently the ListHDFS processor has a 'Directory' property (the root to start 
the listing from, recursively or not) which accepts only a single static value.

There are many use cases where you would want to crawl multiple root 
directories in sequence. There are two ways to do it:
 # Allow the 'Directory' property to take multiple comma-separated values
 # Refactor ListHDFS as an INPUT_ALLOWED processor and let the 'Directory' 
property use Expression Language (EL) to take directory roots from upstream

Option 1 has serious restrictions: the other configuration (such as recursion, 
filter type and regex) would still be static and shared by all roots, so it may 
become very complex and non-intuitive and require frequent reconfiguration.

Option 2 is the way to go.
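
As a rough illustration of option 2, the sketch below resolves the listing 
settings once per incoming flowfile. It is plain Java and does not use the NiFi 
API; the class ListingRequest, the attribute names and the simplified 
"${name}" lookup are assumptions made for this example only, not how NiFi 
evaluates Expression Language.

```java
import java.util.Map;

// Hypothetical value object for one listing request; not part of the NiFi API.
final class ListingRequest {
    final String directory;
    final boolean recursive;
    final String fileFilterRegex;

    private ListingRequest(String directory, boolean recursive, String fileFilterRegex) {
        this.directory = directory;
        this.recursive = recursive;
        this.fileFilterRegex = fileFilterRegex;
    }

    // Toy stand-in for Expression Language: a value of the exact form
    // "${name}" is looked up in the attributes; anything else is literal.
    static String resolve(String configured, Map<String, String> attributes) {
        if (configured.startsWith("${") && configured.endsWith("}")) {
            String name = configured.substring(2, configured.length() - 1);
            String value = attributes.get(name);
            if (value == null) {
                throw new IllegalArgumentException("Missing attribute: " + name);
            }
            return value;
        }
        return configured;
    }

    // Builds the request for one incoming flowfile. With literal property
    // values the attributes are ignored, which matches a static configuration.
    static ListingRequest from(String directory, String recursive, String regex,
                               Map<String, String> attributes) {
        return new ListingRequest(
                resolve(directory, attributes),
                Boolean.parseBoolean(resolve(recursive, attributes)),
                resolve(regex, attributes));
    }
}
```

With literal values such as "/data" the result does not depend on any upstream 
flowfile, so a static configuration keeps working the way it does today.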

Some things to consider:

-The current behavior of ListHDFS should be preserved

-It makes sense to set 'Directory', 'Recursiveness', 'Regex' and 'Filter type' 
dynamically and in tandem, so that the way each root directory is crawled can 
be specified in detail

-Switching 'Directory' dynamically also requires storing a separate state for 
every directory that has ever been listed, not just a single state
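
The last point could look roughly like the sketch below. It is plain Java 
rather than NiFi's state management API, and it assumes that the state kept per 
directory is the latest modification timestamp already listed; the class 
DirectoryListingState and its methods are hypothetical names for illustration.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical per-directory listing state, keyed by root directory path.
final class DirectoryListingState {
    private final Map<String, Long> latestTimestampByDirectory = new HashMap<>();

    // True if a file with this modification time is newer than anything
    // already recorded for the directory, or if the directory is unknown.
    boolean isNew(String directory, long modifiedMillis) {
        Long latest = latestTimestampByDirectory.get(directory);
        return latest == null || modifiedMillis > latest;
    }

    // Records a listed file, keeping the highest timestamp per directory.
    void recordListed(String directory, long modifiedMillis) {
        latestTimestampByDirectory.merge(directory, modifiedMillis, Long::max);
    }
}
```

Because each root has its own entry, listing one directory does not affect 
which files count as new in another.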



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
