Alessandro D'Armiento created NIFI-6464:
-------------------------------------------

             Summary: ListHDFS should support fragment attributes with 
strategies
                 Key: NIFI-6464
                 URL: https://issues.apache.org/jira/browse/NIFI-6464
             Project: Apache NiFi
          Issue Type: Improvement
          Components: Core Framework
    Affects Versions: 1.9.2
            Reporter: Alessandro D'Armiento


h2. Current Situation

ListHDFS doesn't support Fragmentation attributes

h2. Improvement Proposal

 * Since the processor works on a 1:N semantic (1 input trigger flowfile, N 
output flowfiles) it would be nice to support fragmentation attributes (for 
example for subsequent merge operations)
 ** It would be also useful to support different fragmentation strategies, in 
order to support multiple user cases. For example, it should be possible to 
select:
 *** A "one for all" fragmentation strategy which will create a single 
fragmentation group. Therefore, all files will have the same 
fragment.identifier, the same fragment.count, equal to the total number N of 
listed files, and fragment.index ∈ [0, N).
 *** A "per subdir" fragmentation strategy which will create different 
fragmentation groups, one for each scanned subdirectory of the given path. 
Therefore, for each subfolder, flowfiles will have a specific 
fragment.identifier, fragment.count will be, for each flowfile, equal to the 
number Ni of files in the i-th directory, and fragment.index ∈ [0, Ni).



--
This message was sent by Atlassian JIRA
(v7.6.14#76016)

Reply via email to