Ian Hellstrom created NIFI-1517:
-----------------------------------

             Summary: Allow SplitContent to be split on a regular expression
                 Key: NIFI-1517
                 URL: https://issues.apache.org/jira/browse/NIFI-1517
             Project: Apache NiFi
          Issue Type: Improvement
          Components: Extensions
            Reporter: Ian Hellstrom
            Priority: Minor


Currently, SplitContent accepts only a byte sequence, given as hexadecimal or 
text, as the split delimiter. However, it is sometimes necessary to split on 
one of several alternatives, or on the section markers of a log file 
(sometimes indicated by "[SOME_TEXT]"), where the section name SOME_TEXT 
varies. Hence, regular expressions (or EL) should be allowed in the 
SplitContent processor when using the "text" option.
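
To make the request concrete, here is a minimal sketch of the desired 
behaviour, written in plain Python rather than as NiFi processor code. The 
marker pattern, the function name split_on_pattern and the sample log are 
all assumptions made for illustration; none of them are part of SplitContent.

```python
from __future__ import annotations

import re

# Assumed marker format: an upper-case section name in square brackets,
# e.g. [SOME_TEXT]. Real log files may need a different pattern.
SECTION_MARKER = re.compile(rb"\[[A-Z_]+\]")


def split_on_pattern(content: bytes, pattern: re.Pattern) -> list[bytes]:
    """Split content at every match of pattern.

    The matched delimiter is dropped, and empty pieces (for example the
    one before a marker at the very start of the content) are discarded.
    """
    return [piece for piece in pattern.split(content) if piece]


# Hypothetical sample input with two sections.
log = b"[HEADER]\nversion=1\n[BODY]\nline a\nline b\n"
parts = split_on_pattern(log, SECTION_MARKER)
```

Because the pattern is a regular expression, alternatives such as 
rb"\[HEADER\]|\[BODY\]" would work the same way, which a single fixed byte 
sequence cannot express.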

It would also be great if relevant information could be extracted directly 
during the split. For instance, a RegEx with a capture group could pick out 
SOME_TEXT in the aforementioned example. That way, files could be split on 
section markers while the markers themselves are removed from the output. 
This additional request is a nice-to-have feature.
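
The nice-to-have could behave roughly as follows. This is only a sketch in 
plain Python, not NiFi code: the pattern, the function name 
split_with_names and the choice to drop any content before the first marker 
are assumptions. In a processor, the captured name would presumably end up 
in a FlowFile attribute.

```python
from __future__ import annotations

import re

# Assumed marker format, with the section name in capture group 1.
SECTION_MARKER = re.compile(rb"\[([A-Z_]+)\]")


def split_with_names(content: bytes, pattern: re.Pattern) -> list[tuple[str, bytes]]:
    """Return (section_name, section_body) pairs.

    Each body runs from the end of one marker to the start of the next,
    so the markers themselves are removed. Content that precedes the
    first marker is dropped in this sketch.
    """
    matches = list(pattern.finditer(content))
    sections = []
    for i, match in enumerate(matches):
        # The body ends where the next marker starts, or at end of content.
        end = matches[i + 1].start() if i + 1 < len(matches) else len(content)
        name = match.group(1).decode("ascii")
        sections.append((name, content[match.end():end]))
    return sections


# Hypothetical sample input with two sections.
sections = split_with_names(b"[HEADER]\nversion=1\n[BODY]\nline a\n", SECTION_MARKER)
```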



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
