Ian Hellstrom created NIFI-1517:
-----------------------------------
Summary: Allow SplitContent to be split on a regular expression
Key: NIFI-1517
URL: https://issues.apache.org/jira/browse/NIFI-1517
Project: Apache NiFi
Issue Type: Improvement
Components: Extensions
Reporter: Ian Hellstrom
Priority: Minor
Currently SplitContent allows HEX and text sequences to be added. However, it
is sometimes necessary to split on alternatives or based on different sections
of a log file (sometimes indicated by "[SOME_TEXT]"), where the section name
SOME_TEXT can obviously vary. Hence, regular expressions (or EL) should be
allowed in the SplitContent processor when using the "text" option.
It would also be great if it's possible to immediately extract relevant
information from the split. For instance, create a RegEx that back-references
SOME_TEXT in the aforementioned example. That way you could split files based
on section markers yet immediately get rid of these markers. This additional
request is a nice-to-have feature.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)