[
https://issues.apache.org/jira/browse/NIFI-1517?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17528624#comment-17528624
]
Phil Hawkins commented on NIFI-1517:
------------------------------------
I have created a fix for this issue: https://github.com/gippyphil/nifi/pull/1
> Allow SplitContent to be split on a regular expression
> ------------------------------------------------------
>
> Key: NIFI-1517
> URL: https://issues.apache.org/jira/browse/NIFI-1517
> Project: Apache NiFi
> Issue Type: Improvement
> Components: Extensions
> Reporter: Ian Hellstrom
> Priority: Minor
>
> Currently, SplitContent allows a hexadecimal or text sequence to be specified
> as the split point. However, it is sometimes necessary to split on
> alternatives, or on the sections of a log file (sometimes indicated by
> "[SOME_TEXT]"), where the section name SOME_TEXT can vary. Hence, regular
> expressions (or EL) should be allowed in the SplitContent processor when
> using the "text" option.
> It would also be great if relevant information could be extracted from the
> split immediately, for instance with a regex that back-references SOME_TEXT
> in the aforementioned example. That way you could split files based on
> section markers yet immediately get rid of these markers. This additional
> request is a nice-to-have feature.
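
To make the request concrete, here is a minimal sketch in plain Java of splitting on a "[SOME_TEXT]" marker while capturing the section name and dropping the marker from the output. It is not the code from the pull request above and does not use any NiFi API: the class name SectionSplitter, the method split, and the marker pattern (upper-case letters and underscores inside square brackets) are all assumptions made for illustration.

```java
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper, not part of NiFi: splits text on "[SECTION_NAME]" markers.
final class SectionSplitter {

    // Assumed marker format: upper-case letters and underscores in square brackets.
    // Group 1 captures the section name so it can be used without the brackets.
    private static final Pattern MARKER = Pattern.compile("\\[([A-Z_]+)\\]");

    private SectionSplitter() {
    }

    /**
     * Returns section name -> section body, in order of appearance.
     * Text before the first marker is discarded, and a repeated section
     * name overwrites the earlier body; both are simplifications.
     */
    static Map<String, String> split(String content) {
        Map<String, String> sections = new LinkedHashMap<>();
        Matcher matcher = MARKER.matcher(content);
        String currentName = null;
        int bodyStart = 0;
        while (matcher.find()) {
            if (currentName != null) {
                // The previous section ends where this marker begins.
                sections.put(currentName, content.substring(bodyStart, matcher.start()).trim());
            }
            currentName = matcher.group(1);
            bodyStart = matcher.end(); // skip the marker itself
        }
        if (currentName != null) {
            // The last section runs to the end of the input.
            sections.put(currentName, content.substring(bodyStart).trim());
        }
        return sections;
    }
}
```

Calling SectionSplitter.split on a log that contains "[STARTUP]" and "[REQUESTS]" markers would yield one entry per section, keyed by the captured name, with the markers themselves removed from the bodies. A real processor would work on streams of bytes rather than an in-memory String, so this only illustrates the matching logic.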