[ 
https://issues.apache.org/jira/browse/NIFI-1517?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17528624#comment-17528624
 ] 

Phil Hawkins commented on NIFI-1517:
------------------------------------

I have created a fix for this issue: [https://github.com/gippyphil/nifi/pull/1]

> Allow SplitContent to be split on a regular expression
> ------------------------------------------------------
>
>                 Key: NIFI-1517
>                 URL: https://issues.apache.org/jira/browse/NIFI-1517
>             Project: Apache NiFi
>          Issue Type: Improvement
>          Components: Extensions
>            Reporter: Ian Hellstrom
>            Priority: Minor
>
> Currently, SplitContent accepts hexadecimal and text byte sequences as the 
> split delimiter. However, it is sometimes necessary to split on alternatives 
> or on the sections of a log file (sometimes indicated by "[SOME_TEXT]"), 
> where the section name SOME_TEXT can vary. Hence, regular expressions (or EL) 
> should be allowed in the SplitContent processor when using the "text" option.
> It would also be great if relevant information could be extracted from the 
> split immediately, for instance with a regex that back-references SOME_TEXT 
> in the example above. That way you could split files on section markers and 
> get rid of the markers at the same time. This additional request is a 
> nice-to-have feature.
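
For illustration only, the behaviour requested in the description can be sketched outside NiFi. This is not SplitContent code and not the implementation in the linked pull request. The marker pattern, the `split_on_sections` name, and the choice to drop any text before the first marker are all assumptions made for the example.

```python
import re

# Hypothetical marker: a section header such as "[SOME_TEXT]" alone on a line.
# The capture group holds the section name so it can be kept as metadata.
SECTION_MARKER = re.compile(r"^\[([A-Z_]+)\][ \t]*$", re.MULTILINE)


def split_on_sections(text):
    """Split text at each section marker.

    Returns a list of (section_name, body) pairs. The markers themselves
    are removed from the bodies. Text before the first marker is dropped
    (an assumption of this sketch).
    """
    matches = list(SECTION_MARKER.finditer(text))
    sections = []
    for i, match in enumerate(matches):
        body_start = match.end()
        # A body runs up to the next marker, or to the end of the text.
        body_end = matches[i + 1].start() if i + 1 < len(matches) else len(text)
        sections.append((match.group(1), text[body_start:body_end].strip("\n")))
    return sections
```

Each split carries the captured section name alongside its content, which corresponds to the "split on section markers yet get rid of the markers" part of the request.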



--
This message was sent by Atlassian Jira
(v8.20.7#820007)
