[
https://issues.apache.org/jira/browse/NIFI-911?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Mark Payne updated NIFI-911:
----------------------------
Description:
Currently, the default value for the Regular Expression property is (.\*) which
is intended to match any text. However, due to the way that the regular
expression is interpreted, it first matches the entire content of the FlowFile
and then matches a second time, with 0 characters, at the end of the content
(because .* matches zero or more of any character).
This results in the replacement value being added to the FlowFile twice. For
example, if the replacement value is "Hello World", the output FlowFile will
contain the text "Hello WorldHello World".
We should instead change the regex pattern to (^.*$) in order to ensure that
we do not duplicate the output.
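
The behavior described above can be reproduced outside NiFi with plain
java.util.regex. This is only a minimal sketch: it assumes the processor applies
something equivalent to Matcher.replaceAll to single-line content, and the
class name ReplaceAllDemo and helper replace() are hypothetical, not part of
ReplaceText.

```java
import java.util.regex.Pattern;

public class ReplaceAllDemo {

    // Hypothetical helper (not NiFi code): replaces every match of regex
    // in content with replacement, as Matcher.replaceAll does.
    static String replace(String regex, String content, String replacement) {
        return Pattern.compile(regex).matcher(content).replaceAll(replacement);
    }

    public static void main(String[] args) {
        String content = "some FlowFile content";

        // (.*) matches the whole content, then matches the empty string at
        // the end, so the replacement is emitted twice.
        System.out.println(replace("(.*)", content, "Hello World"));
        // prints "Hello WorldHello World"

        // (^.*$) cannot match again at the end because ^ only matches at the
        // start of the input, so the replacement is emitted once.
        System.out.println(replace("(^.*$)", content, "Hello World"));
        // prints "Hello World"
    }
}
```

The sketch only covers content without line terminators; since . does not
match line terminators by default, multi-line content behaves differently.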
was:
Currently, the default value for the Regular Expression property is (.*) which
is intended to match any text. However, due to the way that the regular
expression is interpreted, it first matches the entire content of the FlowFile
and then again matches 0 characters at the end of the content (because .* will
match 0 or more characters of any character).
This results in the replacement value being added to the FlowFile twice. For
example, if the replacement value is "Hello World", the output FlowFile will
contain the text "Hello WorldHello World".
We should instead change the regex pattern to (^.*$) in order to ensure that
we do not duplicate the output.
> ReplaceText, with default configuration, will cause the replacement to be
> added twice
> -------------------------------------------------------------------------------------
>
> Key: NIFI-911
> URL: https://issues.apache.org/jira/browse/NIFI-911
> Project: Apache NiFi
> Issue Type: Improvement
> Components: Extensions
> Reporter: Mark Payne
> Labels: beginner, newbie
> Fix For: 0.3.0
>
>
> Currently, the default value for the Regular Expression property is (.\*)
> which is intended to match any text. However, due to the way that the regular
> expression is interpreted, it first matches the entire content of the
> FlowFile and then matches a second time, with 0 characters, at the end of the
> content (because .* matches zero or more of any character).
> This results in the replacement value being added to the FlowFile twice. For
> example, if the replacement value is "Hello World", the output FlowFile will
> contain the text "Hello WorldHello World".
> We should instead change the regex pattern to (^.*$) in order to ensure
> that we do not duplicate the output.