Samar created NIFI-14324:
----------------------------
Summary: ExtractText Processor Adds Matched Values many Times
Incorrectly
Key: NIFI-14324
URL: https://issues.apache.org/jira/browse/NIFI-14324
Project: Apache NiFi
Issue Type: Bug
Components: NiFi API
Affects Versions: 1.25.0, 1.15.0
Reporter: Samar
Attachments: Screenshot from 2025-03-05 12-57-41.png
The *ExtractText processor* is unexpectedly adding matched values *three times*
when processing FlowFiles with a regex pattern. Even with a simple pattern like
{{{}(?s)(.*){}}}, the extracted value is stored *exactly three times* in the
FlowFile attributes.
h4. *Steps to Reproduce*
# Create an *ExtractText* processor in NiFi.
# Set the *Enable repeating capture group* property to {{{}true{}}}.
# Use a simple regex pattern, such as {{{}(?s)(.*){}}}.
# Process a FlowFile containing any text.
# Observe the extracted attributes.
h4. *Expected Behavior*
* The extracted text should be stored *once* in the attributes when matching
the entire content.
h4. *Actual Behavior*
* The extracted text is added {*}three times{*}, leading to duplicate entries.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)