Samar created NIFI-14324:
----------------------------

             Summary: ExtractText Processor Adds Matched Values many Times 
Incorrectly
                 Key: NIFI-14324
                 URL: https://issues.apache.org/jira/browse/NIFI-14324
             Project: Apache NiFi
          Issue Type: Bug
          Components: NiFi API
    Affects Versions: 1.25.0, 1.15.0
            Reporter: Samar
         Attachments: Screenshot from 2025-03-05 12-57-41.png

The *ExtractText processor* is unexpectedly adding matched values *three times* 
when processing FlowFiles with a regex pattern. Even with a simple pattern like 
{{{}(?s)(.*){}}}, the extracted value is stored *exactly three times* in the 
FlowFile attributes.
h4. *Steps to Reproduce*
 # Create an *ExtractText* processor in NiFi.
 # Set the *Enable repeating capture group* property to {{{}true{}}}.
 # Use a simple regex pattern, such as {{{}(?s)(.*){}}}.
 # Process a FlowFile containing any text.
 # Observe the extracted attributes.

h4. *Expected Behavior*
 * The extracted text should be stored *once* in the attributes when matching 
the entire content.

h4. *Actual Behavior*
 * The extracted text is added {*}three times{*}, leading to duplicate entries.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to