[
https://issues.apache.org/jira/browse/NIFI-10869?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Luigi De Giovanni updated NIFI-10869:
-------------------------------------
Priority: Minor (was: Major)
> ExtractText processor - RegEx captured twice
> --------------------------------------------
>
> Key: NIFI-10869
> URL: https://issues.apache.org/jira/browse/NIFI-10869
> Project: Apache NiFi
> Issue Type: Improvement
> Affects Versions: 1.18.0
> Reporter: Luigi De Giovanni
> Priority: Minor
> Attachments: image (1).png, image (2).png, image.png
>
>
> Hi,
> There is an issue with the ExtractText processor, in the attempt of creating
> FlowFile attributes from FlowFile contents.
> When creating a custom property with a regex value that does not contain
> named groups, the value captured in the group is added as 2 different
> attributes.
> E.G.
> ||Property Name||Property Value||Expected FlowFile Attribute||Actual outcome||
> |param.alpha|<regex for alpha>|param.alpha=alpha|param.alpha=alpha
> param.alpha.1=alpha|
> Reading the documentation, this might even be an expected behaviour, but if
> so, it is preferable to have the captured value only as a single attribute,
> without duplication.
> Please see the attachment for an example.
>
--
This message was sent by Atlassian Jira
(v8.20.10#820010)