Luigi De Giovanni created NIFI-10869:
----------------------------------------
Summary: ExtractText processor - RegEx captured twice
Key: NIFI-10869
URL: https://issues.apache.org/jira/browse/NIFI-10869
Project: Apache NiFi
Issue Type: Bug
Affects Versions: 1.18.0
Reporter: Luigi De Giovanni
Attachments: image (1).png, image (2).png, image.png
Hi,
There is an issue with the ExtractText processor, in the attempt of creating
FlowFile attributes from FlowFile contents.
When creating a custom property with a regex value that does not contain named
groups, the value captured in the group is added as 2 different attributes.
E.G.
||Property Name||Property Value||Expected FlowFile Attribute||Actual outcome||
|request.param.alpha|<regex for
alpha>|request.param.alpha=alpha|request.param.alpha=alpha
request.param.alpha.1=alpha|
Reading the documentation, this might even be an expected behaviour, but if so,
it is preferable to have the captured value only as a single attribute, without
duplication.
Please see the attachment for a real example
--
This message was sent by Atlassian Jira
(v8.20.10#820010)