Andy LoPresto created NIFI-4095:
-----------------------------------
Summary: ExtractText should not require a capture group in every
regular expression
Key: NIFI-4095
URL: https://issues.apache.org/jira/browse/NIFI-4095
Project: Apache NiFi
Issue Type: Improvement
Components: Extensions
Affects Versions: 1.3.0
Reporter: Andy LoPresto
The {{ExtractText}} processor currently validates every regular expression and
requires that it contain "between 1 and 40 capture groups". This seems to be a
design decision, as the values are hardcoded into the
[validator|https://github.com/apache/nifi/blob/master/nifi-nar-bundles/nifi-standard-bundle/nifi-standard-processors/src/main/java/org/apache/nifi/processors/standard/ExtractText.java#L262-L262],
but there are valid regular expressions that do not need an explicit capture
group (especially when the expression is small and the full expression is the
desired match). This results in unnecessary duplicate matches ("some_attr" and
"some_attr.1" being identical).
--
This message was sent by Atlassian JIRA
(v6.4.14#64029)