dave-csc commented on issue #4585:
URL: https://github.com/apache/hop/issues/4585#issuecomment-2485336229

   Hi @hansva,
   
   for my tests I used https://regex101.com/ and it gave the results described 
in the first post.
   
   The difference lies in what we mean by a "match" (whether the input matches 
the pattern _in whole_, or just _contains_ it). Java itself offers different 
methods for the different kinds of match (cf. 
https://www.baeldung.com/java-matcher-find-vs-matches). In this case, a slight 
documentation improvement is probably needed.
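
   As a minimal sketch of the two behaviours in plain Java: the pattern and 
the input below are made up for illustration (they are not the ones from the 
first post), and the class and method names are hypothetical.

```java
import java.util.regex.Pattern;

public class FindVsMatches {
    // True only if the ENTIRE input matches the pattern (Matcher.matches()).
    static boolean wholeMatch(String regex, String input) {
        return Pattern.compile(regex).matcher(input).matches();
    }

    // True if SOME substring of the input matches the pattern (Matcher.find()).
    static boolean containsMatch(String regex, String input) {
        return Pattern.compile(regex).matcher(input).find();
    }

    public static void main(String[] args) {
        String regex = "\\(\\w+\\)";                 // hypothetical pattern
        String input = "step (PROCESSING_TAG) done"; // hypothetical input

        System.out.println(wholeMatch(regex, input));    // false
        System.out.println(containsMatch(regex, input)); // true
    }
}
```

   regex101.com reports a match in the "contains" sense, which would explain 
why the two tools can disagree on the same pattern.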
   
   I don't know which implementation is "safer" for data analysis, but to 
specify a full string match I would explicitly supply a regex with anchors like 
`^...$`. Without those anchors I'd expect the check to be whether the string 
merely contains the pattern.
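
   A small sketch of that expectation, assuming a substring search 
(`Matcher.find()`) and single-line input without a trailing newline. The 
pattern, inputs and names are again hypothetical.

```java
import java.util.regex.Pattern;

public class AnchoredFind {
    // Substring search: true if the pattern occurs anywhere in the input.
    static boolean found(String regex, String input) {
        return Pattern.compile(regex).matcher(input).find();
    }

    public static void main(String[] args) {
        String bare = "\\(\\w+\\)";       // hypothetical pattern, no anchors
        String anchored = "^\\(\\w+\\)$"; // same pattern wrapped in ^...$

        System.out.println(found(bare, "step (PROCESSING_TAG) done"));     // true
        System.out.println(found(anchored, "step (PROCESSING_TAG) done")); // false
        System.out.println(found(anchored, "(PROCESSING_TAG)"));           // true
    }
}
```

   With the anchors in place the substring search only succeeds on a full 
string match, so the caller decides which kind of match is wanted.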
   
   BTW, my goal was actually to extract the `PROCESSING_TAG` text (without 
parentheses). I managed that with Regex 1 above; with Regex 2 I got a 
no-match, which I did not expect, hence the bug report.
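
   For reference, that kind of extraction can be sketched in plain Java with a 
capture group. The pattern below is an assumption for illustration, not Regex 1 
or Regex 2 from the first post, and the names are hypothetical.

```java
import java.util.Optional;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

public class ExtractTag {
    // Hypothetical pattern: group 1 captures the word between the parentheses.
    private static final Pattern TAG = Pattern.compile("\\((\\w+)\\)");

    static Optional<String> extractTag(String input) {
        Matcher m = TAG.matcher(input);
        // find() locates the first occurrence anywhere in the input.
        return m.find() ? Optional.of(m.group(1)) : Optional.empty();
    }

    public static void main(String[] args) {
        System.out.println(extractTag("step (PROCESSING_TAG) done").orElse("<no match>")); // PROCESSING_TAG
        System.out.println(extractTag("no tag here").orElse("<no match>"));                // <no match>
    }
}
```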


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]
