https://issues.apache.org/ooo/show_bug.cgi?id=120598

[email protected] changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
             Status|CONFIRMED                   |RESOLVED
         Resolution|---                         |FIXED

--- Comment #4 from [email protected] ---
The new ICU-regexp engine has many benefits such as much improved unicode
support and support back references, non-greed matches, look-ahead,
look-behind, match count limits, graphemes, atomic matches, non-capturing
matches, possessive matches etc. It didn't directly support the GRE-only \< and
\> word-break operators though, so they were emulated by plain word-break
operators.

The fix just commited improves the emulation considerably using the newer
engines look-around features by emulating the
- word-start operator as \b(?=\w)
- word-end operator as (?<=\w)\b

Until this gets into the next release the workaround for the issue reporter is
to replace the "\< with "\b(?=\w) and the \>" with (?<=\w)\b

-- 
You are receiving this mail because:
You are the assignee for the bug.

Reply via email to