On Fri, 8 May 2026 12:19:59 GMT, Liam Miller-Cushon <[email protected]> wrote:

>> Please consider this fix to the handling of word boundaries (`\b`) with 
>> non-spacing marks with non-BMP base characters, when 
>> `UNICODE_CHARACTER_CLASS` is not set.
>> 
>> See discussion in [JDK-8384082](https://bugs.openjdk.org/browse/JDK-8384082).
>> 
>> ---------
>> - [x] I confirm that I make this contribution in accordance with the 
>> [OpenJDK Interim AI Policy](https://openjdk.org/legal/ai).
>
> Liam Miller-Cushon has updated the pull request incrementally with one 
> additional commit since the last revision:
> 
>   8384137: Regex \b handling of non-spacing marks without 
> UNICODE_CHARACTER_CLASS

I think I found another problem while looking into this. I have started a 
core-libs-dev@ thread to get feedback on whether my analysis here is correct, 
and what the best overall approach would be:

https://mail.openjdk.org/archives/list/[email protected]/thread/52NU77AYQYJ53XUTKHVQ2MKHUPWDHDPN/

-------------

PR Comment: https://git.openjdk.org/jdk/pull/31067#issuecomment-4406606434
