metsw24-max opened a new pull request, #699: URL: https://github.com/apache/logging-log4cxx/pull/699
The four-byte branch masks the lead byte with `& 0x07` and then checks only the range of the resulting code point. Lead bytes F8..FF (never valid UTF-8) keep their low three bits under that mask and can decode to in-range code points instead of being rejected. For example, `F8 BF BF BF` aliases to U+3FFFF, and FB and FC leads alias into other planes. This is the same substitution collision that the existing F5..F7 bound was added to stop.

Gate acceptance on `(ch1 & 0xF8) == 0xF0` so that only F0..F7 reach the range check. Added transcodertestcase coverage for the F8, FB, FC, and FF leads.
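To illustrate the gate described above, here is a minimal standalone sketch of a four-byte decode step. It is not the log4cxx transcoder source: `decodeFourByte`, `isContinuation`, and the `kInvalid` sentinel are hypothetical names chosen for this example, and the U+10000 lower bound is an assumption about how overlong forms would be rejected.

```cpp
#include <cstdint>

// Hypothetical sentinel meaning "not a valid four-byte sequence".
constexpr std::uint32_t kInvalid = 0xFFFFFFFFu;

// A continuation byte has the bit pattern 10xxxxxx.
constexpr bool isContinuation(unsigned char b)
{
    return (b & 0xC0) == 0x80;
}

// Decodes one four-byte UTF-8 sequence, or returns kInvalid.
constexpr std::uint32_t decodeFourByte(unsigned char ch1, unsigned char ch2,
                                       unsigned char ch3, unsigned char ch4)
{
    // Gate: only the pattern 11110xxx (F0..F7) is a four-byte lead.
    // Without this test, F8..FF would be masked down to their low three
    // bits and could alias to an in-range code point.
    if ((ch1 & 0xF8) != 0xF0)
    {
        return kInvalid;
    }
    if (!isContinuation(ch2) || !isContinuation(ch3) || !isContinuation(ch4))
    {
        return kInvalid;
    }
    const std::uint32_t cp = ((ch1 & 0x07u) << 18)
                           | ((ch2 & 0x3Fu) << 12)
                           | ((ch3 & 0x3Fu) << 6)
                           |  (ch4 & 0x3Fu);
    // Range check: rejects overlong forms (below U+10000) and anything
    // above U+10FFFF, which covers the F5..F7 leads.
    if (cp < 0x10000u || cp > 0x10FFFFu)
    {
        return kInvalid;
    }
    return cp;
}
```

With the gate in place, `F0 BF BF BF` still decodes to U+3FFFF, while `F8 BF BF BF` is rejected before the mask is applied rather than aliasing to the same value.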
