alhudz opened a new pull request, #756:
URL: https://github.com/apache/commons-text/pull/756
Port of apache/commons-lang#1730 to the Commons Text copy of
`TextStringBuilder`, as requested by @garydgregory.
`TextStringBuilder.reverse()` swaps the buffer one `char` at a time, so
every surrogate pair is left in low-high order and a supplementary code point
becomes malformed UTF-16. `StringBuilder`/`StringBuffer`, which this class is
documented to mimic, reverse the same input correctly.
Repro: `new TextStringBuilder("a😀b").reverse().toString()` (`a`, `U+1F600`,
`b`).
Before: `b\uDE00\uD83Da`, a low surrogate ahead of its high surrogate.
After: `b😀a`, matching `new StringBuilder("a😀b").reverse()`.
Fix: after the char swap, walk the buffer once and swap each adjacent
low-high surrogate pair back to high-low, gated on whether any surrogate was
seen during the swap. BMP text, lone unpaired surrogates and odd-length buffers
are untouched.
Added `TextStringBuilderTest#testReverseSurrogatePairs`, which fails on the
current tree and passes with the fix.
- [x] Read the [contribution guidelines](CONTRIBUTING.md) for this project.
- [ ] Read the [ASF Generative Tooling
Guidance](https://www.apache.org/legal/generative-tooling.html) if you use
Artificial Intelligence (AI).
- [ ] I used AI to create any part of, or all of, this pull request. Which
AI tool was used to create this pull request, and to what extent did it
contribute?
- [x] Run a successful build using the default
[Maven](https://maven.apache.org/) goal with `mvn`; that's `mvn` on the command
line by itself.
- [x] Write unit tests that match behavioral changes, where the tests fail
if the changes to the runtime are not applied. This may not always be possible,
but it is a best practice.
- [x] Write a pull request description that is detailed enough to understand
what the pull request does, how, and why.
- [x] Each commit in the pull request should have a meaningful subject line
and body. Note that a maintainer may squash commits during the merge process.
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at:
[email protected]