speters33w commented on PR #1201:
URL: https://github.com/apache/commons-lang/pull/1201#issuecomment-2060343679

   > Javadoc says ligatures unhandled. Also I wonder should it state that it 
can’t locale specific replacements like ä->ae and ß->ss
   
   Fixed Javadoc. It will handle ligatures and digraphs per the KD column in 
the [Unicode Normalization 
charts](https://www.unicode.org/charts/normalization/).
   ä will (and should be) be normalized to a. It is not ae in all languages.
   Ligatures and digraphs that don't have compatibility decomposition, such as 
æ or ß will remain unaltered.
   It would be possible to modify convertRemainingAccentCharacters() to include 
characters such as these. It's currently char based so it can't change one char 
to two, but it could be rewritten.
   It would add another iteration to the method and would take me some time to 
do this. 
   Do you think it is necessary?


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to