On Thu, 15 May 2025 02:25:30 GMT, Sergey Bylokhov <s...@openjdk.org> wrote:

>> I believe it is OK to leave these as UTF-8 native characters, as these files 
>> are l10n resource bundles. If we wanted to replace those look-alike spaces 
>> to unicode escapes, other characters may also need the same treatment, such 
>> as hyphen-minus, quotations, etc. In fact there are lot more look alikes 
>> defined in the unicode consortium 
>> (https://www.unicode.org/Public/security/latest/confusables.txt), and I 
>> don't think we would want to convert them.
>
> maybe this is just a translation error and a simple space can be used 
> instead, like in all the other properties in these files?

Maybe, but sometimes it is intentional. CLDR has once switched normal spaces to 
NBSP/NNBSP for certain locales 
(https://unicode-org.atlassian.net/browse/CLDR-14032). And we cannot tell if it 
is intentional or not.

-------------

PR Review Comment: https://git.openjdk.org/jdk/pull/25234#discussion_r2090140891

Reply via email to