Status: Untriaged
Owner: [email protected]
CC: [email protected]
Labels: Type-Bug Pri-2 OS-All Area-BrowserUI Area-BrowserBackend I18N

New issue 7796 by [email protected]: CollapseWhitespace replaces ZWNJ with
U+0020 (ASCII space) in the middle of a string
http://code.google.com/p/chromium/issues/detail?id=7796

CollapseWhitespace replaces each run of 'whitespace characters' with a
single ASCII space (U+0020). It relies on IsWhitespace in string_util.cc,
which in turn uses kWhitespaceWide to decide whether a given character is
whitespace. kWhitespaceWide contains ZWNJ (U+200C, ZERO WIDTH NON-JOINER),
and replacing it with U+0020 is problematic for some words in Arabic and
Indic scripts, where ZWNJ controls how adjacent letters join rather than
separating words.
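
To make the problem concrete, here is a minimal sketch of the collapsing
logic. It is not the code in string_util.cc: the names
IsWhitespaceSketch and CollapseWhitespaceSketch, the abbreviated
whitespace set, and the treat_zwnj_as_whitespace flag are all
hypothetical and exist only to compare the reported behavior with one
possible fix (leaving ZWNJ out of the whitespace set).

```cpp
#include <string>

// U+200C ZERO WIDTH NON-JOINER.
constexpr char32_t kZwnj = U'\u200C';

// Hypothetical, abbreviated whitespace test. This is NOT Chromium's
// kWhitespaceWide; it lists only a few characters for illustration.
// With treat_zwnj_as_whitespace == true it mimics the reported behavior.
bool IsWhitespaceSketch(char32_t c, bool treat_zwnj_as_whitespace) {
  switch (c) {
    case U' ':
    case U'\t':
    case U'\n':
    case U'\r':
    case U'\u00A0':  // NO-BREAK SPACE
    case U'\u3000':  // IDEOGRAPHIC SPACE
      return true;
    case kZwnj:
      return treat_zwnj_as_whitespace;
    default:
      return false;
  }
}

// Simplified collapse: each run of whitespace becomes one U+0020, and
// leading and trailing whitespace is dropped.
std::u32string CollapseWhitespaceSketch(const std::u32string& text,
                                        bool treat_zwnj_as_whitespace) {
  std::u32string result;
  bool in_whitespace = true;  // Starting "inside" a run drops leading whitespace.
  for (char32_t c : text) {
    if (IsWhitespaceSketch(c, treat_zwnj_as_whitespace)) {
      if (!in_whitespace) {
        in_whitespace = true;
        result.push_back(U' ');
      }
    } else {
      in_whitespace = false;
      result.push_back(c);
    }
  }
  // A trailing U+0020 can only come from a collapsed run; remove it.
  if (!result.empty() && result.back() == U' ')
    result.pop_back();
  return result;
}
```

For example, a Persian word written in Arabic script with ZWNJ in the
middle, U"\u0645\u06CC\u200C\u062E\u0648\u0627\u0647\u0645", comes back
with U+0020 in place of U+200C when the flag is true, and comes back
unchanged when the flag is false.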

Ref: Unicode 5.0, section 8.2 (pp. 270-271) and section 16.2
(pp. 538-539).




