https://bugzilla.wikimedia.org/show_bug.cgi?id=57628

--- Comment #3 from David Chan <[email protected]> ---
Hi, yes, we wrote ve.splitClusters because we wanted to make the document model
be a list of grapheme clusters, instead of a list of raw Javascript characters
(i.e. Unicode code units, so each surrogate pair like '\uD860\uDEE2' is treated
as two separate entities '\uD860' and '\uDEE2').

However, we've subsequently decided against that, because browsers will not
always agree what constitutes a grapheme cluster. The example of Malayalam,
where the font can affect the number of clusters, is one example of how
problematic it could be to try to match the browser's clusterings exactly.

Therefore, the DM is to remain a list of raw Javascript characters, and support
related to clustering is being developed at a level on top of the DM.

-- 
You are receiving this mail because:
You are on the CC list for the bug.
_______________________________________________
Wikibugs-l mailing list
[email protected]
https://lists.wikimedia.org/mailman/listinfo/wikibugs-l

Reply via email to