[Bug 2399] Unicode normalization sorts Hebrew/Arabic/Myanmar vowels wrongly
https://bugzilla.wikimedia.org/show_bug.cgi?id=2399

Andre Klapper aklap...@wikimedia.org changed:

  What   | Removed  | Added
  Status | ASSIGNED | NEW

--- Comment #60 from Andre Klapper aklap...@wikimedia.org ---
Amir: Do you (or the L10N team) plan to take a look at this at some point? This ticket ranks 14th in the list of open tickets with the highest votes...

--
You are receiving this mail because: You are on the CC list for the bug.
___
Wikibugs-l mailing list
Wikibugs-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikibugs-l
[Bug 2399] Unicode normalization sorts Hebrew/Arabic/Myanmar vowels wrongly
https://bugzilla.wikimedia.org/show_bug.cgi?id=2399

--- Comment #59 from Dovi Jacobs dovijac...@yahoo.com ---
For an extremely clear description of the problem in Hebrew, see here (pp. 8 ff.): http://www.sbl-site.org/Fonts/SBLHebrewUserManual1.5x.pdf
[Bug 2399] Unicode normalization sorts Hebrew/Arabic/Myanmar vowels wrongly
https://bugzilla.wikimedia.org/show_bug.cgi?id=2399

matanya matanya.mo...@gmail.com changed:

  What       | Removed            | Added
  CC         |                    | matanya.mo...@gmail.com
  AssignedTo | br...@wikimedia.org | amir.ahar...@mail.huji.ac.il

--- Comment #58 from matanya matanya.mo...@gmail.com 2012-07-30 13:53:46 UTC ---
Reassigned to Amir, as he is one of the localization engineers. This bug is still present, as can be seen at: https://en.wikisource.org/wiki/User:Amire80/Havrakha
[Bug 2399] Unicode normalization sorts Hebrew/Arabic/Myanmar vowels wrongly
https://bugzilla.wikimedia.org/show_bug.cgi?id=2399

Jan Kucera (Kozuch) garba...@seznam.cz changed:

  What     | Removed | Added
  Priority | Lowest  | Highest
  CC       |         | garba...@seznam.cz

--- Comment #58 from Jan Kucera (Kozuch) garba...@seznam.cz 2011-12-30 15:46:32 UTC ---
Because of the votes, raising importance/priority according to the following scheme:

  15+ votes  - highest
  5-15 votes - high

The community must have a voice within development.

Regards,
Kozuch
http://en.wikipedia.org/wiki/User:Kozuch
[Bug 2399] Unicode normalization sorts Hebrew/Arabic/Myanmar vowels wrongly
https://bugzilla.wikimedia.org/show_bug.cgi?id=2399

Bugmeister Bot mhershber...@wikimedia.org changed:

  What     | Removed            | Added
  Priority | Highest            | Lowest
  CC       | garba...@seznam.cz |
[Bug 2399] Unicode normalization sorts Hebrew/Arabic/Myanmar vowels wrongly
https://bugzilla.wikimedia.org/show_bug.cgi?id=2399

Ryan Kaldari rkald...@wikimedia.org changed:

  What | Removed | Added
  CC   |         | rkald...@wikimedia.org

--- Comment #57 from Ryan Kaldari rkald...@wikimedia.org 2011-12-08 21:39:36 UTC ---
This should probably be reassigned to one of our localization engineers.
[Bug 2399] Unicode normalization sorts Hebrew/Arabic/Myanmar vowels wrongly
https://bugzilla.wikimedia.org/show_bug.cgi?id=2399

merelo...@gmail.com changed:

  What | Removed | Added
  CC   |         | merelo...@gmail.com

--- Comment #56 from merelo...@gmail.com 2011-09-29 12:48:02 UTC ---
*** Bug 31183 has been marked as a duplicate of this bug. ***
[Bug 2399] Unicode normalization sorts Hebrew/Arabic/Myanmar vowels wrongly
https://bugzilla.wikimedia.org/show_bug.cgi?id=2399

Krinkle krinklem...@gmail.com changed:

  What   | Removed | Added
  Blocks | 3860    | 1527
[Bug 2399] Unicode normalization sorts Hebrew/Arabic/Myanmar vowels wrongly
https://bugzilla.wikimedia.org/show_bug.cgi?id=2399

Krinkle krinklem...@gmail.com changed:

  What   | Removed | Added
  Blocks | 30672   | 30673
[Bug 2399] Unicode normalization sorts Hebrew/Arabic/Myanmar vowels wrongly
https://bugzilla.wikimedia.org/show_bug.cgi?id=2399

Siebrand s.mazel...@xs4all.nl changed:

  What   | Removed | Added
  CC     |         | s.mazel...@xs4all.nl
  Blocks |         | 30672
[Bug 2399] Unicode normalization sorts Hebrew/Arabic/Myanmar vowels wrongly
https://bugzilla.wikimedia.org/show_bug.cgi?id=2399

Philippe Verdy verd...@wanadoo.fr changed:

  What | Removed | Added
  CC   |         | verd...@wanadoo.fr

--- Comment #52 from Philippe Verdy verd...@wanadoo.fr 2011-09-03 15:58:14 UTC ---
Apparently, you have not implemented the contractions and expansions of the UCA. Note that there has been NO change in Unicode 5.1 (or later) to normalization, which has been stable since at least Unicode 4.0.1. The bugs above are most probably not related to normalization, if it is implemented correctly (and normalization is an easy problem that can be implemented very efficiently). And the changes in the DUCET (or now the CLDR DUCET) do not affect how Hebrew, Arabic or Myanmar is sorted within the same script.

You should learn to separate the Unicode Normalization Algorithm (UNA), the Unicode Collation Algorithm (UCA), and the Unicode Bidi Algorithm (UBA): the Bidi algorithm only affects the display, and definitely NOT the other two. And the order produced by normalization is orthogonal to the order of collation weights generated by the UCA, even if normalization is assumed to be performed before computing collations (this is not a requirement; it just helps reduce the problem by making sure that canonically equivalent strings collate the same).

Many posters above seem to be completely mixing up the problems!
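The point about normalizing before collating can be illustrated with a minimal Python sketch. It is not the UCA: plain code-point comparison stands in for real collation weights, and `canonical_key` is a hypothetical helper name introduced only for this illustration. It shows that two canonically equivalent spellings sort inconsistently until both are brought to the same normalization form.

```python
import unicodedata


def canonical_key(text: str) -> str:
    """Hypothetical helper: NFD form, so canonically equivalent strings share a key."""
    return unicodedata.normalize("NFD", text)


# Two spellings of "é": decomposed (e + COMBINING ACUTE ACCENT) and precomposed.
words = ["e\u0301b", "\u00e9a"]

# Raw code-point order depends on which spelling was used, not on the text itself.
raw_order = sorted(words)

# Normalizing first makes the comparison independent of the spelling.
# (Code-point order is only a stand-in here; real collation would use UCA weights.)
keyed_order = sorted(words, key=canonical_key)

print(raw_order == ["e\u0301b", "\u00e9a"])    # True: decomposed spelling sorts first
print(keyed_order == ["\u00e9a", "e\u0301b"])  # True: "éa" now sorts before "éb"
```

Normalization here only guarantees that equivalent strings get equal keys; which string sorts first is still decided by the comparison applied afterwards, which is the separation of concerns described above.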
[Bug 2399] Unicode normalization sorts Hebrew/Arabic/Myanmar vowels wrongly
https://bugzilla.wikimedia.org/show_bug.cgi?id=2399

--- Comment #53 from Philippe Verdy verd...@wanadoo.fr 2011-09-03 16:00:22 UTC ---
Note: for Thai, Lao and Tai Viet, normalization does not reorder the prepended vowels (nor does the Bidi algorithm). But such reordering is *required* when implementing the UCA, and it takes the form of contractions and expansions, which are present in the DUCET for these scripts.
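The normalization half of this remark can be checked with Python's standard `unicodedata` module; the sketch below covers only that half (the UCA reordering would need a collation library, which is not shown). The Thai syllable chosen is just an illustrative example.

```python
import unicodedata

SARA_E = "\u0e40"  # THAI CHARACTER SARA E, a vowel written before its consonant
KO_KAI = "\u0e01"  # THAI CHARACTER KO KAI

# Thai text is stored in visual order: the prepended vowel comes first.
syllable = SARA_E + KO_KAI

# Both characters have canonical combining class 0, so canonical reordering
# has nothing to move.
classes = [unicodedata.combining(ch) for ch in syllable]

# None of the four normalization forms changes the stored order.
unchanged = all(
    unicodedata.normalize(form, syllable) == syllable
    for form in ("NFC", "NFD", "NFKC", "NFKD")
)

print(classes)    # [0, 0]
print(unchanged)  # True
```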
[Bug 2399] Unicode normalization sorts Hebrew/Arabic/Myanmar vowels wrongly
https://bugzilla.wikimedia.org/show_bug.cgi?id=2399

--- Comment #54 from Philippe Verdy verd...@wanadoo.fr 2011-09-03 16:33:04 UTC ---
Final note: it is highly recommended NOT to save texts with an implicit normalization, even if normalization is implemented correctly.

There are known defects (yes, bugs) in the renderers of browsers, which frequently do not implement normalization and cannot sort, combine and position diacritics correctly unless they are in a specific order, which is not the same as the normalized order.

There are also incorrect assumptions made by writers who have not understood when and where to insert CGJ to prevent normalization from reordering some pairs of diacritics. They have written their texts in such a way that they seem to render correctly, but only on a buggy browser that does not perform normalization correctly and/or has strong limitations in its text renderer (it cannot recognize strings that are canonically equivalent, and expects only one order of successive diacritics in order to position them correctly).

This type of defect is typical of the bug described above about the normalized order of the DAGESH (a dot in the middle of a consonant letter, which modifies it) or the SIN/SHIN DOTS (above the letter, on the left or right, also modifying the consonant) and the other Hebrew vowel diacritics. Yes, normalization reorders the vowel diacritics before the diacritics that modify the consonant (this is the effect of an old assignment of their relative combining classes, in a completely illogical order of values, but this will NEVER be changed, as it would affect normalization). But many renderers cannot correctly display strings encoded in normalized order (base consonant, vowel diacritic, sin dot or shin dot or dagesh). Instead they expect the string to be encoded as (base consonant, dagesh or sin dot or shin dot, vowel diacritic), even though it is canonically equivalent to the former and should display exactly the same! (Such rendering bugs were found in old versions of Windows with IE6 or earlier.)

For this reason, you should not, on MediaWiki, apply any implicit renormalization to edited text. If one wants to enter (base consonant, dagesh or sin dot or shin dot, vowel diacritic) in the wiki text, keep it unchanged and do not normalize it, as it will display correctly on both the old buggy renderers and newer ones.
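The reordering described in this comment can be reproduced with Python's standard `unicodedata` module. The sketch below is only an illustration using one consonant and one vowel point; it is not MediaWiki's normalizer. It shows the combining classes involved, the swap performed by NFC, and how a COMBINING GRAPHEME JOINER (CGJ, U+034F) blocks the swap.

```python
import unicodedata

BET = "\u05d1"     # HEBREW LETTER BET (base consonant)
DAGESH = "\u05bc"  # HEBREW POINT DAGESH OR MAPIQ, combining class 21
PATAH = "\u05b7"   # HEBREW POINT PATAH (vowel), combining class 17
CGJ = "\u034f"     # COMBINING GRAPHEME JOINER, combining class 0

# The vowel has the lower class, so canonical ordering puts it before the dagesh.
print(unicodedata.combining(PATAH), unicodedata.combining(DAGESH))  # 17 21

typed = BET + DAGESH + PATAH  # the order many writers and old renderers expect
normalized = unicodedata.normalize("NFC", typed)
print(normalized == BET + PATAH + DAGESH)  # True: vowel moved before the dagesh

# The two orders are canonically equivalent: they share one normalized form.
equivalent = unicodedata.normalize("NFD", typed) == unicodedata.normalize(
    "NFD", BET + PATAH + DAGESH
)
print(equivalent)  # True

# CGJ has class 0, so it splits the run of marks and nothing is reordered.
guarded = BET + DAGESH + CGJ + PATAH
print(unicodedata.normalize("NFC", guarded) == guarded)  # True
```

Note that the CGJ-guarded string is no longer canonically equivalent to the unguarded ones, so it is a deliberate authoring choice rather than a transparent workaround.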
[Bug 2399] Unicode normalization sorts Hebrew/Arabic/Myanmar vowels wrongly
https://bugzilla.wikimedia.org/show_bug.cgi?id=2399

--- Comment #55 from Philippe Verdy verd...@wanadoo.fr 2011-09-03 16:37:59 UTC ---
All my remarks in the previous message also apply to the Arabic diacritics. For example, the assumptions made by Brion Vibber in his message #23 are completely wrong. He has not understood what normalization is, nor the fact that, with conforming renderers, normalization *must not* affect the rendering (if it does, this is due to bugs in the renderers, not bugs in the normalizer used on MediaWiki).
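The same effect can be shown for Arabic with Python's standard `unicodedata` module. This is a minimal sketch with one letter and two marks chosen for illustration: SHADDA has a higher combining class than FATHA, so a shadda typed first is moved after the vowel by normalization.

```python
import unicodedata

BEH = "\u0628"     # ARABIC LETTER BEH (base letter)
SHADDA = "\u0651"  # ARABIC SHADDA, combining class 33
FATHA = "\u064e"   # ARABIC FATHA (vowel), combining class 30

print(unicodedata.combining(FATHA), unicodedata.combining(SHADDA))  # 30 33

typed_ar = BEH + SHADDA + FATHA  # shadda entered before the vowel
normalized_ar = unicodedata.normalize("NFC", typed_ar)

# Canonical ordering sorts the marks by class: the vowel now precedes the shadda.
print(normalized_ar == BEH + FATHA + SHADDA)  # True
```

A conforming renderer should draw both sequences identically, since they are canonically equivalent; any visible difference points at the renderer or font rather than at the normalizer.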
[Bug 2399] Unicode normalization sorts Hebrew/Arabic/Myanmar vowels wrongly
https://bugzilla.wikimedia.org/show_bug.cgi?id=2399

--- Comment #50 from Amir E. Aharoni amir.ahar...@mail.huji.ac.il 2011-05-26 17:47:09 UTC ---
See another demonstration of this problem here: http://en.wikisource.org/wiki/User:Amire80/Havrakha
[Bug 2399] Unicode normalization sorts Hebrew/Arabic/Myanmar vowels wrongly
https://bugzilla.wikimedia.org/show_bug.cgi?id=2399

Brion Vibber br...@wikimedia.org changed:

  What       | Removed                        | Added
  Status     | REOPENED                       | ASSIGNED
  AssignedTo | wikibugs-l@lists.wikimedia.org | br...@wikimedia.org

--- Comment #51 from Brion Vibber br...@wikimedia.org 2011-05-26 17:54:56 UTC ---
Assigning to me so we can look over the current state and see about fixing it up.
[Bug 2399] Unicode normalization sorts Hebrew/Arabic/Myanmar vowels wrongly
https://bugzilla.wikimedia.org/show_bug.cgi?id=2399

Amir E. Aharoni amir.ahar...@mail.huji.ac.il changed:

  What       | Removed  | Added
  Status     | RESOLVED | REOPENED
  CC         |          | amir.ahar...@mail.huji.ac.il
  Resolution | LATER    |

--- Comment #49 from Amir E. Aharoni amir.ahar...@mail.huji.ac.il 2011-05-22 07:45:52 UTC ---
Marking REOPENED. The standard has been updated since 2006. We discussed this at the Berlin Hackathon.
[Bug 2399] Unicode normalization sorts Hebrew/Arabic/Myanmar vowels wrongly
https://bugzilla.wikimedia.org/show_bug.cgi?id=2399

lɛʁi לערי ריינהארט gangl...@torg.is changed:

  What     | Removed | Added
  CC       |         | gangl...@torg.is
  Priority | High    | Normal
[Bug 2399] Unicode normalization sorts Hebrew/Arabic/Myanmar vowels wrongly
https://bugzilla.wikimedia.org/show_bug.cgi?id=2399

--- Comment #48 from lɛʁi לערי ריינהארט gangl...@torg.is 2010-01-06 12:20:29 UTC ---
FYI: https://bugzilla.wikimedia.org/show_activity.cgi?id=2399
I did not change priorities; I only added myself to the CC list. It seems that the Priority field is gone.