[Bug 2399] Unicode normalization sorts Hebrew/Arabic/Myanmar vowels wrongly

2014-05-18 Thread bugzilla-daemon
https://bugzilla.wikimedia.org/show_bug.cgi?id=2399

Andre Klapper aklap...@wikimedia.org changed:

   What|Removed |Added

 Status|ASSIGNED|NEW

--- Comment #60 from Andre Klapper aklap...@wikimedia.org ---
Amir: Do you (or the L10N team) plan to take a look at this at some point?
This ticket ranks 14th among the open tickets with the most votes...

-- 
You are receiving this mail because:
You are on the CC list for the bug.
___
Wikibugs-l mailing list
Wikibugs-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikibugs-l


[Bug 2399] Unicode normalization sorts Hebrew/Arabic/Myanmar vowels wrongly

2014-02-22 Thread bugzilla-daemon
https://bugzilla.wikimedia.org/show_bug.cgi?id=2399

--- Comment #59 from Dovi Jacobs dovijac...@yahoo.com ---
For an extremely clear description of the problem in Hebrew, see here (pp. 8
ff.):
http://www.sbl-site.org/Fonts/SBLHebrewUserManual1.5x.pdf



[Bug 2399] Unicode normalization sorts Hebrew/Arabic/Myanmar vowels wrongly

2012-07-30 Thread bugzilla-daemon
https://bugzilla.wikimedia.org/show_bug.cgi?id=2399

matanya matanya.mo...@gmail.com changed:

   What|Removed |Added

 CC||matanya.mo...@gmail.com
 AssignedTo|br...@wikimedia.org|amir.ahar...@mail.huji.ac.il

--- Comment #58 from matanya matanya.mo...@gmail.com 2012-07-30 13:53:46 UTC 
---
Reassigned to Amir, as he is one of the localization engineers. This bug is
still present, as can be seen at:
https://en.wikisource.org/wiki/User:Amire80/Havrakha



[Bug 2399] Unicode normalization sorts Hebrew/Arabic/Myanmar vowels wrongly

2011-12-30 Thread bugzilla-daemon
https://bugzilla.wikimedia.org/show_bug.cgi?id=2399

Jan Kucera (Kozuch) garba...@seznam.cz changed:

   What|Removed |Added

   Priority|Lowest  |Highest
 CC||garba...@seznam.cz

--- Comment #58 from Jan Kucera (Kozuch) garba...@seznam.cz 2011-12-30 
15:46:32 UTC ---
Raising importance/priority because of votes, according to the following scheme:
15+ votes - highest
5-15 votes - high
The community must have a voice within development.

Regards, Kozuch
http://en.wikipedia.org/wiki/User:Kozuch



[Bug 2399] Unicode normalization sorts Hebrew/Arabic/Myanmar vowels wrongly

2011-12-30 Thread bugzilla-daemon
https://bugzilla.wikimedia.org/show_bug.cgi?id=2399

Bugmeister Bot mhershber...@wikimedia.org changed:

   What|Removed |Added

   Priority|Highest |Lowest
 CC|garba...@seznam.cz  |



[Bug 2399] Unicode normalization sorts Hebrew/Arabic/Myanmar vowels wrongly

2011-12-08 Thread bugzilla-daemon
https://bugzilla.wikimedia.org/show_bug.cgi?id=2399

Ryan Kaldari rkald...@wikimedia.org changed:

   What|Removed |Added

 CC||rkald...@wikimedia.org

--- Comment #57 from Ryan Kaldari rkald...@wikimedia.org 2011-12-08 21:39:36 
UTC ---
This should probably be reassigned to one of our localization engineers.



[Bug 2399] Unicode normalization sorts Hebrew/Arabic/Myanmar vowels wrongly

2011-09-29 Thread bugzilla-daemon
https://bugzilla.wikimedia.org/show_bug.cgi?id=2399

merelo...@gmail.com changed:

   What|Removed |Added

 CC||merelo...@gmail.com

--- Comment #56 from merelo...@gmail.com 2011-09-29 12:48:02 UTC ---
*** Bug 31183 has been marked as a duplicate of this bug. ***



[Bug 2399] Unicode normalization sorts Hebrew/Arabic/Myanmar vowels wrongly

2011-09-04 Thread bugzilla-daemon
https://bugzilla.wikimedia.org/show_bug.cgi?id=2399

Krinkle krinklem...@gmail.com changed:

   What|Removed |Added

 Blocks|3860|1527



[Bug 2399] Unicode normalization sorts Hebrew/Arabic/Myanmar vowels wrongly

2011-09-04 Thread bugzilla-daemon
https://bugzilla.wikimedia.org/show_bug.cgi?id=2399

Krinkle krinklem...@gmail.com changed:

   What|Removed |Added

 Blocks|30672   |30673



[Bug 2399] Unicode normalization sorts Hebrew/Arabic/Myanmar vowels wrongly

2011-09-03 Thread bugzilla-daemon
https://bugzilla.wikimedia.org/show_bug.cgi?id=2399

Siebrand s.mazel...@xs4all.nl changed:

   What|Removed |Added

 CC||s.mazel...@xs4all.nl
 Blocks||30672



[Bug 2399] Unicode normalization sorts Hebrew/Arabic/Myanmar vowels wrongly

2011-09-03 Thread bugzilla-daemon
https://bugzilla.wikimedia.org/show_bug.cgi?id=2399

Philippe Verdy verd...@wanadoo.fr changed:

   What|Removed |Added

 CC||verd...@wanadoo.fr

--- Comment #52 from Philippe Verdy verd...@wanadoo.fr 2011-09-03 15:58:14 
UTC ---
Apparently, you have not implemented the contractions and expansions of UCA.

Note that there has been NO change in Unicode 5.1 (or later) to the
normalization, which has been stable since at least Unicode 4.0.1.
The bugs above are most probably not related to normalization, if it is
implemented correctly (and normalization is an easy problem that can be
implemented very efficiently).

And the changes in the DUCET (or now the CLDR DUCET) do not affect how Hebrew,
Arabic or Myanmar is sorted, within the same script.

You should also learn to separate the Unicode Normalization Algorithm (UNA),
the Unicode Collation Algorithm (UCA), and the Unicode Bidi Algorithm (UBA),
because the Bidi algorithm only affects the display, and definitely NOT the
other two.

And the order produced by normalization is orthogonal to the order of collation
weights generated by UCA, even if normalization is assumed to be performed
before computing collations (this is not a requirement; it just helps to
reduce the problem by making sure that canonically equivalent strings will
collate the same).

Many posters above seem to be completely mixing up these problems!
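
The point about canonical equivalence can be illustrated with a minimal
sketch, assuming Python's standard `unicodedata` module. The helper name
`canonically_equal` is hypothetical, and this is not a UCA implementation: it
only shows that two code point orders of the same Hebrew syllable differ as
raw strings but compare equal once both are normalized, which is why
normalizing first makes equivalent strings collate the same.

```python
import unicodedata

def canonically_equal(a: str, b: str) -> bool:
    """Return True if a and b are canonically equivalent (compared in NFD)."""
    return unicodedata.normalize("NFD", a) == unicodedata.normalize("NFD", b)

# BET + DAGESH + PATAH versus BET + PATAH + DAGESH (the normalized order)
typed = "\u05D1\u05BC\u05B7"
normalized = "\u05D1\u05B7\u05BC"

print(typed == normalized)                   # raw code point comparison
print(canonically_equal(typed, normalized))  # comparison after normalization
```

A real collation would still need UCA weights on top of this; the sketch only
covers the normalization step.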



[Bug 2399] Unicode normalization sorts Hebrew/Arabic/Myanmar vowels wrongly

2011-09-03 Thread bugzilla-daemon
https://bugzilla.wikimedia.org/show_bug.cgi?id=2399

--- Comment #53 from Philippe Verdy verd...@wanadoo.fr 2011-09-03 16:00:22 
UTC ---
Note: for Thai, Lao, Tai Viet, the normalization does not reorder the prepended
vowels (nor does the Bidi algorithm).

But such reordering is *required* when implementing the UCA, and this takes the
form of contractions and expansions that are present in the DUCET for these
scripts.
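
A minimal sketch, assuming Python's standard `unicodedata` module, of why
normalization leaves Thai prepended vowels alone: both the vowel and the
consonant have combining class 0, so canonical reordering never applies to
them. The sample syllable is chosen only for illustration; any
collation-level rearrangement would have to come from a UCA implementation,
which is not shown here.

```python
import unicodedata

# Thai syllable stored in visual order: SARA E (U+0E40) precedes KO KAI (U+0E01)
syllable = "\u0E40\u0E01"

for ch in syllable:
    # Canonical combining class 0 means the character is never reordered
    print(f"U+{ord(ch):04X} {unicodedata.name(ch)} ccc={unicodedata.combining(ch)}")

# Both normalization forms leave the stored order untouched
unchanged = all(
    unicodedata.normalize(form, syllable) == syllable for form in ("NFC", "NFD")
)
print(unchanged)
```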



[Bug 2399] Unicode normalization sorts Hebrew/Arabic/Myanmar vowels wrongly

2011-09-03 Thread bugzilla-daemon
https://bugzilla.wikimedia.org/show_bug.cgi?id=2399

--- Comment #54 from Philippe Verdy verd...@wanadoo.fr 2011-09-03 16:33:04 
UTC ---
Final note: it is highly recommended NOT to save texts with an implicit
normalization, even if normalization is implemented correctly.

There are known defects (yes, bugs) in the renderers of browsers, which
frequently do not implement normalization and are not able to sort, combine and
position the diacritics correctly if they are not in a specific order, which is
not the same as the normalized order.

There are also incorrect assumptions made by writers (who have not
understood when and where to insert CGJ to prevent normalization from
reordering some pairs of diacritics), and so have written their texts in such a
way that they seem to render correctly, but only on a bogus browser that does
not perform normalization correctly and/or has strong limitations in its
text renderer (unable to recognize strings that are canonically equivalent, and
expecting only one order for successive diacritics in order to
position them correctly).
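
As a hedged illustration of the CGJ technique mentioned above, the sketch
below assumes Python's standard `unicodedata` module and uses a Hebrew BET
with DAGESH and PATAH as sample data; the helper `codepoints` is hypothetical.
COMBINING GRAPHEME JOINER (U+034F) has combining class 0, so canonical
reordering does not move marks across it. Whether a given renderer copes with
the extra character is a separate question that this example does not
address.

```python
import unicodedata

CGJ = "\u034F"  # COMBINING GRAPHEME JOINER, combining class 0

plain = "\u05D1\u05BC\u05B7"                # BET, DAGESH, PATAH
with_cgj = "\u05D1\u05BC" + CGJ + "\u05B7"  # BET, DAGESH, CGJ, PATAH

def codepoints(s: str) -> list:
    """List the code points of s as U+XXXX strings."""
    return [f"U+{ord(ch):04X}" for ch in s]

# Without CGJ, the PATAH (ccc 17) is moved in front of the DAGESH (ccc 21)
print(codepoints(unicodedata.normalize("NFC", plain)))
# With CGJ between them, the original order survives normalization
print(codepoints(unicodedata.normalize("NFC", with_cgj)))
```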

This type of defect is typical of the bug described above about the
normalized order of the DAGESH (a point in the middle of a consonant
letter, which modifies it) or the SIN/SHIN DOTS (above the letter, on the left
or right, also modifying the consonant), and the other Hebrew vowel
diacritics: yes, normalization reorders the vowel diacritics before the
diacritics that modify the consonant (this is the effect of an old assignment
of their relative combining classes, in a completely illogical order of
values, but this will NEVER be changed as it would affect normalization).
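
The combining class assignment described above can be inspected directly. The
following is a minimal sketch, assuming Python's standard `unicodedata`
module; the selection of marks and the sample SHIN sequence are chosen only
for illustration.

```python
import unicodedata

marks = {
    "PATAH": "\u05B7",
    "QAMATS": "\u05B8",
    "DAGESH": "\u05BC",
    "SHIN DOT": "\u05C1",
    "SIN DOT": "\u05C2",
}

# Canonical combining class of each mark, as recorded in the Unicode database
ccc = {name: unicodedata.combining(ch) for name, ch in marks.items()}
for name, value in ccc.items():
    print(f"{name:8} ccc={value}")

# Canonical ordering sorts adjacent marks by ascending combining class, so the
# vowel ends up before the marks that modify the consonant.
shin_typed = "\u05E9\u05C1\u05BC\u05B8"  # SHIN, SHIN DOT, DAGESH, QAMATS
shin_nfc = unicodedata.normalize("NFC", shin_typed)
print([f"U+{ord(ch):04X}" for ch in shin_nfc])
```

The vowels have lower class values than the dagesh and the sin/shin dots,
which is exactly why the normalized order places them first.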

But many renderers are not able to display correctly the strings that are
encoded in normalized order (base consonant, vowel diacritic, sin dot or shin
dot or dagesh). Instead they expect the string to be encoded as (base
consonant, dagesh or sin dot or shin dot, vowel diacritic), even though it is
canonically equivalent to the previous one and should display exactly
the same! (Such rendering bugs were found in old versions of Windows with IE6
or earlier.)

For this reason, you should not, on MediaWiki, apply any implicit
renormalization to edited text. If one wants to enter (base consonant,
dagesh or sin dot or shin dot, vowel diacritic) in the wiki text, keep it
unchanged and do not normalize it, as it will display correctly on both the old
bogus renderers and newer ones.



[Bug 2399] Unicode normalization sorts Hebrew/Arabic/Myanmar vowels wrongly

2011-09-03 Thread bugzilla-daemon
https://bugzilla.wikimedia.org/show_bug.cgi?id=2399

--- Comment #55 from Philippe Verdy verd...@wanadoo.fr 2011-09-03 16:37:59 
UTC ---
All my remarks in the previous message also apply to the Arabic diacritics.

For example, the assumptions made by Brion Vibber in his message #23 are
completely wrong. He has not understood what normalization is, nor the fact
that, with conforming renderers, normalization *must not* affect the
rendering (if it does, this is due to bugs in the renderers, not bugs in the
normalizer used on MediaWiki).
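
For the Arabic case, a minimal sketch, assuming Python's standard
`unicodedata` module and a sample BEH + SHADDA + FATHA sequence chosen for
illustration: normalization swaps the two marks because FATHA has a lower
combining class than SHADDA, yet both sequences remain canonically
equivalent, so a conforming renderer should display them identically.

```python
import unicodedata

# BEH + SHADDA + FATHA, a sample input sequence
typed = "\u0628\u0651\u064E"
nfc = unicodedata.normalize("NFC", typed)

for label, text in (("typed", typed), ("NFC", nfc)):
    # Show each character's name together with its combining class
    print(label, [(unicodedata.name(ch), unicodedata.combining(ch)) for ch in text])

# Canonically equivalent: both spellings share the same NFD form
same_text = unicodedata.normalize("NFD", typed) == unicodedata.normalize("NFD", nfc)
print(same_text)
```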



[Bug 2399] Unicode normalization sorts Hebrew/Arabic/Myanmar vowels wrongly

2011-05-26 Thread bugzilla-daemon
https://bugzilla.wikimedia.org/show_bug.cgi?id=2399

--- Comment #50 from Amir E. Aharoni amir.ahar...@mail.huji.ac.il 2011-05-26 
17:47:09 UTC ---
See another demonstration of this problem here:

http://en.wikisource.org/wiki/User:Amire80/Havrakha



[Bug 2399] Unicode normalization sorts Hebrew/Arabic/Myanmar vowels wrongly

2011-05-26 Thread bugzilla-daemon
https://bugzilla.wikimedia.org/show_bug.cgi?id=2399

Brion Vibber br...@wikimedia.org changed:

   What|Removed |Added

 Status|REOPENED|ASSIGNED
 AssignedTo|wikibugs-l@lists.wikimedia.org|br...@wikimedia.org

--- Comment #51 from Brion Vibber br...@wikimedia.org 2011-05-26 17:54:56 UTC 
---
Assigning to me so we can look over the current state and see about fixing it
up.



[Bug 2399] Unicode normalization sorts Hebrew/Arabic/Myanmar vowels wrongly

2011-05-22 Thread bugzilla-daemon
https://bugzilla.wikimedia.org/show_bug.cgi?id=2399

Amir E. Aharoni amir.ahar...@mail.huji.ac.il changed:

   What|Removed |Added

 Status|RESOLVED|REOPENED
 CC||amir.ahar...@mail.huji.ac.il
 Resolution|LATER   |

--- Comment #49 from Amir E. Aharoni amir.ahar...@mail.huji.ac.il 2011-05-22 
07:45:52 UTC ---
Marking REOPENED. The standard has been updated since 2006. We discussed this
at the Berlin Hackathon.



[Bug 2399] Unicode normalization sorts Hebrew/Arabic/Myanmar vowels wrongly

2010-01-06 Thread bugzilla-daemon
https://bugzilla.wikimedia.org/show_bug.cgi?id=2399


lɛʁi לערי ריינהארט gangl...@torg.is changed:

   What|Removed |Added

 CC||gangl...@torg.is
   Priority|High|Normal






[Bug 2399] Unicode normalization sorts Hebrew/Arabic/Myanmar vowels wrongly

2010-01-06 Thread bugzilla-daemon
https://bugzilla.wikimedia.org/show_bug.cgi?id=2399





--- Comment #48 from lɛʁi לערי ריינהארט gangl...@torg.is  2010-01-06 12:20:29 
UTC ---
FYI: https://bugzilla.wikimedia.org/show_activity.cgi?id=2399
I did not change priorities; I only added myself to the CC list.
It seems that the Priority field is gone.

