Bug#536085: locales: ru_RU.UTF8 collate UKR-GHE incorrectly
2009-07-08
Thread
=D0=A1=D0=B5=D1=80=D0=B3=D0=B5=D0=B9_?= =?UTF-8?Q?=D0=91=D1=83=D1=80=D0=BB=D0=B0=D0=B4=D1=8F=D0=BD
Bastian Blank writes: > On Tue, Jul 07, 2009 at 10:58:56PM +0400, eshkin...@gmail.com wrote: > > Bastian Blank writes: > > > You have to describe _why_ it is wrong. Different languages may have > > > different collation rules. > > Yes, i try :) Yes, different languages have different collation, but > > ru_RU.UTF8 locale is unicode locale and as i can understand this table: > > http://unicode.org/charts/collation/chart_Cyrillic.html > > Where is the reference to the russian language? I don't know, is you really need it ? Please, see below. > > why after ? This is mistake, IMHO. > > Please present evidence in form of official documents for the russian > language. According to wikipedia[1], this letter is not used in russian > anyway. Yes, this is what I was trying to say :) Russian locale define this non-Russian (Ukrainian) letter (why ?), and define it at IMHO very strange position - after , because is after in unicode ( http://unicode.org/charts/collation/chart_Cyrillic.html ). I assume that someone trying to implement correct Ukrainian collation with Russian locale, but do this incorrectly... This is not a problem for Russian text, because Russian is not use this letter. If definition of this letter can be removed from Russian locale, or placed at correct position, like in Ukrainian locale - before at least, then Ukrainian collation rules will be work properly with Russian locale too. It would be useful IMHO. BTW, as you can see in [1], (ґ) is located _before_ (д). > [1]: http://en.wikipedia.org/wiki/List_of_Cyrillic_letters -- С уважением, Сергей Бурладян -- To UNSUBSCRIBE, email to debian-glibc-requ...@lists.debian.org with a subject of "unsubscribe". Trouble? Contact listmas...@lists.debian.org
Bug#536085: locales: ru_RU.UTF8 collate UKR-GHE incorrectly
On Tue, Jul 07, 2009 at 10:58:56PM +0400, eshkin...@gmail.com wrote: > Bastian Blank writes: > > You have to describe _why_ it is wrong. Different languages may have > > different collation rules. > Yes, i try :) Yes, different languages have different collation, but > ru_RU.UTF8 locale is unicode locale and as i can understand this table: > http://unicode.org/charts/collation/chart_Cyrillic.html Where is the reference to the russian language? > why after ? This is mistake, IMHO. Please present evidence in form of official documents for the russian language. According to wikipedia[1], this letter is not used in russian anyway. Bastian [1]: http://en.wikipedia.org/wiki/List_of_Cyrillic_letters -- Warp 7 -- It's a law we can live with. -- To UNSUBSCRIBE, email to debian-glibc-requ...@lists.debian.org with a subject of "unsubscribe". Trouble? Contact listmas...@lists.debian.org
Bug#536085: locales: ru_RU.UTF8 collate UKR-GHE incorrectly
2009-07-07
Thread
=D0=A1=D0=B5=D1=80=D0=B3=D0=B5=D0=B9_?= =?UTF-8?Q?=D0=91=D1=83=D1=80=D0=BB=D0=B0=D0=B4=D1=8F=D0=BD
Bastian Blank writes: > On Tue, Jul 07, 2009 at 06:01:15PM +0400, Sergey Burladyan wrote: > > ru_RU.UTF8 locale collate UKR-GHE (U0491 and U0490) incorrectly, here is > > example: > > You have to describe _why_ it is wrong. Different languages may have > different collation rules. Yes, i try :) Yes, different languages have different collation, but ru_RU.UTF8 locale is unicode locale and as i can understand this table: http://unicode.org/charts/collation/chart_Cyrillic.html U0491 must be after U0433 and before U0434 but /usr/share/i18n/locales/ru_RU define it like this: reorder-after why after ? This is mistake, IMHO. -- С уважением, Сергей Бурладян -- To UNSUBSCRIBE, email to debian-glibc-requ...@lists.debian.org with a subject of "unsubscribe". Trouble? Contact listmas...@lists.debian.org
Bug#536085: locales: ru_RU.UTF8 collate UKR-GHE incorrectly
On Tue, Jul 07, 2009 at 06:01:15PM +0400, Sergey Burladyan wrote: > ru_RU.UTF8 locale collate UKR-GHE (U0491 and U0490) incorrectly, here is > example: You have to describe _why_ it is wrong. Different languages may have different collation rules. Bastian -- War isn't a good life, but it's life. -- Kirk, "A Private Little War", stardate 4211.8 -- To UNSUBSCRIBE, email to debian-glibc-requ...@lists.debian.org with a subject of "unsubscribe". Trouble? Contact listmas...@lists.debian.org
Bug#536085: locales: ru_RU.UTF8 collate UKR-GHE incorrectly
Package: locales Version: 2.9-12 Severity: normal ru_RU.UTF8 locale collate UKR-GHE (U0491 and U0490) incorrectly, here is example: wrong: s...@seb:~$ (export LANG=ru_RU.UTF-8; echo "абвгґдеєжзиіїйклмнопрстуфхцчшщьюя" | sed -e 's/\(.\)/\1\n/g' | sort | head) а б в г д ґ е є ж correct: s...@seb:~$ (export LANG=uk_UA.UTF-8; echo "абвгґдеєжзиіїйклмнопрстуфхцчшщьюя" | sed -e 's/\(.\)/\1\n/g' | sort | head) а б в г ґ д е є ж correct: s...@seb:~$ (export LANG=en_US.UTF-8; echo "абвгґдеєжзиіїйклмнопрстуфхцчшщьюя" | sed -e 's/\(.\)/\1\n/g' | sort | head) а б в г ґ д е є ж -- System Information: Debian Release: squeeze/sid APT prefers testing APT policy: (800, 'testing'), (800, 'stable'), (70, 'unstable'), (65, 'experimental') Architecture: i386 (i686) Kernel: Linux 2.6.26-2-686 (SMP w/1 CPU core) Locale: LANG=ru_RU.UTF-8, LC_CTYPE=ru_RU.UTF-8 (charmap=UTF-8) Shell: /bin/sh linked to /bin/bash Versions of packages locales depends on: ii debconf [debconf-2.0] 1.5.26 Debian configuration management sy ii libc6 [glibc-2.9-1] 2.9-4 GNU C Library: Shared libraries locales recommends no packages. locales suggests no packages. -- debconf information: * locales/default_environment_locale: ru_RU.UTF-8 * locales/locales_to_be_generated: en_GB ISO-8859-1, en_GB.ISO-8859-15 ISO-8859-15, en_GB.UTF-8 UTF-8, en_US ISO-8859-1, en_US.ISO-8859-15 ISO-8859-15, en_US.UTF-8 UTF-8, ru_RU ISO-8859-5, ru_RU.CP1251 CP1251, ru_RU.KOI8-R KOI8-R, ru_RU.UTF-8 UTF-8, ru_UA.UTF-8 UTF-8, uk_UA.UTF-8 UTF-8 -- To UNSUBSCRIBE, email to debian-glibc-requ...@lists.debian.org with a subject of "unsubscribe". Trouble? Contact listmas...@lists.debian.org