Bug#536085: locales: ru_RU.UTF8 collate UKR-GHE incorrectly

2009-07-08 Thread Сергей Бурладян
Bastian Blank  writes:
> On Tue, Jul 07, 2009 at 10:58:56PM +0400, eshkin...@gmail.com wrote:
> > Bastian Blank  writes:
> > > You have to describe _why_ it is wrong. Different languages may have
> > > different collation rules.
> > Yes, I'll try :) Different languages do have different collation rules,
> > but the ru_RU.UTF8 locale is a Unicode locale, and as far as I
> > understand this table:
> > http://unicode.org/charts/collation/chart_Cyrillic.html
> 
> Where is the reference to the russian language?

I don't know; do you really need it? Please see below.

> > why after  ? This is mistake, IMHO.
> 
> Please present evidence in form of official documents for the russian
> language. According to wikipedia[1], this letter is not used in russian
> anyway.

Yes, this is what I was trying to say :) The Russian locale defines this
non-Russian (Ukrainian) letter (why?), and defines it at an IMHO very
strange position - after , because  is after 
in Unicode ( http://unicode.org/charts/collation/chart_Cyrillic.html ).

I assume that someone tried to implement correct Ukrainian collation
in the Russian locale, but did it incorrectly... This is not a problem for
Russian text, because Russian does not use this letter.

If the definition of this  letter can be removed from the Russian locale, or
placed at the correct position, as in the Ukrainian locale - before  at least,
then Ukrainian collation rules will work properly with the Russian locale too.

It would be useful IMHO.

BTW, as you can see in [1], ґ is located _before_ д.

> [1]: http://en.wikipedia.org/wiki/List_of_Cyrillic_letters

-- 
Best regards, Сергей Бурладян



--
To UNSUBSCRIBE, email to debian-glibc-requ...@lists.debian.org
with a subject of "unsubscribe". Trouble? Contact listmas...@lists.debian.org



Bug#536085: locales: ru_RU.UTF8 collate UKR-GHE incorrectly

2009-07-08 Thread Bastian Blank
On Tue, Jul 07, 2009 at 10:58:56PM +0400, eshkin...@gmail.com wrote:
> Bastian Blank  writes:
> > You have to describe _why_ it is wrong. Different languages may have
> > different collation rules.
> Yes, I'll try :) Different languages do have different collation rules,
> but the ru_RU.UTF8 locale is a Unicode locale, and as far as I
> understand this table:
> http://unicode.org/charts/collation/chart_Cyrillic.html

Where is the reference to the russian language?

> why after  ? This is mistake, IMHO.

Please present evidence in form of official documents for the russian
language. According to wikipedia[1], this letter is not used in russian
anyway.

Bastian

[1]: http://en.wikipedia.org/wiki/List_of_Cyrillic_letters

-- 
Warp 7 -- It's a law we can live with.






Bug#536085: locales: ru_RU.UTF8 collate UKR-GHE incorrectly

2009-07-07 Thread Сергей Бурладян
Bastian Blank  writes:

> On Tue, Jul 07, 2009 at 06:01:15PM +0400, Sergey Burladyan wrote:
> > The ru_RU.UTF8 locale collates UKR-GHE (U0491 and U0490) incorrectly;
> > here is an example:
> 
> You have to describe _why_ it is wrong. Different languages may have
> different collation rules.

Yes, I'll try :) Different languages do have different collation rules,
but the ru_RU.UTF8 locale is a Unicode locale, and as far as I
understand this table:
http://unicode.org/charts/collation/chart_Cyrillic.html

U0491 must be after U0433 and before U0434

but /usr/share/i18n/locales/ru_RU defines it like this:
reorder-after 

why after  ? This is mistake, IMHO.
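
The expected order stated above (U0491 after U0433 and before U0434) can
be turned into a yes/no check. The sketch below is only an illustration:
ghe_between is a hypothetical helper name, and it relies on `sort -c`,
which exits non-zero when its input is not already in order for the
active locale.

```shell
#!/bin/sh
# ghe_between LOCALE: succeed if LOCALE already considers "г ґ д" sorted,
# i.e. if U0491 collates after U0433 and before U0434.
# (Hypothetical helper; LOCALE is assumed to be installed.)
ghe_between() {
    printf '%s\n' г ґ д | LC_ALL="$1" sort -c 2>/dev/null
}

for loc in ru_RU.UTF-8 uk_UA.UTF-8; do
    if ghe_between "$loc"; then
        echo "$loc: ґ sorts between г and д"
    else
        echo "$loc: ґ does not sort between г and д"
    fi
done
```

If a locale is not installed, sort may fall back to plain byte order, in
which case the check fails for a reason unrelated to the locale's rules,
so it is worth confirming with `locale -a` first.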

-- 
Best regards, Сергей Бурладян






Bug#536085: locales: ru_RU.UTF8 collate UKR-GHE incorrectly

2009-07-07 Thread Bastian Blank
On Tue, Jul 07, 2009 at 06:01:15PM +0400, Sergey Burladyan wrote:
> The ru_RU.UTF8 locale collates UKR-GHE (U0491 and U0490) incorrectly;
> here is an example:

You have to describe _why_ it is wrong. Different languages may have
different collation rules.

Bastian

-- 
War isn't a good life, but it's life.
-- Kirk, "A Private Little War", stardate 4211.8






Bug#536085: locales: ru_RU.UTF8 collate UKR-GHE incorrectly

2009-07-07 Thread Sergey Burladyan
Package: locales
Version: 2.9-12
Severity: normal


The ru_RU.UTF8 locale collates UKR-GHE (U0491 and U0490) incorrectly;
here is an example:

wrong:
s...@seb:~$ (export LANG=ru_RU.UTF-8; echo "абвгґдеєжзиіїйклмнопрстуфхцчшщьюя" | sed -e 's/\(.\)/\1\n/g' | sort | head)

а
б
в
г
д
ґ
е
є
ж

correct:
s...@seb:~$ (export LANG=uk_UA.UTF-8; echo "абвгґдеєжзиіїйклмнопрстуфхцчшщьюя" | sed -e 's/\(.\)/\1\n/g' | sort | head)

а
б
в
г
ґ
д
е
є
ж

correct:
s...@seb:~$ (export LANG=en_US.UTF-8; echo "абвгґдеєжзиіїйклмнопрстуфхцчшщьюя" | sed -e 's/\(.\)/\1\n/g' | sort | head)

а
б
в
г
ґ
д
е
є
ж
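
The same comparison can be run more compactly, as sketched below. This is
a rough illustration rather than part of the original test: it assumes a
POSIX shell and a sort that honours LC_ALL, and sort_letters is a made-up
helper name. The C locale is included as a byte-order baseline. In byte
order ґ comes after every other letter of the test string, not directly
after д, so a result like the "wrong" one above points at the locale's
own rule rather than at a missing locale.

```shell
#!/bin/sh
# Letters around the disputed position, taken from the test string above.
LETTERS='в г ґ д е'

# sort_letters LOCALE: print the letters in LOCALE's collation order on one line.
# (Hypothetical helper name; LOCALE must be installed for the result to mean anything.)
sort_letters() {
    # $LETTERS is left unquoted on purpose so it splits into one argument per letter.
    printf '%s\n' $LETTERS | LC_ALL="$1" sort | paste -s -d ' ' -
}

for loc in C ru_RU.UTF-8 uk_UA.UTF-8 en_US.UTF-8; do
    printf '%-12s ' "$loc"
    sort_letters "$loc"
done
```

Passing the letters as separate arguments avoids splitting the string with
sed, which also removes the empty first line seen in the output above.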


-- System Information:
Debian Release: squeeze/sid
  APT prefers testing
  APT policy: (800, 'testing'), (800, 'stable'), (70, 'unstable'), (65, 'experimental')
Architecture: i386 (i686)

Kernel: Linux 2.6.26-2-686 (SMP w/1 CPU core)
Locale: LANG=ru_RU.UTF-8, LC_CTYPE=ru_RU.UTF-8 (charmap=UTF-8)
Shell: /bin/sh linked to /bin/bash

Versions of packages locales depends on:
ii  debconf [debconf-2.0] 1.5.26 Debian configuration management sy
ii  libc6 [glibc-2.9-1]   2.9-4  GNU C Library: Shared libraries

locales recommends no packages.

locales suggests no packages.

-- debconf information:
* locales/default_environment_locale: ru_RU.UTF-8
* locales/locales_to_be_generated: en_GB ISO-8859-1, en_GB.ISO-8859-15 
ISO-8859-15, en_GB.UTF-8 UTF-8, en_US ISO-8859-1, en_US.ISO-8859-15 
ISO-8859-15, en_US.UTF-8 UTF-8, ru_RU ISO-8859-5, ru_RU.CP1251 CP1251, 
ru_RU.KOI8-R KOI8-R, ru_RU.UTF-8 UTF-8, ru_UA.UTF-8 UTF-8, uk_UA.UTF-8 UTF-8


