Bug#570929: Hungarian locale: zs is treated as a single letter, with undesirable consequences

2014-04-09 Thread Andras Korn
On Fri, Feb 26, 2010 at 12:38:08AM -0600, Jonathan Nieder wrote: Computers are dumb -- Andras wrote: 1. grep has no way of knowing whether a zs sequence is a single letter or two letters, because the combination can occur in compound words without becoming a zs

Bug#570929: Hungarian locale: zs is treated as a single letter, with undesirable consequences

2010-02-25 Thread Jonathan Nieder
Hi again, Odd names for collating elements I wrote: $ echo 'ch and more' | LANG=cy_GB.UTF-8 sed 's/[[.ch.]]/MATCHED/' sed: -e expression #1, char 21: Invalid collation character Odd, no? It did seem odd, especially since the POSIX documentation uses

Bug#570929: Hungarian locale: zs is treated as a single letter, with undesirable consequences

2010-02-24 Thread Andras Korn
On Tue, Feb 23, 2010 at 10:29:25PM -0600, Jonathan Nieder wrote: 2. zs is the last letter of the Hungarian alphabet; therefore, no sane character range in a regular expression can include it ([a-zs] would be ambiguous because there isn't a zs glyph). Would [a-[.zs.]] work? ̈́No, because

Bug#570929: Hungarian locale: zs is treated as a single letter, with undesirable consequences

2010-02-24 Thread Clint Adams
On Wed, Feb 24, 2010 at 10:13:09AM +0100, Andras Korn wrote: ??No, because apparently [.zs.] isn't a valid collating element: Should it be? -- To UNSUBSCRIBE, email to debian-bugs-dist-requ...@lists.debian.org with a subject of unsubscribe. Trouble? Contact listmas...@lists.debian.org

Bug#570929: Hungarian locale: zs is treated as a single letter, with undesirable consequences

2010-02-24 Thread Jonathan Nieder
Clint Adams wrote: On Wed, Feb 24, 2010 at 10:13:09AM +0100, Andras Korn wrote: No, because apparently [.zs.] isn't a valid collating element: Should it be? Yes, I think so: it comes after z in alphabetical order. See http://lists.mysql.com/mysql/204718 for example. glibc thinks so too,

Bug#570929: Hungarian locale: zs is treated as a single letter, with undesirable consequences

2010-02-23 Thread Andras Korn
On Mon, Feb 22, 2010 at 11:07:21AM +0100, Andras Korn wrote: 1. grep has no way of knowing whether a zs sequence is a single letter or two letters, because the combination can occur in compound words without becoming a zs letter; for example, in fúvószenekar (fúvós + zenekar), it's simply an

Bug#570929: Hungarian locale: zs is treated as a single letter, with undesirable consequences

2010-02-23 Thread Jonathan Nieder
Hi, I have no clue about the rest of these, but Andras Korn wrote: On Mon, Feb 22, 2010 at 11:07:21AM +0100, Andras Korn wrote: 2. zs is the last letter of the Hungarian alphabet; therefore, no sane character range in a regular expression can include it ([a-zs] would be ambiguous because

Bug#570929: Hungarian locale: zs is treated as a single letter, with undesirable consequences

2010-02-22 Thread Andras Korn
Package: locales Version: 2.10.2-6 Severity: normal Hi, in Hungarian, zs (as well as sz, cs, ty, dz, dzs, gy and ly) are said to be part of the alphabet and each combination is considered to be a single letter; however, they are represented by two or more characters; there aren't single glyphs