На Sun, 1 Feb 2009 00:36:45 +0600
"Fedor P. Goncharov" <[email protected]> записано:
> But if uniq work in utf-8 environment with KOI8R
> dictionary, it just omit most of lines as repeating. You can verify
> my words if compare dictionary in repository before and after patch
> ( 1,8M vs 74K for /usr/share/myspell/dicts/ru_RU.dic).
Sure, debian build scripts are intended to work in clean environment
and in C locale. I don't know why Martin-Eric doesn't use pbuilder or
something similar.
> Second:
> BUG #511290 already fixed in this string:
> grep -h '[��]' $(DICTIONARIES) | tr '\243\263' '\305\345' >
> yo_subst.koi ^^^^^^^^^^^^
>
yo==IO==ё
> but broken because grep and tr don't know what they work with koi8r
> codepage and because somebody mistaken
Nope, tr uses symbol codes and works fine. The problem itself is in
grep.
So the right strings are:
build-arch:
# Generate ispell dictionary
LC_CTYPE=C grep -h '[ёЁ]' $(DICTIONARIES) | tr '\243\263'
'\305\345' > yo_subst.koi cat $(DICTIONARIES) yo_subst.koi >
$(ILANGUAGE).dict
And may be there should locale specification in line 23 before
i2myspell call, I don't know exactly.
На Sat, 31 Jan 2009 22:48:13 +0200
Martin-Éric Racine <[email protected]> записано:
> On Sat, Jan 31, 2009 at 8:36 PM, Fedor P. Goncharov
> <[email protected]> wrote:
> > Package: aspell-ru
> > Version: 0.99g5-6
> > Severity: critical
> >
> > First:
> > patch from bug #511290 do nothing instead "sorting the content and
> > filtering it". But if uniq work in utf-8 environment with KOI8R
> > dictionary, it just omit most of lines as repeating.
>
> Too many tools in free software went from assuming locale C to
> assuming UTF-8, which is why we have so many problems building this
> dictionary, then converting it from ispell format to myspell and
> aspell formats. This regularly breaks and I'm not sure how to solve
> this in a permanent and predictable way, because forcing each tool
> used in the scripts to use KOI8-R is not always possible.
It works fine in C locale.
>
> > Second:
> > BUG #511290 already fixed in this string:
> > grep -h '[��]' $(DICTIONARIES) | tr '\243\263' '\305\345' >
> > yo_subst.koi ^^^^^^^^^^^^
> >
yo==IO==ё
> > but broken because grep and tr don't know what they work with koi8r
> > codepage and because somebody mistaken
>
> Indeed, that's correct.
>
> > Third:
> > Товарисч Александр вы либо немножко
глупый, либо афигенный 'ч'удак,
> > в первом случае я вообще не понимаю как
у вас оказался данный патч,
> > но понимаю почему вы до сих пор не
перешли на утф, во втором вы
> > наверно догадались что я о вас думаю.
>
> (English summary: the bug reporter has an extremely negative view of
> upstream's work, among other things about his recalcitrance at getting
> around transiting the source files to UTF-8).
Nope, this not really smart message was addressed to me, as I can see.
There are nothing except personal offence there.
> I've always wondered that too. Transiting the source files to UTF-8
> would make it much easier to generate wordlists for a variety of
> dictionary formats and to maintain the build scripts without
> constantly having to edit them separately in a KOI8-R -enabled editor.
>
> However, repeated attempts by maintainers of dictionary packages at
> various distributions (Fedora, Debian, etc.) to contact upstream all
> resulted with upstream ignoring them. Some even tried phoning him
> directly at Moscow State University (which is where the MSU domain in
> his address comes from) and he hung up the phone on them.
As I said, I can try to contact him offline. Is this the only thing you
want to tell him, or there are something else? Please mail me the
details.
--
Best regards,
Alexander GQ Gerasiov
Contacts:
e-mail: [email protected] Jabber: [email protected]
Homepage: http://gq.net.ru ICQ: 7272757
PGP fingerprint: 0628 ACC7 291A D4AA 6D7D 79B8 0641 D82A E3E3 CE1D
--
To UNSUBSCRIBE, email to [email protected]
with a subject of "unsubscribe". Trouble? Contact [email protected]