Here is a snippet that reproduces the bug without involving
heavy machinery like kmail (where I first noticed it)
or thunderbird on the same data (where it also occurs).
It uses just plain sort(1) on a few strings:
$ locale
LANG=ru_RU.UTF-8
LANGUAGE=en_IL:en_US:en_GB:en
LC_CTYPE="ru_RU.UTF-8"
LC_NUMERIC="ru_RU.UTF-8"
LC_TIME="ru_RU.UTF-8"
LC_COLLATE="ru_RU.UTF-8"
LC_MONETARY="ru_RU.UTF-8"
LC_MESSAGES="ru_RU.UTF-8"
LC_PAPER="ru_RU.UTF-8"
LC_NAME="ru_RU.UTF-8"
LC_ADDRESS="ru_RU.UTF-8"
LC_TELEPHONE="ru_RU.UTF-8"
LC_MEASUREMENT="ru_RU.UTF-8"
LC_IDENTIFICATION="ru_RU.UTF-8"
LC_ALL=
$ sort
какао
kde
имбирь
!admin
block
!spam
!admin
block
kde
!spam
имбирь
какао
$ LANG="en_GB.UTF-8"
$ export LANG
$ sort
какао
kde
имбирь
!admin
block
!spam
!admin
!spam
block
kde
имбирь
какао
As a native Russian speaker, I (and every Russian-speaking friend of
mine who has seen this behaviour) believe that the sort order under
the Russian locale should be the same as in the second example,
under the English one. The visible difference is that under ru_RU
"!spam" lands after "kde", as if the leading "!" were ignored.
However, I plead ignorance of any formal collation standard for
Russian, so if anybody has a reference to one, please post it.
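
For anyone who wants to re-run this without typing the strings by hand, here is a minimal sketch of a non-interactive version. The helper names words and sort_in are made up for this example, and the C-locale run is an extra byte-order baseline that was not part of the session above. The ru_RU.UTF-8 and en_GB.UTF-8 runs assume those locales are installed (check with locale -a); if they are not, sort will most likely fall back to byte order and the comparison tells you nothing.

```shell
#!/bin/sh
# The six test strings from the session above, one per line.
words() {
    printf '%s\n' 'какао' 'kde' 'имбирь' '!admin' 'block' '!spam'
}

# Sort the strings under the locale given as $1.
# LC_ALL overrides LANG and every individual LC_* variable,
# so it is enough to set it for the sort process alone.
sort_in() {
    words | LC_ALL="$1" sort
}

# Byte-order baseline; the C locale is always available.
echo '== C =='
sort_in C

# The two locales from the report (assumed to be installed).
echo '== ru_RU.UTF-8 =='
sort_in ru_RU.UTF-8

echo '== en_GB.UTF-8 =='
sort_in en_GB.UTF-8
```

With UTF-8 input, the C-locale run orders the strings purely by byte value, which for this data gives the same order as the en_GB.UTF-8 output above: the two "!" strings, then the Latin words, then the Cyrillic ones.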