Package: locales
Version: 2.3.6.ds1-13
Severity: important
Tags: patch l10n

When sorting data, the sort order ignores spaces, which makes it very annoying to use with a database like PostgreSQL. An example follows:

    $ cat list.txt   # A random name list
    Adriano José
    Adriana da Silva
    Adrian Kuerten

The strange behavior:

    $ cat lista.txt | sort
    Adriana da Silva
    Adrian Kuerten
    Adriano José

After changing the LC_COLLATE section of /usr/share/i18n/locales/pt_BR to:

    LC_COLLATE
    copy "iso14651_t1"
    reorder-after <U00A0>
    <U0020><CAP>;<CAP>;<CAP>;<U0020>
    reorder-end
    END LC_COLLATE

I get the correct behavior:

    $ cat lista.txt | sort
    Adrian Kuerten
    Adriana da Silva
    Adriano José

There is a related report at http://sourceware.org/bugzilla/show_bug.cgi?id=3405, but the pt_BR file there doesn't work well with characters such as 'a', 'á', 'ã', etc. I think this could be a problem for other languages too, but I am not sure.

-- System Information:
Debian Release: 4.0
  APT prefers testing
  APT policy: (500, 'testing')
Architecture: i386 (i686)
Shell:  /bin/sh linked to /bin/bash
Kernel: Linux 2.6.18-4-686
Locale: LANG=pt_BR, LC_CTYPE=pt_BR (charmap=ISO-8859-1)

Versions of packages locales depends on:
ii  debconf [debconf-2.0]       1.5.11        Debian configuration management sy
ii  libc6 [glibc-2.3.6.ds1-1]   2.3.6.ds1-13  GNU C Library: Shared libraries

locales recommends no packages.

-- debconf information:
* locales/default_environment_locale: pt_BR.UTF-8
* locales/locales_to_be_generated: pt_BR ISO-8859-1, pt_BR.UTF-8 UTF-8
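
For anyone trying to reproduce the ordering difference described above without editing locale files, a minimal sketch follows. It assumes a POSIX shell with `mktemp` and `sort` available, and the scratch file and the variable name `sorted_c` are only illustrative. It sorts the same three names under the C locale, where comparison is by byte value and the space (0x20) therefore sorts before any letter. This is not the proposed fix: byte order also places accented characters after 'z', so it only shows the space handling that the patched pt_BR definition is meant to restore.

```shell
# Scratch file holding the three names from the report (hypothetical path from mktemp).
tmp=$(mktemp)
printf '%s\n' 'Adriano José' 'Adriana da Silva' 'Adrian Kuerten' > "$tmp"

# C locale: plain byte comparison, so the space after "Adrian" sorts
# before the 'a' in "Adriana" and "Adrian Kuerten" comes first.
sorted_c=$(LC_ALL=C sort "$tmp")
printf '%s\n' "$sorted_c"

# Locale-aware order for comparison; needs the pt_BR locale to be generated,
# so it is left commented out here.
# LC_ALL=pt_BR sort "$tmp"

rm -f "$tmp"
```

Running the commented-out line on an affected system should show the order reported as strange, with "Adriana da Silva" ahead of "Adrian Kuerten".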