Markus Kuhn wrote:
> This property is in essence what makes the range notation in regular
> expressions useful, because by selecting a range of characters in some
> prefix of a string, you are guaranteed to select a consecutive sequence
> of strings in a sorted list. With multi-pass comparison algorithms like
> ISO, this property does not hold any more, and that is why many people
> (including myself) prefer to hold on to the old naive "C" strcmp()
> sorting order until we get a nicer single-pass sorting algorithm that is
> guaranteed to be a homomorphism over string concatenation.
Can't we have instead an alternative local available with single pass sorting
algorythm ?
Something like LC_COLLATE=en_GB.single_pass
where we would sort as :
ter
tes
t�r
t�s
tfs
instead of the multipass ISO sorting which would be :
ter
t�r
tes
t�s
tfs
-
Linux-UTF8: i18n of Linux on all levels
Archive: http://mail.nl.linux.org/lists/