On 2021-09-25 4:29 a.m., Smylers wrote:
So it appears that en_CA.utf8 and en_GB.utf8 sort ‘a’ and ‘A’ the
opposite way round to each other. I wonder why. (Not relevant to any bug
in Sort::Naturally, but it's now intriguing me.)
I tried en_US.utf8 and en_AU.utf8. They both follow en_GB.utf8.
Shawn H Corey writes:
> And my environment is:
>
> $ env|grep LC_|sort
> LC_ADDRESS=en_CA.UTF-8
> LC_IDENTIFICATION=en_CA.UTF-8
> LC_MEASUREMENT=en_CA.UTF-8
> LC_MONETARY=en_CA.UTF-8
> LC_NAME=en_CA.UTF-8
> LC_NUMERIC=en_CA.UTF-8
> LC_PAPER=en_CA.UTF-8
> LC_TELEPHONE=en_CA.UTF-8
>
Setting LC_ALL=C gives the Unicode sequence for Perl's sort but
Sort::Naturally still does not seem correct:
LC_ALL=C ./sort-test.pl
unsorted : 4 A X i 1 x 10 a B ä y z į C Ä b c Į Y Z än and
ÄND And Any ant Äm Äs
no locale, perl : 1 10 4 A And Any B C X Y Z a and ant b c
On 2021-09-25 3:01 a.m., Smylers wrote:
That does look odd. Which locale are you running this under, which
version of Perl, and which version of Sort::Naturally?
Also, when I first ran your script I initially got lots of:
Wide character in say at ./naturally line 23.
Adding this made Perl
Shawn H Corey writes:
> I was testing different sort routines and I think I stopped a bug in
> Sort::Naturally (see attached script). It's output is:
>
> unsorted : 4 A X i 1 x 10 a B ä y z į C Ä b c Į Y Z än and ÄND
> And Any ant Äm Äs
>
> no locale, perl : 1 10 4 A And Any