bug#23665: spaces in keys: doc, --debug in LC_ALL=C

2018-10-28 Thread Assaf Gordon
tags 23665 fixed close 23665 stop (triaging old bugs) On 2016-05-31 8:15 p.m., Assaf Gordon wrote: On May 31, 2016, at 20:54, Pádraig Brady wrote: On 01/06/16 01:38, Assaf Gordon wrote: 2. add a bit more verbose progress information to the 'sort-debug-warn.sh' test - just so it'll be

bug#23665: spaces in keys: doc, --debug in LC_ALL=C

2016-06-01 Thread Pádraig Brady
On 01/06/16 03:15, Assaf Gordon wrote: On May 31, 2016, at 20:54, Pádraig Brady wrote: On 01/06/16 01:38, Assaf Gordon wrote: 2. add a bit more verbose progress information to the 'sort-debug-warn.sh' test - just so it'll be easier to discuss to the changed messages.

bug#23665: spaces in keys: doc, --debug in LC_ALL=C

2016-05-31 Thread Assaf Gordon
Hello Karl and all, > On May 31, 2016, at 19:15, Karl Berry wrote: [...] > I'm not sure what you mean by [...]. The %lu? > Are you proposing to just add the word "sort"? That's not needed IMHO. I was suggesting exactly that :) Also, the word "key" appears in few other

bug#23665: spaces in keys: doc, --debug in LC_ALL=C

2016-05-31 Thread Paul Eggert
On 05/31/2016 04:15 PM, Karl Berry wrote: Are the collation rules for en_US.UTF-8 documented or even reasonably comprehensively described anywhere? Although I think they are taken from ISO/IEC 14651, I expect they've diverged from the standard by now, as a new version of the standard came

bug#23665: spaces in keys: doc, --debug in LC_ALL=C

2016-05-31 Thread Karl Berry
Just to verify, the surprising result is in C locale? Yes. as collating rules for UTF-8 make leading spaces less significant. Yes, which is a different problem, in itself. Let me ask this: Are the collation rules for en_US.UTF-8 documented or even reasonably comprehensively described

bug#23665: spaces in keys: doc, --debug in LC_ALL=C

2016-05-31 Thread Pádraig Brady
On 31/05/16 20:11, Assaf Gordon wrote: Hello Karl! On 05/31/2016 02:32 PM, Karl Berry wrote: I run LC_ALL=en_US.UTF-8 sort --debug -k 2 /tmp/foo # or -k 2,2 et al. And get the nicely explanatory output for the "surprising" result: [...] Just to verify, the surprising result is in C

bug#23665: spaces in keys: doc, --debug in LC_ALL=C

2016-05-31 Thread Assaf Gordon
Hello Karl! On 05/31/2016 02:32 PM, Karl Berry wrote: I run LC_ALL=en_US.UTF-8 sort --debug -k 2 /tmp/foo # or -k 2,2 et al. And get the nicely explanatory output for the "surprising" result: [...] Just to verify, the surprising result is in C locale? I'm seeing the following, for

bug#23665: spaces in keys: doc, --debug in LC_ALL=C

2016-05-31 Thread Karl Berry
Consider this three-line source file, say /tmp/foo: M Build/zfile M Master/mfile MM Build/afile There are two spaces after the M on the first two lines (and no trailing spaces on any line). I was trying to sort on the second "field". I run LC_ALL=en_US.UTF-8 sort --debug -k 2 /tmp/foo # or