Re: [Implemented] [coreutils] Partial UTF-8 support for "cut -c"

2019-08-12 Thread Eric Fischer
I will reopen the can of worms of again offering my own multibyte cut (and other coreutils) if the maintainers ever decide they want it: https://github.com/ericfischer/coreutils/blob/multibyte-squash/src/cut.c I think the normalization ambiguity here is resolved by the POSIX standard's

Re: [Implemented] [coreutils] Partial UTF-8 support for "cut -c"

2019-08-12 Thread Assaf Gordon
Hello, On Mon, Aug 12, 2019 at 09:19:54PM +0200, jaime.mosqu...@tutanota.com wrote: > I have partially implemented the option "-c" ("--characters") for UTF-8 > non-ASCII characters[...] First and foremost, Thank you for taking the time and effort to develop new features and send them to the

Re: for the next gnulib update

2019-08-12 Thread Assaf Gordon
On Mon, Aug 12, 2019 at 05:55:55PM +0200, Bernhard Voelker wrote: > On 8/12/19 5:50 AM, Assaf Gordon wrote: > > Updated patch (fixed typo in commit message). > > +1 thanks thanks, pushed here: https://git.savannah.gnu.org/cgit/coreutils.git/commit/?id=a3d070fa3269e89dfad49fde8ea30758afa36f4b

[Implemented] [coreutils] Partial UTF-8 support for "cut -c"

2019-08-12 Thread jaime.mosquera
Good evening. I have partially implemented the option "-c" ("--characters") for UTF-8 non-ASCII characters, so that using a text in any language other than English does not result in rather subtle bugs ("cut -c 1-79" produces 79 characters, except that lines with one accented letter are one

Re: for the next gnulib update

2019-08-12 Thread Bernhard Voelker
On 8/12/19 5:50 AM, Assaf Gordon wrote: Updated patch (fixed typo in commit message). +1 thanks