bug#29606: Command 'fold' dangerous with utf-8 input

2017-12-09 Thread Pádraig Brady
On 08/12/17 19:15, Assaf Gordon wrote: > Hello Mark, > > First, > thank you for taking the time and effort > to test our development snapshot, and reporting results back. > This kind of feedback is critical in getting multibyte support ready. > > > Second, > I can confirm the behavior you are

bug#29606: Command 'fold' dangerous with utf-8 input

2017-12-09 Thread Mark Roberts
Dear Assaf, I agree that once multibyte support is added to fold(1), the man pages, the help screen and texi manual must be updated to clearly indicate the "-b/--bytes" only applies to \b \t \r and never to multibyte characters. My suggestion for man-page: == Old: ---

bug#29606: Command 'fold' dangerous with utf-8 input

2017-12-08 Thread Assaf Gordon
Hello Mark, First, thank you for taking the time and effort to test our development snapshot, and reporting results back. This kind of feedback is critical in getting multibyte support ready. Second, I can confirm the behavior you are observing, reproduced here with 'od' for easier output: ##

bug#29606: Command 'fold' dangerous with utf-8 input

2017-12-08 Thread Mark Roberts
Dear Assaf, the reason for the unexpected behavior of 'fold', namely that specifying --bytes doesn't make it count bytes, is evident after a look at the source code. When --bytes is not specified, the program treats '\b', '\r' and '\t' specially. It assumes a tab width of eight

bug#29606: Command 'fold' dangerous with utf-8 input

2017-12-07 Thread Mark Roberts
Dear Assaf, If you'd like to help us test these patches, please try an unofficial development snapshot here: https://files.housegordon.org/src/coreutils-multibyte-experimental-8.28.39-79242.tar.xz I have taken a look and have an unexpected result: fold (version 8.28.39-79242) reacts to my

bug#29606: Command 'fold' dangerous with utf-8 input

2017-12-07 Thread Mark Roberts
Dear Assaf, thanks for the clarification. Yes, I did mean Debian 7. I didn't realise, quite how old my Debian was. I use it eight hours a day and it is stable. Considering you are using Debian 7 from 2013, and coreutils 8.13 from 2011, the fact it is 2017 is not very relevant. I hadn't

bug#29606: Command 'fold' dangerous with utf-8 input

2017-12-07 Thread Assaf Gordon
Hello, On 2017-12-07 03:10 AM, Mark Roberts wrote: I am using fold version 8.13 on a Debian 3.2.93-1 Do you mean Debian 7 (Wheezy) with Linux Kernel 3.2.93-1 ? cat filename | fold If 'filename' contains utf8 characters consisting of more than one byte, fold will consider breaking the

bug#29606: Command 'fold' dangerous with utf-8 input

2017-12-07 Thread Mark Roberts
Dear maintainers, I am using fold version 8.13 on a Debian 3.2.93-1 cat filename | fold If 'filename' contains utf8 characters consisting of more than one byte, fold will consider breaking the line inside such a character. There is no option to stop it doing that. Except, of course "-s":