Re: grep is horriby slow in UTF-8 locales

2003-11-08 Thread Danilo Šegan
[EMAIL PROTECTED] (Danilo Åegan) writes: > $ LC_ALL=en_GB.UTF-8 time grep2.5 XYZ test.txt > Command exited with non-zero status 1 > 0.05user 0.07system 0:00.12elapsed 100%CPU (0avgtext+0avgdata 0maxresident)k > 0inputs+0outputs (140major+45minor)pagefaults 0swaps Whoops, this above is total crap.

Re: grep is horriby slow in UTF-8 locales

2003-11-08 Thread Mika Fischer
Hi! * Markus Kuhn <[EMAIL PROTECTED]> [2003-11-07 16:33]: > It seems grep performs about 100x worse in a UTF-8 locale than in and > ASCII locale, even where the search strring contains no regex > metacharacters. Same here on Debian with grep 2.5.1 and libc 2.3.2. > There is technically no reason

Re: grep is horriby slow in UTF-8 locales

2003-11-08 Thread Glenn Maynard
On Fri, Nov 07, 2003 at 04:49:58PM +0100, Danilo Segan wrote: > This doesn't happen with: > > $ grep --version > grep (GNU grep) 2.4.2 This was probably before full multibyte support was added to grep; the issue here specifically only happens in multibyte encodings. (My grep is slow in en_US.UTF

Re: grep is horriby slow in UTF-8 locales

2003-11-08 Thread Danilo Šegan
Hi Markus, Markus Kuhn <[EMAIL PROTECTED]> writes: > Rob Park wrote on 2003-11-08 00:49 UTC: >> grep is slower on my system, but it doesn't appear to be as bad as on >> your system. > > Your results show that grep in UTF-8 mode is equally 100x slower than in > single-byte mode, just like on my sy

Re: grep is horriby slow in UTF-8 locales

2003-11-08 Thread Rob Park
Markus Kuhn wrote: Your results show that grep in UTF-8 mode is equally 100x slower than in single-byte mode, just like on my system (300 MHz P3). You just have used a faster CPU. D'oh :) -- Linux-UTF8: i18n of Linux on all levels Archive: http://mail.nl.linux.org/linux-utf8/

Re: grep is horriby slow in UTF-8 locales

2003-11-08 Thread Markus Kuhn
Rob Park wrote on 2003-11-08 00:49 UTC: > grep is slower on my system, but it doesn't appear to be as bad as on > your system. Your results show that grep in UTF-8 mode is equally 100x slower than in single-byte mode, just like on my system (300 MHz P3). You just have used a faster CPU. Markus

Re: grep is horriby slow in UTF-8 locales

2003-11-08 Thread Danilo Segan
Markus Kuhn <[EMAIL PROTECTED]> writes: > $ grep --version > grep (GNU grep) 2.5.1 This doesn't happen with: $ grep --version grep (GNU grep) 2.4.2 $ LC_ALL=POSIX time grep XYZ test.txt Command exited with non-zero status 1 0.03user 0.07system 0:00.36elapsed 27%CPU (0avgtext+0avgdata 0maxreside

Re: grep is horriby slow in UTF-8 locales

2003-11-08 Thread Glenn Maynard
On Fri, Nov 07, 2003 at 12:52:44PM +, Markus Kuhn wrote: > $ grep --version > grep (GNU grep) 2.5.1 > $ LC_ALL=en_GB.UTF-8 time grep XYZ test.txt > Command exited with non-zero status 1 > 6.83user 0.07system 0:06.93elapsed 99%CPU (0avgtext+0avgdata 0maxresident)k > 0inputs+0outputs (157major+34