On 9/18/07, Uriel <[EMAIL PROTECTED]> wrote:
> Don't complain, at least it is not producing random behaviour, I have
> seen versions of gnu awk that when fed plain ASCII input, if the
> locale was UTF-8, rules would match random lines of input, the fix?
> set the locale to 'C' at the top of all your scripts (and don't even
> think of dealing with files which actually contain non-ASCII UTF-8).
>
> This was some years ago, it might be fixed by now, but it demonstrates
> how the locale insanity makes life so much more fun.

Heh, funny that this thread got revived the very day that my colleague's
backup script choked because he was running in a UTF-8 locale and hit a
filename encoded in ISO-8859-1. Apparently GNU sed's . stops matching
when it hits an invalid byte sequence (which is not entirely
unreasonable, I guess).
-sqweek
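
A minimal Python sketch of why such a filename counts as invalid in a UTF-8 locale, assuming a hypothetical name like café.txt stored as ISO-8859-1. The helper names are made up for illustration, and this shows only the byte-level problem, not how GNU sed handles it internally.

```python
# Hypothetical filename: "café.txt" encoded as ISO-8859-1,
# where é is the single byte 0xE9.
latin1_name = "café.txt".encode("iso-8859-1")
utf8_name = "café.txt".encode("utf-8")


def is_valid_utf8(raw: bytes) -> bool:
    """Return True if raw decodes cleanly as UTF-8."""
    try:
        raw.decode("utf-8")
        return True
    except UnicodeDecodeError:
        return False


def valid_prefix_len(raw: bytes) -> int:
    """Return how many leading bytes decode as UTF-8 before the first error."""
    try:
        raw.decode("utf-8")
        return len(raw)
    except UnicodeDecodeError as err:
        # err.start is the offset of the first offending byte.
        return err.start


if __name__ == "__main__":
    print(is_valid_utf8(utf8_name))
    print(is_valid_utf8(latin1_name))
    print(valid_prefix_len(latin1_name))
```

In UTF-8, 0xE9 announces a three-byte character, but the "." that follows is not a continuation byte, so the sequence is malformed from that point on. A tool that matches character by character has nothing sensible to match there, which fits the behaviour described above.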
