I was thinking about the annoying BOM-like sequence that Windows 2000's and XP's Notepads are putting at the beginning of UTF-8 files. The byte sequence "EF BB BF" that's invalid as a header/signature in Unix UTF-8.
Shouldn't 'dos2unix' be patched to also remove this sequence? roozbeh -- Linux-UTF8: i18n of Linux on all levels Archive: http://mail.nl.linux.org/linux-utf8/