On Mon, 17 Feb 2003, Michael Everson wrote:

> X browsers, and the keepers of that home page should delete the first
> character before the HTML begins right away. I am cc:ing the keepers

  I agree that they should.

  Incidentally, it just occurred to me that  ftp/ssh clients may offer an
user-configurable option for the  automatic removal of  'UTF-8 BOM' at
the beginning of a text file in UTF-8 when moving files from Windows to
non-Windows platforms (Unix/Unix-like OS and MacOS). The same is true
of Kermit (Frank, are you here?). All those tools can be configured
to translate between three (and nowadays even more?) EOL conventions,
CF/LF/CR,LF for text files. Then, the automatic removal(and addition if
that's regarded as necessary) of UTF-8 BOM at platform boundaries
would be as useful.

   As for web servers, a configurable option can be added to remove
UTF-8 BOM at the beginning of text/* files(they serve). For instance,
I think it's not hard to write a simple module for Apache(used at Unicode.org
web site) to do that.

  With these additional guards, UTF-8 BOM added by some editors
without users' notice would be made less of problem.

   Jungshik


Reply via email to