> For UTF-16 with signature, I agree, but UTF-8 could sometimes match a
> Latin-1 file.

I would like to stress that. Latin-1 (ISO-8859-1) is a superset of US-ASCII: its first 128 characters are the same as US-ASCII. UTF-8, although a variable-length encoding, is likewise designed to match US-ASCII in its first 128 characters (Unicode range U+0000 to U+007F), which are encoded as single bytes. So every US-ASCII file out there is a valid UTF-8 file; this is one of the design features of UTF-8. And, for historical reasons, every US-ASCII file is also a valid Latin-1 file. A pure US-ASCII file is therefore byte-for-byte identical in both encodings, and its contents alone cannot tell you which one was meant.

-- 
Regards,
Калоян Доганов, Сдружение "Свободен софтуер" (Free Software Association)
___________________________________________________________
If I do not reply to your messages: http://6lyokavitza.org/mail

_______________________________________________
Emacs-devel mailing list
Emacs-devel@gnu.org
http://lists.gnu.org/mailman/listinfo/emacs-devel