> For UTF-16 with signature, I agree, but UTF-8 could sometimes match a
> Latin-1 file.

I would like to stress that. Latin-1 (ISO-8859-1) is a superset of
US-ASCII. The first 128 characters are basically US-ASCII. On the other
hand, although UTF-8 is a variable length encoding, it is designed to
match US-ASCII in it's first 128 characters (Unicode range U+0000 to
U+007F). These characters are encoded as single bytes in UTF-8.

So, every single US-ASCII file out there is a valid UTF-8 file. This is
one of the features of UTF-8.

And for historical reasons, every US-ASCII file is a valid Latin-1 file.


-- 
Поздрави,
Калоян Доганов,
Сдружение "Свободен софтуер".
___________________________________________________________
Ако не отговарям на писмата Ви: http://6lyokavitza.org/mail


_______________________________________________
Emacs-devel mailing list
Emacs-devel@gnu.org
http://lists.gnu.org/mailman/listinfo/emacs-devel

Reply via email to