> > so a single text file can be interpreted > > as UTF-7, UTF-8, UTF-16, UTF-32, etc. if there's nothing to declare the > > exact character encoding used.
The whole point of defining UTF-8 this way has been to replace ASCII transparently. So if character sets need marks to identify them, the only one that should not need a mark and should be the default is UTF-8. --behdad behdad.org _______________________________________________ PersianComputing mailing list [EMAIL PROTECTED] http://lists.sharif.edu/mailman/listinfo/persiancomputing