Hi Julius, > Sorry to muddy the waters. It would be really nice if we could assume > UTF-8 > for everything. HTML entities are not very readable except when using a > browser!
Agreed. Up until now, Odi's last name was the only case for non-ASCII characters though :-) > As part of the https hostname verification I've been working on, I've been > testing with some UTF-8 in the hostname. I would really prefer to just > write "花子.co.jp" directly in the javadocs instead of "花& > #x5b50;.co.jp". Isn't that hostname against all specs? I heard of something called punycode for international domain names ;-) (The second kanji means "child", right?) Even if you write that in the source code, that doesn't mean everybody has a font installed to display it. Little squares indicating "undisplayable character" are even less readable than HTML character references. > Can we assume UTF-8 for everything, but try to stick to the 0-127 range of > UTF-8 if at all possible (to be polite)? I'd like to hear some more opinions on this. Does anybody know how well Subversion handles UTF-8 text files? No automatic conversion to local codepages on checkout or other unexpected surprises? Everybody has a UTF-8 compatible editor that will not silently convert to a different encoding? I'm sure I can figure out how to use Emacs for that, in due time. cheers, Roland --------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
