Re: [fpc-devel] Unicode support (again)

Michael Schnell Tue, 11 Nov 2008 04:16:06 -0800

Because e.g. on the ext3 file system, you can have two files with the name "ü" in the same directory. One named using the single character "ü" and one named using the string "u¨" (both in UTF-8). If you make the compiler automatically normalise everything, you lose information (and get the security holes etc).

I see, but as this is not handled decently with good old ANSIStrings anyway, there is no "friendly old school" way that a compiler would be able to offer. In these special cases, the user of course needs to explicitly handle the upgrade of his project to Unicode.

OTOH, in this special case, I don't see why the compiler should "normalize" "u¨" to "ü". If the software is supposed to be handling Unicode, the Unicode string "u¨" should be considered a perfectly legal two-code-point sequence consisting of a "u" (a single byte in UTF-8) and a double-dot (the combining diaeresis U+0308, two bytes in UTF-8). If the user wants to handle this as a single "ü", he should write appropriate code for that. Any automation of that is dangerous.
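
To make the distinction concrete, here is a minimal sketch. It is in Python rather than Pascal, purely for illustration, and the variable names are made up for this example. It shows that the two spellings of "ü" are different strings with different UTF-8 encodings, and that they only become equal after an explicit normalisation step that the programmer has to ask for.

```python
import unicodedata

# "ü" as one precomposed code point (U+00FC)
precomposed = "\u00fc"
# "u" followed by COMBINING DIAERESIS (U+0308)
decomposed = "u\u0308"

# Without normalisation these are two distinct strings.
print(precomposed == decomposed)

# One code point versus two code points.
print(len(precomposed), len(decomposed))

# Their UTF-8 encodings differ: 2 bytes versus 1 + 2 bytes.
print(precomposed.encode("utf-8"))
print(decomposed.encode("utf-8"))

# Only an explicit, user-requested normalisation (here NFC) merges them.
normalised = unicodedata.normalize("NFC", decomposed)
print(normalised == precomposed)
```

If a language runtime applied the last step silently, the two distinct file names from the ext3 example could no longer be told apart.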


-Michael
_______________________________________________
fpc-devel maillist  -  fpc-devel@lists.freepascal.org
http://lists.freepascal.org/mailman/listinfo/fpc-devel
