Re: [fpc-devel] Unicode support in RTL - Roadmap

Martin Friebe Fri, 21 Nov 2008 10:19:55 -0800

Felipe Monteiro de Carvalho wrote:

On Fri, Nov 21, 2008 at 2:42 PM, Michael Schnell <[EMAIL PROTECTED]> wrote:

And thus forces all users to "understand the full UTF-8 spec" and to rewrite
their programs, even though the old code perfectly compiles and up to a
certain extent seems to work.


This is what I think is "not at all desirable" :( .

Your comments are absolutely vague and meaningless. Not to mention
they also don't propose an alternative.

Sorry to be blunt, but so were your comments.

I must agree with the "FPC cannot do it all automatically" line (as much
as I regret it, and admit the beauty there would be if FPC could).


What I mean is:

1) Any application/program that currently compiles and works (using
non-UTF-8 strings, never mind whether ASCII or ANSI) will keep working
if compiled in *non*-UTF-8 mode.

2) If such a program is to be extended to support UTF-8, then decisions
are needed that cannot be made without knowledge of what the program is
doing, or even of the context within the same program in which an
operation takes place.
Such knowledge is only available to the programmer of the application,
therefore the application must be changed to include these decisions.
FPC simply cannot make them. (And even a {$SWITCH} would not solve the
issue.)


Example is the composed and decomposed "ü":

- If you edit a text (human-readable text), or search in a text, you
certainly do want to handle both representations as equal (a Find
dialog must find both).
- If the same text editor saves the file, it must handle them as not
equal. Assume the user has 2 files "wünsche.txt" in the same folder.
The filesystem allows this, because one of them is decomposed and one is
composed. If the user had opened a text from the composed version, it
should be written back to the composed version. If the user had opened
it from the decomposed version, it must be written back to the
decomposed version. Otherwise a completely unrelated file would simply
be overwritten, and the contents lost. (The same applies if the
application iterates through the directory content and compares file
names. So here the same comparison that would be used by the "Find
dialog" must behave differently.)

FPC simply cannot know whether a string contains a file name, which must
be kept exactly as it is, or some human-readable text, which would
benefit from a "normalisation".
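
To make the two cases concrete, here is a minimal sketch. It is written
in Python only because its standard library ships Unicode normalisation;
it is not FPC code, and the function names same_text and same_file_name
are made up for this illustration. It assumes NFC is an acceptable
normal form for the "Find dialog" case.

```python
import unicodedata

# Two spellings of the same visible file name.
composed = "w\u00fcnsche.txt"      # "ü" as one code point (U+00FC)
decomposed = "wu\u0308nsche.txt"   # "u" followed by combining diaeresis (U+0308)


def same_text(a: str, b: str) -> bool:
    """Comparison for human-readable text: normalise both sides first."""
    return unicodedata.normalize("NFC", a) == unicodedata.normalize("NFC", b)


def same_file_name(a: str, b: str) -> bool:
    """Comparison for file names: compare the code points exactly as stored."""
    return a == b


print(same_text(composed, decomposed))       # True
print(same_file_name(composed, decomposed))  # False
```

Both functions receive the same kind of string; only the caller knows
which of the two comparisons is the right one.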

If you are going to put a compiler switch in front of each statement to
indicate the needs, you may as well change the statements. There is no
one setting for the whole application, as both of the above examples
occur within a single application.

You could use two different UTF8Strings which behave differently on
decomposed chars (I am *not* proposing this as a solution). But then you
cannot just recompile your app by saying "string" now means UTF8String
throughout the whole application. You again have to go through all of
the source code and edit the app. So you may as well just go through the
source code, and add the appropriate UTF-8 clean-up calls to those parts
of the code that need them.
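
Purely to illustrate why two string types do not save the work (again
Python, not FPC, and again not a proposal): the hypothetical wrapper
types FileName and HumanText below differ only in how they compare, yet
somebody still has to decide, for every variable in the program, which
of the two it should be.

```python
import unicodedata
from dataclasses import dataclass


@dataclass(frozen=True)
class FileName:
    """Hypothetical type: compared exactly as stored (generated __eq__)."""
    raw: str


@dataclass(frozen=True, eq=False)
class HumanText:
    """Hypothetical type: compared after NFC normalisation."""
    raw: str

    def _normalised(self) -> str:
        return unicodedata.normalize("NFC", self.raw)

    def __eq__(self, other: object) -> bool:
        if not isinstance(other, HumanText):
            return NotImplemented
        return self._normalised() == other._normalised()

    def __hash__(self) -> int:
        # Keep hashing consistent with the normalising __eq__.
        return hash(self._normalised())


composed = "w\u00fcnsche.txt"
decomposed = "wu\u0308nsche.txt"

print(FileName(composed) == FileName(decomposed))    # False
print(HumanText(composed) == HumanText(decomposed))  # True
```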

In the end, switching an application to Unicode means that within the
same app different parts are going to need different handling of Unicode
(where no such difference existed for ASCII/ANSI). And no compiler can
figure out which part will need which behaviour.



_______________________________________________
fpc-devel maillist  -  fpc-devel@lists.freepascal.org
http://lists.freepascal.org/mailman/listinfo/fpc-devel
