The problem is simple, but here is the background:

I'm writing an interactive script that needs to work on both Mac and Windows running in Terminal on the Mac and in the "Perl Command Line Interpreter" (ActivePerl 5.10) on Windows.

The script and the text files it reads from are all utf-8 encoded and display properly on both platforms. At the moment I am using Windows 2000 for testing, since that's all I have here, though I have another machine with XP at another location.

Now for the problem:

If the script detects that it is running in Windows then I use Encode::from_to to convert the utf-8 to cp1252 for display in the "Perl Command Line Interpreter" console and for output. Output is fine, but non-ascii characters in the console window are wrongly displayed -- for example:

        Citroën is displayed as CitroÙn
        Microsoft® as Microsoft«
        - as ù

and yet when these incorrectly displayed characters are printed to the output file they are correctly printed, so the problem seems to be merely the "Interpreter's" interpretation of how to display the characters.

Does anybody know what charset this frightful console is using?

JD

Reply via email to