> The file is already in UTF-8, otherwise it wouldn't display properly
> in Firefox or IE.  The problem is either your display or perl doesn't
> know that the file is in UTF-8.
>
> The first step is make sure Perl knows it is working with UTF-8.  Add
>
> export PERL_UNICODE=SDL
>
> to your .profile, .bashrc, or whatever you use for your profile.
> Logout and log back in.  This tells perl to use UTF-8 for STDIN,
> STDOUT, and STDERR (the S), input and output streams (the D), and all
> of it dependent on locale (the L).   The next thing to check is the
> value in your LANG environment variable.  It should be something like
> en_US.UTF-8.  If you are still having problems check to see if your
> terminal is expecting something other than UTF-8 (this is highly
> dependent on the terminal, so you will need to tell us what terminal
> you are using).

Thank you, this was very helpful.  I added those lines to my .bashrc and
set my terminal to UTF-8, and my text editor to UTF-8, and I see those
characters display correctly now.  I even used my editor to save the CSV
encoded as Windows-1250 and got it to open and display correctly in
Microsoft Excel on a Windows box.

Incidentally, I still get a warning with -w:

  Parsing of undecoded UTF-8 will give garbage when decoding entities at
  /usr/share/perl5/LWP/Protocol.pm line 114.

I am using Perl 5.8.8 on linux.  I might be doing something wrong, or
perhaps upgrading to Perl 5.10.0, which I plan to do in the next week
anyway, will fix this. I will look into it further.

Thanks again,

John

Reply via email to