On 1/12/08, Neil Gunton <[EMAIL PROTECTED]> wrote:
>
> I am somewhat experienced with Perl in general, but absolutely no
> experience dealing with UTF-8. I have a community journals website which
> allows updates from users via email. I'm having trouble with emails that
> contain Chinese characters encoded (I think) as UTF-8. The strings look
> like this:
>
> =?UTF-8?B?5qGQ5LmhLCBUb25neGlhbmc6IEJlaW5nIGEgJ2hhbg==?= =?UTF-8?B?dHUn?=
>
> When I read this text from a file, using my perl script, and then save
> it into MySQL, it comes out on the website looking literally like the
> above. I can't seem to get perl to "do" anything with it in terms of
> conversions to a format that looks like chinese characters when
> displayed on the Web.

  use Encode;
  use Encode::MIME::Header;
  decode("MIME-Header", $bytes);

to get the Unicode strings for these MIME encoded characters.

> Does anybody have any clues as to how to convert strings like this into
> something more usable - e.g. HTML character entities?

If you want to turn them into HTML entities, you can say:

  encode("ascii", decode("MIME-Header", $bytes), Encode::FB_HTMLCREF);

HTH

-- 
Tatsuhiko Miyagawa

Reply via email to