Tom Allison schreef:

> I have a string:
> =?iso-2022-jp?B?Rlc6IBskQjxkJDckNSRHJE8kSiQvJEYzWiQ3JF8kPyQkGyhC?=
> That is a MIME::Base64 encoded string of iso-2022-jp characters.
>
> After I decode_base64 them and decode($text,'iso-2022-jp',utf8') them
> I can print out something that looks exactly like japanese characters.
>
> But you can't match /(\w+) on them.  It's apparently one "word"
> without spaces in it.

http://www.patentstorm.us/patents/5337233-description.html
(look for JLE)

So maybe if you convert to EUC, than insert spaces as the text suggests,
than convert back to utf8, you might have a "better" string to work
with.

-- 
Affijn, Ruud

"Gewoon is een tijger."


-- 
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]
http://learn.perl.org/


Reply via email to