Ok I think I've got this figured out. utf8::decode() does what u want. The bytes in @bytes represent the constituent octets in "tesuto" in Kana. Using utf::decode successfully turned the 9 bytes into 3 characters. Let me know if this gets what u need.
@bytes = split /\|/, "e3|83|86|e3|82|b9|e3|83|88"; @bytes = map {chr hex $_} @bytes; print scalar @bytes, "\n"; $text = join "", @bytes; print length $text, "\n$text\n"; utf8::decode($text) or die; print length $text, "\n$text\n"; ^D 9 9 pâåpé¦pâê Wide character in print at - line 7. 3 pâåpé¦pâê -- REMEMBER THE WORLD TRADE CENTER ---=< WTC 911 >=-- "...ne cede malis" 00000100 _______________________________________________ Perl-Win32-Users mailing list Perl-Win32-Users@listserv.ActiveState.com To unsubscribe: http://listserv.ActiveState.com/mailman/mysubs