Re: Convert Codepage 932 (Japanese) to Unicode

Ken Krugler Sun, 12 Jan 2003 20:08:28 -0800

Be aware.....

Once CP932, always CP932...even in UTF-8.
The UTF8 bytecode is a representation of the character in CP932.

UTF-8 is an encoding for Unicode code points. So I've got no idea what you mean by the above. In order to convert from CP932 (or any non-Unicode encoding) into UTF-8 a program first has to convert characters into a sequence of Unicode code points, which then get encoded using the UTF-8 transformation.

Don't expect that you can go from CP932 to UTF8 to EUC_JP.  Ain't goin'
to happen.

Again, I don't understand what you're talking about here. Assuming the conversion from CP932 to UTF-8 was done properly, then converting from UTF-8 to EUC_JP is the reverse of above - UTF-8 byte sequences are converted into a stream of Unicode code points, which then need to be mapped and encoded for EUC_JP.

Check out a program called JCode.  It might help you.  I'm running
into the problem where Linux/Unix is not totally CP932 capable.  I
represent everything as UTF-8...So, all's ok.  But, I can't expect
to convert UTF-8 to CP932 on Unix.  That ain't going to work either.


The PALM handheld is EUC_JP (I'm pretty posstive of that).

It's CP932 (+/- a few characters).

When I enter
japanse on the handheld, sometimes Windows has a problem because the
character is not in the default CP932 character set (get's converted to ?'s).

Which characters?

-- Ken
--
Ken Krugler
<http://www.krugler.org>
+1 530-265-2225

--
For information on using the Palm Developer Forums, or to unsubscribe, please see http://www.palmos.com/dev/support/forums/

Re: Convert Codepage 932 (Japanese) to Unicode

Reply via email to