Be aware.....
Once CP932, always CP932...even in UTF-8.
The UTF8 bytecode is a representation of the character in CP932.
UTF-8 is an encoding for Unicode code points. So I've got no idea
what you mean by the above. In order to convert from CP932 (or any
non-Unicode encoding) into UTF-8 a program first has to convert
characters into a sequence of Unicode code points, which then get
encoded using the UTF-8 transformation.
Don't expect that you can go from CP932 to UTF8 to EUC_JP. Ain't goin'
to happen.
Again, I don't understand what you're talking about here. Assuming
the conversion from CP932 to UTF-8 was done properly, then converting
from UTF-8 to EUC_JP is the reverse of above - UTF-8 byte sequences
are converted into a stream of Unicode code points, which then need
to be mapped and encoded for EUC_JP.
Check out a program called JCode. It might help you. I'm running
into the problem where Linux/Unix is not totally CP932 capable. I
represent everything as UTF-8...So, all's ok. But, I can't expect
to convert UTF-8 to CP932 on Unix. That ain't going to work either.
The PALM handheld is EUC_JP (I'm pretty posstive of that).
It's CP932 (+/- a few characters).
When I enter
japanse on the handheld, sometimes Windows has a problem because the
character is not in the default CP932 character set (get's converted to ?'s).
Which characters?
-- Ken
--
Ken Krugler
<http://www.krugler.org>
+1 530-265-2225
--
For information on using the Palm Developer Forums, or to unsubscribe, please see http://www.palmos.com/dev/support/forums/