On Wed, 9 Sep 2009, Som Satpathy wrote:
"The microsoft file formats generally store text as either US-ASCII or
UCS-2. The type of the record/block/etc tells you which it is, so we can
turn that into java (unicode) strings"

Thanks for the input Nick. But one thing is still not clear, can I encode the text as UTF_8?

No. You need to ensure that by the time you pass the string to POI, it's a valid java (unicode) string, and you've done any utf8 decoding as required when reading the string in. When getting data out of poi, it'll come out as a java unicode string. It's up to you to turn that into utf8 if that's what you want to output. I'd suggest you go read a tutorial on working with unicode and native character sets in java to clear up your confusions.

Nick

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to