Re: Query regarding encoding used internally by apache POI libraries

Nick Burch Tue, 08 Sep 2009 02:49:08 -0700

On Tue, 8 Sep 2009, Som Satpathy wrote:

Does Apache POI follow any particular encoding internally while extracting MS Office documents? If so, what is the encoding that POI uses?

POI is written in Java, so it uses native Java strings almost everywhere. These are Unicode.

The Microsoft file formats generally store text as either US-ASCII or UCS-2. The type of the record/block/etc. tells you which it is, so we can turn that into Java (Unicode) strings.
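
As a rough illustration of that last step, here is a minimal sketch of turning raw bytes into a Java string once you know which of the two storage forms you have. This is not POI's actual code: the class RecordTextDecoder, the decode method, and the is16Bit flag are all hypothetical names made up for the example. It also assumes the 16-bit text is little-endian and uses Java's UTF-16LE charset to read it, which covers everything UCS-2 can represent.

```java
import java.nio.charset.StandardCharsets;

public class RecordTextDecoder {
    // Hypothetical helper, not part of the POI API.
    // is16Bit stands in for whatever the record/block type tells you
    // about how the text is stored.
    static String decode(byte[] raw, boolean is16Bit) {
        return is16Bit
            // Assumption: 16-bit text is little-endian; UTF-16LE is used
            // here as a stand-in for UCS-2.
            ? new String(raw, StandardCharsets.UTF_16LE)
            : new String(raw, StandardCharsets.US_ASCII);
    }

    public static void main(String[] args) {
        // The same three characters, stored one byte each...
        byte[] narrow = {0x50, 0x4F, 0x49};
        // ...and stored as two bytes each, low byte first.
        byte[] wide = {0x50, 0x00, 0x4F, 0x00, 0x49, 0x00};

        System.out.println(decode(narrow, false));
        System.out.println(decode(wide, true));
    }
}
```

Either way the caller ends up with an ordinary Java String, so code further up does not need to care how the text was stored in the file.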


Nick

