DO NOT REPLY TO THIS EMAIL, BUT PLEASE POST YOUR BUG 
RELATED COMMENTS THROUGH THE WEB INTERFACE AVAILABLE AT
<http://nagoya.apache.org/bugzilla/show_bug.cgi?id=16992>.
ANY REPLY MADE TO THIS MESSAGE WILL NOT BE COLLECTED AND 
INSERTED IN THE BUG DATABASE.

http://nagoya.apache.org/bugzilla/show_bug.cgi?id=16992

Unrecognised character "euro"





------- Additional Comments From [EMAIL PROTECTED]  2003-02-14 19:45 -------
The likely cause for this is an attempt to use the Windows-1252 (Codepage-1252) euro 
character 
(0x80) instead of the the Unicode euro (0x20AC). Windows-1252 is an extension of 
ISO-8859-1 (Latin-
1) that added 27 symbols not found in Latin-1 (but found in the full Unicode character 
set) 

The 
org.apache.poi.util.StringUtil class specifies an encoding ("ISO-8859-1") that does 
not 
recognize the 27 extension symbols, and maps them on to "?". It would if it specified 
the encoding 
"Cp1252" instead.

I have a tested fix for this that works with all standard Unicode as well as 
the Cp1252 extensions, but haven't submitted it to Poi yet.

Carey Sublette

---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Reply via email to