Well, I've finally had a chance to look at this. Here's what happened: FWIW I'm using FOP 0.20.3 (the latest binary release).
If I try an encoding of 'cp1252' or 'windows-1252' in the xml file I get an error saying that the encoding is not supported. Not happy with this I added the latest xerces.jar (1.4.4) to CLASSPATH and it worked with an encoding of 'windows-1252'. The same happened when I added xercesImpl.jar (Xerces-2.0.0) to CLASSPATH. Thank you Rainer, once again, for your help with this problem. I've copied this to the dev list so a developer can pick this up and change whatever it is they need to change if they so desire. Cheers Tim -----Original Message----- From: Keen Tim Sent: Tuesday, 5 February 2002 9:25 To: [EMAIL PROTECTED] Subject: RE: Displaying characters in PDF Rainer, I think you hit the nail on the head. Initially the character encoding of the source was not supported by the xml-parser of fop. This led me to use iso-8859-1 just to get it working. As you suggest this is probably not the appropriate encoding. As you guessed the most likely source of this data is a windows editor, so on your sound advice I looked at those web sites and will try 'cp1252' and 'windows-1252'. Thanks, once again, for your help. If I have any more problems I will certainly let you (and others) know. Cheers Tim -----Original Message----- From: Rainer Garus [mailto:[EMAIL PROTECTED] Sent: Tuesday, 5 February 2002 7:07 To: [EMAIL PROTECTED] Subject: RE: Displaying characters in PDF > I'm fairly dumb when it comes to fonts. My problem is that I don't > control the source of the character and I would expect that in most > cases the characters that are a problem are in ANSI set from 127-255. > From what you're saying I'll have to pre-parse the string and replace it > with the matching character in Adobe. Is that right? Is there another > way? If the character encoding of the source is supported by the xml-parser of fop then you have to do nothing. The xml-parser maps the characters to the correct unicode values. Fop use the xerces parser, see http://xml.apache.org/xerces-j/faq-general.html#faq-8 for the supported encodings. What character encoding do you use? It seems to be an extended ascii encoding which contains printable characters in the interval 128 .. 159, so it is not iso-8895-1. You can try cp1252 (latin 1 windows) (see http://czyborra.com/charsets/iso8859.html). Rainer Garus ************************************************************************ The information in this e-mail together with any attachments is intended only for the person or entity to which it is addressed and may contain confidential and/or privileged material. Any form of review, disclosure, modification, distribution and/or publication of this e-mail message is prohibited. If you have received this message in error, you are asked to inform the sender as quickly as possible and delete this message and any copies of this message from your computer and/or your computer system network. ************************************************************************ ************************************************************************ The information in this e-mail together with any attachments is intended only for the person or entity to which it is addressed and may contain confidential and/or privileged material. Any form of review, disclosure, modification, distribution and/or publication of this e-mail message is prohibited. If you have received this message in error, you are asked to inform the sender as quickly as possible and delete this message and any copies of this message from your computer and/or your computer system network. ************************************************************************
