False alarm. JPedal and kpdf, for example, have no problems reconstructing the correct text from the embedded Encoding. PDFBox manages it too, although it had problems extracting text written in a TrueType font. Adobe Acrobat Reader, however, does have a problem: text written in a Cyrillic Type 1 font is extracted incorrectly, and not even adding a ToUnicode CMap helped. It's probably a bug. I have no other idea how I could help Acrobat extract the text correctly. So, if you care about copy/paste from Acrobat, switch to TrueType fonts instead of using Type 1 fonts with encodings other than AdobeStandardEncoding or WinAnsiEncoding.
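For anyone who wants to repeat the experiment, here is a rough sketch of what a ToUnicode CMap for a single-byte font encoding looks like. It is not the FOP code, just an illustration in Python. The function name `build_tounicode_cmap` and the sample mapping (0xC0 and 0xC1 mapped to U+0410 and U+0411, as in Windows-1251) are made up for the example, and it assumes all target characters are in the Basic Multilingual Plane.

```python
def build_tounicode_cmap(code_to_unicode):
    """Return ToUnicode CMap text for a single-byte font encoding.

    code_to_unicode maps character codes (0..255) to Unicode code
    points. Assumption for this sketch: BMP code points only.
    """
    entries = sorted(code_to_unicode.items())
    for code, cp in entries:
        if not 0 <= code <= 0xFF:
            raise ValueError("character code out of single-byte range: %r" % code)
        if not 0 <= cp <= 0xFFFF or 0xD800 <= cp <= 0xDFFF:
            raise ValueError("code point not supported by this sketch: %r" % cp)

    lines = [
        "/CIDInit /ProcSet findresource begin",
        "12 dict begin",
        "begincmap",
        "/CIDSystemInfo << /Registry (Adobe) /Ordering (UCS) /Supplement 0 >> def",
        "/CMapName /Adobe-Identity-UCS def",
        "/CMapType 2 def",
        "1 begincodespacerange",
        "<00> <FF>",  # one-byte codes, matching a simple (non-CID) font
        "endcodespacerange",
    ]

    # Keep each bfchar block to at most 100 entries, as CMap files usually do.
    for start in range(0, len(entries), 100):
        chunk = entries[start:start + 100]
        lines.append("%d beginbfchar" % len(chunk))
        for code, cp in chunk:
            lines.append("<%02X> <%04X>" % (code, cp))
        lines.append("endbfchar")

    lines += [
        "endcmap",
        "CMapName currentdict /CMap defineresource pop",
        "end",
        "end",
    ]
    return "\n".join(lines) + "\n"


# Hypothetical sample: two Cyrillic capitals at the Windows-1251 positions.
sample_cmap = build_tounicode_cmap({0xC0: 0x0410, 0xC1: 0x0411})
```

The resulting text would go into a stream object that the font dictionary references through its /ToUnicode entry. As noted above, this did not change Acrobat's behaviour in my test, so treat it as a way to satisfy the spec rather than a fix.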
On 13.02.2008 12:12:59 Jeremias Maerki wrote:
<snip/>
> BTW, I just found out that I have to generate a ToUnicode CMap if a
> Type1 font doesn't use one of the encodings that are predefined in the
> PDF spec. So a little more work for me there.
<snip/>

Jeremias Maerki