False alarm. JPedal and kpdf, for example, have no problems
reconstructing the correct text based on the embedded Encoding. PDFBox
manages that, too, although it had problems extracting text written in a
TrueType font. Adobe Acrobat Reader, however, does have a problem: text
written in a Cyrillic Type 1 font is extracted incorrectly, and not even
adding a ToUnicode CMap helped. That is probably a bug; I have no other
idea how I could help Acrobat extract the text correctly. So, if you
care about copy/paste from Acrobat, switch to TrueType fonts instead of
using fonts with encodings other than AdobeStandardEncoding or
WinAnsiEncoding.
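
For reference, the kind of ToUnicode CMap meant above can be generated
roughly as follows. This is only a Python sketch, not the code used in
the PDF library: the function name build_to_unicode_cmap is made up for
illustration, and Windows-1251 merely stands in for the Cyrillic
encoding of the Type 1 font, which is not named in this thread.

```python
def build_to_unicode_cmap(code_to_text):
    """Return the text of a ToUnicode CMap for a single-byte font encoding.

    code_to_text maps a character code (0..255) to the Unicode string it
    represents. Hypothetical helper, for illustration only.
    """
    for code in code_to_text:
        if not 0 <= code <= 0xFF:
            raise ValueError(f"not a single-byte character code: {code}")

    lines = [
        "/CIDInit /ProcSet findresource begin",
        "12 dict begin",
        "begincmap",
        "/CIDSystemInfo << /Registry (Adobe) /Ordering (UCS) /Supplement 0 >> def",
        "/CMapName /Adobe-Identity-UCS def",
        "/CMapType 2 def",
        "1 begincodespacerange",
        "<00> <FF>",
        "endcodespacerange",
    ]

    entries = sorted(code_to_text.items())
    # A single bfchar section may hold at most 100 entries, so split them up.
    for start in range(0, len(entries), 100):
        chunk = entries[start:start + 100]
        lines.append(f"{len(chunk)} beginbfchar")
        for code, text in chunk:
            # Destination values are written as UTF-16BE hex strings.
            utf16 = text.encode("utf-16-be").hex().upper()
            lines.append(f"<{code:02X}> <{utf16}>")
        lines.append("endbfchar")

    lines += [
        "endcmap",
        "CMapName currentdict /CMap defineresource pop",
        "end",
        "end",
    ]
    return "\n".join(lines) + "\n"


# Assumption: Windows-1251 as a stand-in Cyrillic encoding (codes C0..FF).
cyrillic = {code: bytes([code]).decode("cp1251") for code in range(0xC0, 0x100)}
cmap = build_to_unicode_cmap(cyrillic)
```

The resulting text goes into a stream object that the font dictionary
references through its /ToUnicode entry. As noted above, this did not
change what Acrobat extracts for the Cyrillic Type 1 font.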

On 13.02.2008 12:12:59 Jeremias Maerki wrote:
<snip/>
> BTW, I just found out that I have to generate a ToUnicode CMap if a
> Type1 font doesn't use one of the encodings that are predefined in the
> PDF spec. So a little more work for me there.
<snip/>



Jeremias Maerki
