PDFdev is a service provided by PDFzone.com | http://www.pdfzone.com _____________________________________________________________

From: Leonard Rosenthol <[EMAIL PROTECTED]>

I do realize that not all PDFs use /ToUnicode (although their number is significant, approx. 600 out of 2000 in my randomly picked batch of files have it).

Your sampling isn't very "random", and I would bet that all of those 600 are from Distiller 5.0 or later, or Adobe InDesign. Almost none of the other PDF production tools out there genereate /ToUnicode.

Not true. In addition to our own PDFlib 5 which creates ToUnicode CMaps for certain combinations of fonts and encodings, I have PDFs with a ToUnicode CMap which have been generated by various versions of Distiller 4.x and Agfa Apogee Normalizer. I'm not sure, but converting an existing PDF into Tagged PDF by any of the available methods may also add a ToUnicode CMap for fonts which need one.

Would you happen to know the meaning of the word "def" in a Cmap (see original message)? And why would it be placed inside a dictionary?

It's part of the PostScript sequence "key value def" which is used to define certain entries in the CMap using PostScript language idioms.

The embedded dictionary contains the word def which trails each entry, =
and that makes the dictionary unreadable by my parser, since it expects =
the strict << /Name Value >> structure, and this nasty little word =
breaks it.=20

This assumption is wrong since CMaps could generally use the full PostScript language facilities. Although ToUnicode CMaps use a somewhat restricted subset (as detailed in the PDF reference) they still include PostScript stuff does not occur in other places within PDF (e.g. "12 dict begin").

Thomas



_______________________________________________________________
Thomas Merz          [EMAIL PROTECTED]        http://www.pdflib.com
Personalize PDF: PDFlib Personalization Server and Block plugin
_______PDFlib - a library for generating PDF on the fly________


To change your subscription: http://www.pdfzone.com/discussions/lists-pdfdev.html



Reply via email to