On 12/12/2011 04:05 AM, mehdi houshmand wrote:
Hi Craig,

We're looking into this exact same problem, I'll let you know if
anything comes of it.

That's a handy co-incidence.

When you say "exactly" the same problem - is your work connected with Apache FOP output of embedded PDFs via Jeremias's fop-pdf-images extension ( http://www.jeremias-maerki.ch/download/fop/pdf-images/) too?

Or do you just mean that you're interested in de-duplicating and merging embedded subset fonts in general?

Have you made any progress since you started looking? Any avenues you've already ruled out?

What's the context you're interested in this for? Mine is a classified pagination application I'm developing in-house for my newspaper employer and will be releasing under an open source license (exact license yet to be determined) once the kinks are worked out or sooner if anyone's interested in it. It can use EPS image resources for PostScript output (to PDF via Distiller), but I'd prefer to produce native PDF if I can fix this font issue.

I'm going to be looking through the pdfbox and fontbox sources to see what sort of font handling code there already is. I'm particularly going to be searching for anything that parses and understands embedded fonts, as being able to easily determine which glyphs are already defined in an embedded subset would be a big help if I want to re-embed with a new subset rather than a whole font.

Any info on what you've already done to avoid duplicating work would be very handy.

--
Craig Ringer

Reply via email to