On 12/12/2011 04:05 AM, mehdi houshmand wrote:
Hi Craig,
We're looking into this exact same problem, I'll let you know if
anything comes of it.
That's a handy co-incidence.
When you say "exactly" the same problem - is your work connected with
Apache FOP output of embedded PDFs via Jeremias's fop-pdf-images
extension ( http://www.jeremias-maerki.ch/download/fop/pdf-images/) too?
Or do you just mean that you're interested in de-duplicating and merging
embedded subset fonts in general?
Have you made any progress since you started looking? Any avenues you've
already ruled out?
What's the context you're interested in this for? Mine is a classified
pagination application I'm developing in-house for my newspaper employer
and will be releasing under an open source license (exact license yet to
be determined) once the kinks are worked out or sooner if anyone's
interested in it. It can use EPS image resources for PostScript output
(to PDF via Distiller), but I'd prefer to produce native PDF if I can
fix this font issue.
I'm going to be looking through the pdfbox and fontbox sources to see
what sort of font handling code there already is. I'm particularly going
to be searching for anything that parses and understands embedded fonts,
as being able to easily determine which glyphs are already defined in an
embedded subset would be a big help if I want to re-embed with a new
subset rather than a whole font.
Any info on what you've already done to avoid duplicating work would be
very handy.
--
Craig Ringer