Hi all

With pdf-image, is there any way to coalesce or merge multiple different subsets of the same font into a single font subset with no duplicate glyphs? Eg 50 different "Helvetica (subset)" instances into a single font in the output document?

Background:

I've just got Jeremias's pdf-image extension integrated into my code. It worked perfectly and immediately with little effort, which was delightful. Thankyou *VERY* much Jeremias for publishing that, it's a fantastic tool and I'd love to see it in fop core.

I'm encountering an unexpected issue with it, though: the PDFs produced by fop are *huge*. Examination with Acrobat Pro suggests that 90% of the space is taken up by fonts. Looking at the font list, I see huge numbers of copies of "Helvetica (subset)", "Helvetica Black (subset)" etc. That makes sense, since all the input PDFs have fonts embedded, and many use the same fonts. However, I'm including up to 1000 PDFs in each output PDF so the size adds up to prohibitive levels.

I'm wondering if there's any way to tell the pdf-image extension to embed certain fonts fully from supplied font files and avoid copying the matching subsets over from the input PDFs. If there isn't anything like that, any idea how practical it'd be?

For that matter, is the idea of collecting up all the subsets of a font as each pdf-image is embedded, then merging them into a single new embedded subset at the end completely insane? Or is it potentially practical? For that matter just keeping track of which glyphs are defined in each subset and building a new subset from a master font file at the end that included all those glyphs would help a lot.

I'm *really* hoping to avoid having to keep on using EPS input and PostScript output to PDF via Distiller, so I'm willing to put some work into this.

--
Craig Ringer

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to