pdf-image: Handling size blow-out caused by fonts, any way to coalesce/merge multiple subsets of same font?

Craig Ringer Mon, 12 Dec 2011 00:56:25 -0800

Hi all

With pdf-image, is there any way to coalesce or merge multiple different subsets of the same font into a single font subset with no duplicate glyphs? For example, 50 different "Helvetica (subset)" instances into a single font in the output document?


Background:

I've just got Jeremias's pdf-image extension integrated into my code. It worked perfectly and immediately with little effort, which was delightful. Thank you *VERY* much Jeremias for publishing that, it's a fantastic tool and I'd love to see it in fop core.

I'm encountering an unexpected issue with it, though: the PDFs produced by fop are *huge*. Examination with Acrobat Pro suggests that 90% of the space is taken up by fonts. Looking at the font list, I see huge numbers of copies of "Helvetica (subset)", "Helvetica Black (subset)" etc. That makes sense, since all the input PDFs have fonts embedded, and many use the same fonts. However, I'm including up to 1000 PDFs in each output PDF so the size adds up to prohibitive levels.

I'm wondering if there's any way to tell the pdf-image extension to embed certain fonts fully from supplied font files and avoid copying the matching subsets over from the input PDFs. If there isn't anything like that, any idea how practical it'd be?

For that matter, is the idea of collecting up all the subsets of a font as each pdf-image is embedded, then merging them into a single new embedded subset at the end, completely insane? Or is it potentially practical? Even just keeping track of which glyphs are defined in each subset, and building a new subset from a master font file at the end that includes all those glyphs, would help a lot.
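
To make the glyph-tracking idea concrete, here is a minimal sketch of just the bookkeeping step, not of anything pdf-image or fop actually does. It assumes each embedded subset can be reduced to a font name plus a set of glyph names, and that subset names follow the usual PDF convention of a six-uppercase-letter tag and a "+" before the base name. The function names and the sample data are hypothetical; extracting the glyph lists from the input PDFs and building the final subset from a master font file are not shown.

```python
import re
from collections import defaultdict

# Subset fonts in a PDF are conventionally named with a six-uppercase-letter
# tag and '+' in front of the base font name, e.g. "ABCDEF+Helvetica".
_SUBSET_TAG = re.compile(r"^[A-Z]{6}\+")


def base_font_name(name):
    """Strip the subset tag, leaving non-subset names unchanged."""
    return _SUBSET_TAG.sub("", name)


def merge_glyph_sets(subsets):
    """Union the glyphs of every subset that shares a base font name.

    `subsets` is an iterable of (font_name, glyph_names) pairs, one pair
    per embedded subset seen while including the input PDFs.
    """
    merged = defaultdict(set)
    for font_name, glyphs in subsets:
        merged[base_font_name(font_name)].update(glyphs)
    return dict(merged)


# Hypothetical sample data standing in for three embedded subsets.
subsets = [
    ("ABCDEF+Helvetica", {"H", "e", "l", "o"}),
    ("GHIJKL+Helvetica", {"W", "o", "r", "l", "d"}),
    ("MNOPQR+Helvetica-Black", {"A", "B"}),
]

merged = merge_glyph_sets(subsets)
for font, glyphs in sorted(merged.items()):
    print(font, sorted(glyphs))
```

The resulting per-font glyph sets are what a final subsetting pass over the master font file would need as input; the hard part, rewriting the content streams and encodings of the included pages to point at the merged font, is outside this sketch.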

I'm *really* hoping to avoid having to keep on using EPS input and PostScript output to PDF via Distiller, so I'm willing to put some work into this.


--
Craig Ringer

