Re: [fw-general] Zend_Pdf requirements

Bryce Lohr Thu, 08 Nov 2007 06:31:53 -0800


Rick Gigger wrote:

Yes, I was aware of those. I don't use them because I implemented my own XML-based solution that has support for tables, page breaking, columns, full rich text support, etc. The basic support in those two engines was never enough. All I really need are the primitives to measure, style and draw text, and of course primitive line and shape drawing operations. I have an abstraction layer that lets me swap back and forth between R&OS and FPDF. My XML parser / DOM style rendering library handles the rest. So I don't think it would be too hard to use Zend_Pdf instead.

That sounds really interesting... I can see why switching to Zend_Pdf wouldn't be too hard at that point.

The PDF renderer needs to map the text it is trying to render to an available font. Unfortunately none of the standard PDF fonts are Unicode fonts. So if you want to be able to actually put Unicode text in and have it understood then you have to (if I am reading the spec correctly) embed the Unicode font right into the PDF. Licensing issues aside, entire Unicode fonts are about 10-20 MB. Most of the PDFs I generate are under 50 KB so that's certainly not going to work. So what I am left with is to determine the language of each block of text and convert it to a local encoding that the PDF spec will happily accept. That seems a little tricky.

I'm no PDF expert, but for some reason I thought you could embed partial fonts. Meaning, you would only have to embed the glyphs you actually used in the document. Seemingly, you could do that with the Unicode font to drastically bring down the size requirement.
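
As a rough sketch of that idea (Python, purely for illustration; this is not Zend_Pdf code, and `used_code_points` is a made-up helper name): the first step of partial embedding would be to collect the distinct characters the document actually draws, so that only their glyphs need to be pulled out of the big font.

```python
def used_code_points(text_blocks):
    """Return the sorted, de-duplicated code points drawn in the document.

    Whitespace is skipped on the assumption that it needs no visible glyph.
    """
    seen = set()
    for block in text_blocks:
        for ch in block:
            if not ch.isspace():
                seen.add(ord(ch))
    return sorted(seen)


# Hypothetical document content: two short text blocks.
blocks = ["Invoice 請求書", "Total 合計: 1,200円"]
points = used_code_points(blocks)
print(len(points), "distinct characters would need glyphs")
```

The resulting list is what you would hand to whatever tool or library does the actual glyph extraction; that part is not shown here.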

Say you have a small segment of Japanese with only a few Kanji characters. How do I distinguish that from a few characters of Traditional Chinese? Let's say I figure that part out but then they have, say, Korean mixed into the same text with Japanese. I have to split out the Korean from the Japanese and draw them separately, each time converting to a local encoding and indicating the correct font to use.
Anyway that is the problem as I see it. Once I get this figured out I would happily contribute any or all of it to Zend_Pdf if the author wants it. I have this hope that someone working on Zend_Pdf is going to say that I've read the spec all wrong and that I can somehow add Unicode text directly to the PDF and have the PDF reader map it all to a Unicode font if present or into the various non-Unicode local fonts if it's not. And of course if whoever is working on the UTF-8 support for Zend_Pdf figures it all out and implements it I'll be more than happy to just switch my rendering engine to just use Zend_Pdf for primitives and get the Unicode support for free.

That definitely sounds like a tricky problem. I haven't read all the PDF spec, nor have I worked much with the Zend_Pdf code, so I unfortunately don't have any answers here. I suppose you could just build a PDF file with all those characters, and see if it comes out as gibberish.
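
For what it's worth, the splitting step itself might look something like the sketch below (Python, just to illustrate; the range table is deliberately incomplete, and `script_of`, `split_runs` and `encodable` are invented names, not anything from Zend_Pdf). It breaks a string into runs of the same script and checks whether a run fits a given legacy encoding. It does not solve the Kanji versus Traditional Chinese question: Han characters share code points across those languages, so a "han" run stays ambiguous here.

```python
from itertools import groupby

# Code point ranges for the scripts discussed above (not exhaustive).
SCRIPT_RANGES = [
    ("hangul", 0xAC00, 0xD7A3),  # Hangul syllables
    ("hangul", 0x1100, 0x11FF),  # Hangul jamo
    ("kana", 0x3040, 0x30FF),    # Hiragana and Katakana
    ("han", 0x4E00, 0x9FFF),     # CJK Unified Ideographs (shared across languages)
]


def script_of(ch):
    """Classify one character by code point range; anything else is 'other'."""
    cp = ord(ch)
    for name, lo, hi in SCRIPT_RANGES:
        if lo <= cp <= hi:
            return name
    return "other"


def split_runs(text):
    """Split text into (script, substring) runs of consecutive same-script characters."""
    return [(script, "".join(chars)) for script, chars in groupby(text, key=script_of)]


def encodable(text, codec):
    """Return True if every character of text exists in the given legacy codec."""
    try:
        text.encode(codec)
        return True
    except UnicodeEncodeError:
        return False


runs = split_runs("日本語のテキスト 한국어")
print(runs)
# [('han', '日本語'), ('kana', 'のテキスト'), ('other', ' '), ('hangul', '한국어')]

print(encodable("한국어", "euc_kr"))     # True
print(encodable("한국어", "shift_jis"))  # False
```

One plausible heuristic on top of this would be to treat a "han" run as Japanese when it sits next to a "kana" run, but that is a guess and would not help with text that is Kanji only.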


Regards,
Bryce Lohr
