Hi Will;

"What I did here was 
to use the Tidy engine/library to fix up my HTML into valid XML."

This library you are referring to. Which package is it part of?

Thanks,

Luke

----- Original Message ----- 
From: "Will Gilbert" <[EMAIL PROTECTED]>
To: <[EMAIL PROTECTED]>
Sent: Tuesday, January 04, 2005 11:30 AM
Subject: Re: New To Fop


> 
> Luke,
> 
> What you are looking to do comes up pretty often on this list so you 
> will probably get quite a bit of help.
> 
> Here's mine....
> 
> What you will need to do is go from HTML into XML then into FO, once in 
> FO, FOP can render it quite quickly into a PDF, your browser can even 
> be used as the delivery mechanism.
> 
> I wrote a Java Servlet which is invoked via an HTML page link, the 
> links passes the necessary parameters. In your case that will be a 
> reference to the original HMTL file.
> 
> The next step is not obvious, hence this e-mail.   Not all HTML is 
> XML-ready, humans make mistakes which most browsers correct, unbalanced 
> and missing tags for example.  Also some tags need to be doctored, <BR> 
> and <HR> come to mind, these have no closing tags.  What I did here was 
> to use the Tidy engine/library to fix up my HTML into valid XML.
> 
> Now the job gets pretty easy...
> 
> The next step is to develop an XSL transform which takes HMTL tags and 
> create FO XML.  I have some transforms which I am very happy to share 
> with you, as will others.  Nobody has a complete HTML to FO 
> implementation as this would be huge but you can get most of the 
> transform working quickly and then add to it as needed.
> 
> Once you have the FO XML -- BOOM, a few lines of code later and you've 
> got your PDF.
> 
> The servlet I wrote actually communicates back to the browser every 
> second and fakes an elapsed progress timer.  We had to do this as we 
> originally were running on slow hardware and have very impatient.  With 
> our hardware these days the transform and PDF generation runs so 
> quickly, the interaction is more of a nuisance that an aid.  But at the 
> time that is what the boss wanted, so I wrote it.
> 
> --will
> 

---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Reply via email to