On Tue, 14 Jun 2005, Brian Chrisman wrote:

Sebastian Smith wrote:

Hi All,

I haven't RTFM on this one yet, but could use some advice anyway.

I've downloaded the online version of a textbook using wget. It's a great book, but a pain to view in html, and not very portable (there are a couple thousand files associated with this book). So, what I'd like to do is convert the book to pdf. I'm not sure how to go about it though. I'd like all links to remain intact so that navigating through the book is still easy.

Here is a link to the book:

http://www.cs.ualberta.ca/%7Esutton/book/ebook/the-book.html

I appreciate your suggestions.

How good do you want it to be? There are things like html2pdf out there which can vaguely do what you're looking for... the big limitation being that pdf is a typesetting language, whereas HTML is nominally a display-neutral markup language.... ie, there's no real concept in html of where to stop or start a particular page, footer, or what not, while there is in pdf. XSL:FO(and to some extent, XSLt) is all about turning XML into pdf documents, but FO is extremely... err.. verbose... Later versions of CSS also seem to be doing somewhat of a half-assed job at becoming a typesetting language.

I'd like the quality to match the html pages if possible. I'm looking into a program called tidy for switching all html files to xml files.
From there I can run a program called fop to convert to pdf. Not sure how
well this will work, but I'll keep everyone posted.

- Sebastian

_______________________________________________
RLUG mailing list
[email protected]
http://lists.rlug.org/mailman/listinfo/rlug

Reply via email to