On Tue, 14 Jun 2005, Brian Chrisman wrote:
Sebastian Smith wrote:
Hi All,
I haven't RTFM on this one yet, but could use some advice anyway.
I've downloaded the online version of a textbook using wget. It's a great
book, but a pain to view in html, and not very portable (there are a couple
thousand files associated with this book). So, what I'd like to do is
convert the book to pdf. I'm not sure how to go about it though. I'd like
all links to remain intact so that navigating through the book is still
easy.
Here is a link to the book:
http://www.cs.ualberta.ca/%7Esutton/book/ebook/the-book.html
I appreciate your suggestions.
How good do you want it to be? There are things like html2pdf out there
which can vaguely do what you're looking for... the big limitation being that
pdf is a typesetting language, whereas HTML is nominally a display-neutral
markup language.... ie, there's no real concept in html of where to stop or
start a particular page, footer, or what not, while there is in pdf.
XSL:FO(and to some extent, XSLt) is all about turning XML into pdf documents,
but FO is extremely... err.. verbose...
Later versions of CSS also seem to be doing somewhat of a half-assed job at
becoming a typesetting language.
I'd like the quality to match the html pages if possible. I'm looking
into a program called tidy for switching all html files to xml files.
From there I can run a program called fop to convert to pdf. Not sure how
well this will work, but I'll keep everyone posted.
- Sebastian
_______________________________________________
RLUG mailing list
[email protected]
http://lists.rlug.org/mailman/listinfo/rlug