I'm the webmaster of Project Gutenberg and I'm about to install the
plucker distiller on the PG website. The idea is to have people download
a ready-made plucker pdb instead of requiring them to run the distiller
on the appropriate ebook file.
I'm going to replace the text/plain parser with a custom one that will
(try to) parse chapter heads, italics etc. out of the plain text.
I'm writing a custom text/plain parser.
I'm parsing a text file and as I go along I add all chapter heads to the
bookmark list using:
PluckerDocs.PluckerTextDocument.add_bookmark
When I look at the plucker database all bookmarks are sorted by title:
Appendix A
Appendix B
Chapter 1
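To make the ordering problem concrete, here is a minimal sketch of collecting chapter heads in document order. It does not call the Plucker API at all; the regular expression, the function name find_chapter_heads and the sample text are assumptions made up for illustration. It only shows that sorting the collected titles alphabetically produces the Appendix-before-Chapter listing above, while the offsets recorded during parsing keep the original order.

```python
import re

# Hypothetical heading pattern: a line starting with "Chapter" or "Appendix"
# followed by a number, a Roman numeral, or a single letter.
HEAD_RE = re.compile(
    r"^(chapter|appendix)\s+([0-9]+|[ivxlcdm]+|[a-z])\b.*$", re.IGNORECASE
)


def find_chapter_heads(text):
    """Return (offset, title) pairs in the order the heads occur in text."""
    heads = []
    offset = 0
    for line in text.splitlines(keepends=True):
        stripped = line.strip()
        if HEAD_RE.match(stripped):
            # Record where the line starts so the order can be restored later.
            heads.append((offset, stripped))
        offset += len(line)
    return heads


sample = (
    "Chapter 1\nIt was a dark night.\n\n"
    "Appendix A\nNotes.\n\n"
    "Appendix B\nMore notes.\n"
)
heads = find_chapter_heads(sample)
in_document_order = [title for _, title in heads]
sorted_by_title = sorted(in_document_order)
```

Keeping the offset next to each title means the document order can always be recovered, whatever a viewer or database writer does with the titles afterwards.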
That's a wonderful idea. Are you going to be caching the pdbs, or will
it be fast enough to generate on demand?
Sorry, don't know about sorting of bookmarks. I myself added sorting of
all records by URL to the parser, though, to keep chapters and the like
in the right order. Maybe the
Alexander R. Pruss wrote:
That's a wonderful idea. Are you going to be caching the pdbs, or will
it be fast enough to generate on demand?
I'll have to cache them.
Are you going to be making the docs split into 32K pages, or will you
use the continuation flag to make each doc look like a
David A. Desrosiers wrote:
I'm going to replace the text/plain parser with a custom one that will
(try to) parse chapter heads, italics etc. out of the plain text.
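As a rough illustration of the italics part, the sketch below splits a line into plain and italic runs. It assumes italics are marked as _like this_ in the plain text, which many but not all texts do; the name split_italics and the pattern are hypothetical and not part of the distiller.

```python
import re

# Assumption: italics are written as _like this_ and do not span lines.
ITALIC_RE = re.compile(r"_([^_\n]+)_")


def split_italics(line):
    """Split a line into (text, is_italic) runs."""
    runs = []
    pos = 0
    for match in ITALIC_RE.finditer(line):
        if match.start() > pos:
            # Plain text between the previous match and this one.
            runs.append((line[pos:match.start()], False))
        runs.append((match.group(1), True))
        pos = match.end()
    if pos < len(line):
        runs.append((line[pos:], False))
    return runs


runs = split_italics("He read _Moby Dick_ twice.")
```

A pattern this simple will misfire on underscores used for other purposes, such as identifiers or blank lines drawn with underscores, so a real parser would need more context than a single line.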
I'd be interested to see how you solve the context issue that has
been brought up on the pg lists over the last year or so.
I don't know if this would help or not, but I always go off the HTML
version and break on any H1 or H2. That isn't perfect either, but is
easier to do.
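For reference, breaking on H1 and H2 can be sketched with nothing but the standard library. This is only an illustration of the idea, not the code actually in use; the class HeadingSplitter, the function split_on_headings and the section layout are assumptions.

```python
from html.parser import HTMLParser


class HeadingSplitter(HTMLParser):
    """Collect text into sections, starting a new one at each <h1> or <h2>."""

    def __init__(self):
        super().__init__()
        # The first section holds anything that precedes the first heading.
        self.sections = [{"title": None, "text": []}]
        self._in_heading = False

    def handle_starttag(self, tag, attrs):
        if tag in ("h1", "h2"):
            self.sections.append({"title": "", "text": []})
            self._in_heading = True

    def handle_endtag(self, tag):
        if tag in ("h1", "h2"):
            self._in_heading = False

    def handle_data(self, data):
        if self._in_heading:
            self.sections[-1]["title"] += data
        elif data.strip():
            self.sections[-1]["text"].append(data.strip())


def split_on_headings(html):
    """Return the sections of html, dropping an empty leading section."""
    parser = HeadingSplitter()
    parser.feed(html)
    parser.close()
    return [s for s in parser.sections if s["title"] is not None or s["text"]]


sections = split_on_headings(
    "<h1>Book</h1><p>Intro.</p>"
    "<h2>Chapter 1</h2><p>Text one.</p><p>Text two.</p>"
)
```

Each section title is then a natural bookmark candidate, and the sections come out in document order without any sorting.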
-----Original Message-----
From: [EMAIL PROTECTED]
[mailto:[EMAIL PROTECTED] On Behalf Of Marcello
Perathoner
Sent: Wednesday, November 02,
Lambert, Mark wrote:
I don't know if this would help or not, but I always go off the HTML
version and break on any H1 or H2. That isn't perfect either, but is
easier to do.
Not all PG ebooks have an HTML version.
--
Marcello Perathoner
[EMAIL PROTECTED]
On Behalf Of Marcello Perathoner
Sent: Wednesday, November 02, 2005 1:37 PM
To: plucker-dev@rubberchicken.org
Subject: Re: Plucker server on Project Gutenberg
Lambert, Mark wrote:
I don't know if this would help or not, but I always go off the HTML
version and break on any H1 or H2. That
Lambert, Mark wrote:
But it is low-hanging fruit that would make it simpler for those that
have HTML.
If they have HTML, of course I use HTML. But more than half of them don't.
--
Marcello Perathoner
[EMAIL PROTECTED]
---Reply to mail from Marcello Perathoner about Why are bookmarks sorted?
I'm writing a custom text/plain parser.
I'm parsing a text file and as I go along I add all chapter heads to the
bookmark list using:
PluckerDocs.PluckerTextDocument.add_bookmark
[...]
I'm wondering why