Plucker server on Project Gutenberg

2005-11-02 Thread Marcello Perathoner
I'm the webmaster of Project Gutenberg and I'm about to install the plucker distiller on the PG website. The idea is to have people download a ready-made plucker pdb instead of requiring them to run the distiller on the appropriate ebook file. I'm going to replace the text/plain parser with a

Why are bookmarks sorted?

2005-11-02 Thread Marcello Perathoner
I'm writing a custom text/plain parser. I'm parsing a text file and as I go along I add all chapter heads to the bookmark list using: PluckerDocs.PluckerTextDocument.add_bookmark When I look at the plucker database all bookmarks are sorted by title: Appendix A Appendix B Chapter 1
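A minimal sketch of the ordering described above, assuming each bookmark can be modelled as a (title, offset) pair. It does not call the Plucker API; the records and offsets are hypothetical and only illustrate why a title sort puts the appendices ahead of the chapters, while a sort on document position keeps reading order.

```python
# Hypothetical bookmark records: (title, offset into the document).
# These stand in for whatever add_bookmark stores; the real Plucker
# data structure is not shown in the thread.
bookmarks = [
    ("Chapter 1", 120),
    ("Chapter 2", 4800),
    ("Appendix A", 9100),
    ("Appendix B", 9900),
]

# Sorting by title reproduces the order reported in the message.
by_title = sorted(bookmarks, key=lambda b: b[0])

# Sorting by offset keeps the order in which the heads occur in the text.
by_position = sorted(bookmarks, key=lambda b: b[1])

print([title for title, _ in by_title])
print([title for title, _ in by_position])
```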

Re: Plucker server on Project Gutenberg

2005-11-02 Thread Alexander R. Pruss
That's a wonderful idea. Are you going to be caching the pdbs, or will it be fast enough to generate on demand? Sorry, don't know about sorting of bookmarks. I myself added sorting of all records by URL to the parser, though, to keep chapters and the like in the right order. Maybe the

Re: Plucker server on Project Gutenberg

2005-11-02 Thread Marcello Perathoner
Alexander R. Pruss wrote: That's a wonderful idea. Are you going to be caching the pdbs, or will it be fast enough to generate on demand? I'll have to cache them. Are you going to be making the docs split into 32K pages, or will you use the continuation flag to make each doc look like a

Re: Plucker server on Project Gutenberg

2005-11-02 Thread Marcello Perathoner
David A. Desrosiers wrote: I'm going to replace the text/plain parser with a custom one that will (try to) parse chapter heads, italics etc. out of the plain text. I'd be interested to see how you solve the context issue that has been brought up on the pg lists over the last year or so.
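One possible shape for such a parser, as a sketch only. It assumes chapter heads are stand-alone lines (blank line before and after) beginning with "Chapter" or "Appendix" plus a number, Roman numeral, or single letter, and that italics are marked with _underscores_. Both conventions, and the names `find_chapter_heads` and `mark_italics`, are assumptions for illustration and not the parser discussed in the thread.

```python
import re

# Assumed head pattern: "Chapter"/"Appendix" + number, Roman numeral, or letter.
HEAD_RE = re.compile(
    r"^(chapter|appendix)\s+([0-9]+|[ivxlcdm]+|[a-z])\b.*$", re.IGNORECASE
)
# Assumed italics markup: _text_ on a single line.
ITALIC_RE = re.compile(r"_([^_\n]+)_")


def find_chapter_heads(text):
    """Return (line_number, title) pairs for lines that look like chapter heads."""
    lines = text.splitlines()
    heads = []
    for i, line in enumerate(lines):
        stripped = line.strip()
        # Require blank lines around the candidate to skip in-paragraph mentions.
        prev_blank = i == 0 or not lines[i - 1].strip()
        next_blank = i == len(lines) - 1 or not lines[i + 1].strip()
        if prev_blank and next_blank and HEAD_RE.match(stripped):
            heads.append((i + 1, stripped))
    return heads


def mark_italics(line):
    """Wrap _underscored_ runs in <i> tags."""
    return ITALIC_RE.sub(r"<i>\1</i>", line)
```

A heuristic like this will miss heads that follow other conventions (for example a bare Roman numeral), which is the kind of context problem raised in the quoted message.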

RE: Plucker server on Project Gutenberg

2005-11-02 Thread Lambert, Mark
I don't know if this would help or not, but I always go off the HTML version and break on any H1 or H2. That isn't perfect either, but is easier to do.
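Breaking on H1 or H2 could look roughly like the following, using only Python's standard `html.parser`. The class and function names are hypothetical, and real PG HTML files may need more care (nested markup inside headings, headings used for title pages, and so on).

```python
from html.parser import HTMLParser


class HeadingSplitter(HTMLParser):
    """Collect document text into sections, starting a new one at each h1/h2."""

    def __init__(self):
        super().__init__()
        # The first section holds any text that precedes the first heading.
        self.sections = [{"title": None, "text": []}]
        self._in_heading = False

    def handle_starttag(self, tag, attrs):
        if tag in ("h1", "h2"):
            self.sections.append({"title": "", "text": []})
            self._in_heading = True

    def handle_endtag(self, tag):
        if tag in ("h1", "h2"):
            self._in_heading = False

    def handle_data(self, data):
        if self._in_heading:
            self.sections[-1]["title"] += data
        else:
            self.sections[-1]["text"].append(data)


def split_on_headings(html):
    """Return (title, text) pairs; title is None for text before any heading."""
    parser = HeadingSplitter()
    parser.feed(html)
    parser.close()
    result = []
    for section in parser.sections:
        text = "".join(section["text"]).strip()
        # Drop the leading untitled section if it is empty.
        if section["title"] is not None or text:
            result.append((section["title"], text))
    return result
```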

Re: Plucker server on Project Gutenberg

2005-11-02 Thread Marcello Perathoner
Lambert, Mark wrote: I don't know if this would help or not, but I always go off the HTML version and break on any H1 or H2. That isn't perfect either, but is easier to do. Not all PG ebooks have an HTML version.

RE: Plucker server on Project Gutenberg

2005-11-02 Thread Lambert, Mark
Lambert, Mark wrote: I don't know if this would help or not, but I always go off the HTML version and break on any H1 or H2. That

Re: Plucker server on Project Gutenberg

2005-11-02 Thread Marcello Perathoner
Lambert, Mark wrote: But it is low-hanging fruit that would make it simpler for those that have HTML. If they have HTML, of course I use HTML. But more than half of them don't.

Re: Why are bookmarks sorted?

2005-11-02 Thread Chris Hawks
---Reply to mail from Marcello Perathoner about Why are bookmarks sorted? I'm writing a custom text/plain parser. I'm parsing a text file and as I go along I add all chapter heads to the bookmark list using: PluckerDocs.PluckerTextDocument.add_bookmark [...] I'm wondering why