In data lunedì 3 maggio 2010 16:23:06, Ferran Jorba ha scritto:
> Thanks for thee suggestion. I've tried it on one of our 100 MB+
> monsters and what I've seen is that the size doesn't vary. But
> certainly Xpdf's pdfinfo notes the change in the «Optimized» field:
>
> before after pdfopt
>
> Pages: 294 294
> Encrypted: no no
> File size: 112858067 bytes 112838493 bytes
> Optimized: no yes
> PDF version: 1.5 1.5
>
> Another task in our TODO list...
Yes, this is correct. The file is basically the same and the size should just
get a bit larger. What really changes is that the PDF is reorganized in a way
that a special table stored inside the PDF renderer in a easy position is
filled up with pointers to the pages, so that it's possible to jump directly
to any exact page (so this can be exploited in HTTP connections, to request
exactly the range of bytes which is sufficient to render the specific page,
while continuing pre-fetching the rest of the document in background). So if
you actually try to access your "optimized" monster through Multivio (if
Multivio is taking advantage of this feature) you should definitively be able
to jump to each page quickly, regardless of the size...)
Cheers,
Sam
P.s in the next release of Invenio there will be integrated a conversion
library that will, among other things, wrap this pdfopt in the fulltext
management operations, so that you can have in principle for free this
optimization (though in the current git master it's not currently fully
integrated yet in WebSubmit & friends...)
--
Samuele Kaplun ** CERN Document Server ** <http://cds.cern.ch/>