In data lunedì 3 maggio 2010 16:23:06, Ferran Jorba ha scritto:
> Thanks for thee suggestion.  I've tried it on one of our 100 MB+
> monsters and what I've seen is that the size doesn't vary.  But
> certainly Xpdf's pdfinfo notes the change in the «Optimized» field:
> 
>                  before                 after pdfopt
> 
>  Pages:          294                        294
>  Encrypted:      no                         no
>  File size:      112858067 bytes            112838493 bytes
>  Optimized:      no                         yes
>  PDF version:    1.5                        1.5
> 
> Another task in our TODO list...

Yes, this is correct. The file is basically the same and the size should just 
get a bit larger. What really changes is that the PDF is reorganized in a way 
that a special table stored inside the PDF renderer in a easy position is 
filled up with pointers to the pages, so that it's possible to jump directly 
to any exact page (so this can be exploited in HTTP connections, to request 
exactly the range of bytes which is sufficient to render the specific page, 
while continuing pre-fetching the rest of the document in background). So if 
you actually try to access your "optimized" monster through Multivio (if 
Multivio is taking advantage of this feature) you should definitively be able 
to jump to each page quickly, regardless of the size...)

Cheers,
        Sam

P.s in the next release of Invenio there will be integrated a conversion 
library that will, among other things, wrap this pdfopt in the fulltext 
management operations, so that you can have in principle for free this 
optimization (though in the current git master it's not currently fully 
integrated yet in WebSubmit & friends...)

-- 
Samuele Kaplun ** CERN Document Server ** <http://cds.cern.ch/>

Reply via email to