Re: On-demand parser in 3.x

Andreas Lehmkuehler Thu, 08 Apr 2021 23:24:15 -0700

Hi,

Am 08.04.21 um 23:33 schrieb Tim Allison:

Hi All,
   Two questions on 3.x... do we get the benefits of the on demand
parser "for free" in 3.x...in short, eo we have to load/parse the
document in a special way for the (hopefully) lower memory profile?

The on demand feature is the only way to parse and therefore it is default.

One can use different ways to read the input file which will have an impact onthe memory consumption: buffered file, input stream and memory mapped file

Second question: will the on demand parser be more memory efficient
for page splitting?

The current implementation is the first stage of the on demand parser. It readsonly those parts which are needed but it doesn't drop parts which aren't neededanymore. Saying that, if your interest in limited to parts of a pdf (one pageout of 100) or to a single aspect of a pdf (text extraction instead ofrendering) it should have a lower memory footprint, but if you have to parse thewhole document you won't save any memory resources (maybe there are some minorimprovements due to the many refactorings, but I don't expect a huge impact)Back to your question, it depends on your goal. Splirting a pdf into singlepages won't most likely save any resources, save a handful of pages of a hugspdf should have a lower footprint.


There is still a lot of room for improvements :-)

Andreas

   Thank you!

       Cheers,

             Tim

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]



---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Re: On-demand parser in 3.x

Reply via email to