Hi,

Am 08.04.21 um 23:33 schrieb Tim Allison:
Hi All,
   Two questions on 3.x... do we get the benefits of the on demand
parser "for free" in 3.x...in short, eo we have to load/parse the
document in a special way for the (hopefully) lower memory profile?
The on demand feature is the only way to parse and therefore it is default.

One can use different ways to read the input file which will have an impact on the memory consumption: buffered file, input stream and memory mapped file

Second question: will the on demand parser be more memory efficient
for page splitting?
The current implementation is the first stage of the on demand parser. It reads only those parts which are needed but it doesn't drop parts which aren't needed anymore. Saying that, if your interest in limited to parts of a pdf (one page out of 100) or to a single aspect of a pdf (text extraction instead of rendering) it should have a lower memory footprint, but if you have to parse the whole document you won't save any memory resources (maybe there are some minor improvements due to the many refactorings, but I don't expect a huge impact) Back to your question, it depends on your goal. Splirting a pdf into single pages won't most likely save any resources, save a handful of pages of a hugs pdf should have a lower footprint.

There is still a lot of room for improvements :-)

Andreas

   Thank you!

       Cheers,

             Tim

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]



---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to