[jira] [Commented] (PDFBOX-4569) Implement an ondemand Parser

Maruan Sahyoun (JIRA) Sun, 16 Jun 2019 23:25:56 -0700


    [ 
https://issues.apache.org/jira/browse/PDFBOX-4569?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16865356#comment-16865356
 ]


Maruan Sahyoun commented on PDFBOX-4569:
----------------------------------------

Looking at the current code this is a huge step forward for a lot of common 
usage patterns.

Given the comments above we might be missing a typical use case though which is 
limiting the memory consumption even when going through the complete PDF. 

That could be implemented with the on demand parser in two ways I'd think
- either by a kind of drop call where parsed objects will be freed
- by freeing them after they have been referenced but leaving it to a gigher 
level to cache these

Thoughts?

I'm also happy if the case outlined is not covered but I think it's important 
that we communicate that clearly - i.e. where the benefits are and which cases 
might not benefit from the on demand approach.    

> Implement an ondemand Parser
> ----------------------------
>
>                 Key: PDFBOX-4569
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-4569
>             Project: PDFBox
>          Issue Type: Improvement
>          Components: Parsing
>    Affects Versions: 3.0.0 PDFBox
>            Reporter: Andreas Lehmkühler
>            Assignee: Andreas Lehmkühler
>            Priority: Major
>             Fix For: 3.0.0 PDFBox
>
>         Attachments: PDFBOX-1084.pdf
>
>
> There is a need to replace the big bang parser with an ondemand parser



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

[jira] [Commented] (PDFBOX-4569) Implement an ondemand Parser

Reply via email to