[ 
https://issues.apache.org/jira/browse/PDFBOX-5530?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17745332#comment-17745332
 ] 

liu edited comment on PDFBOX-5530 at 7/21/23 2:35 AM:
------------------------------------------------------

I have another question: how can I accurately count the COSStream objects? 
If I override the PDFParser method as follows, is the resulting count 
accurate?

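import java.io.IOException;
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.atomic.AtomicLong;
import org.apache.pdfbox.cos.COSDictionary;
import org.apache.pdfbox.cos.COSStream;
import org.apache.pdfbox.io.RandomAccessRead;
import org.apache.pdfbox.pdfparser.PDFParser;

public class MonitorPDFParser extends PDFParser {

    // Every stream returned by the parser, used as a set of keys.
    private final ConcurrentHashMap<COSStream, String> TASK_MAP = new ConcurrentHashMap<>();
    // Number of calls to parseCOSStream.
    private final AtomicLong count = new AtomicLong(0);

    // PDFParser (2.0.x) has no no-arg constructor, so one must be declared.
    public MonitorPDFParser(RandomAccessRead source) throws IOException
    {
        super(source);
    }

    @Override
    protected COSStream parseCOSStream(COSDictionary dic) throws IOException
    {
        COSStream stream = super.parseCOSStream(dic);
        count.incrementAndGet();
        TASK_MAP.put(stream, "");
        return stream;
    }

}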
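
To make the question concrete, here is a minimal, library-free sketch of the same override-and-count pattern. BaseParser, Item and CountingParser are hypothetical stand-ins, not PDFBox classes. The sketch only shows what such a counter measures: the number of times the overridden hook is called. That equals the number of objects only if the base parser calls the hook exactly once per object, which is not established here for PDFParser.

```java
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.atomic.AtomicLong;

public class CountingSketch {

    // Hypothetical stand-in for a parsed object (not a PDFBox class).
    // It does not override equals/hashCode, so map keys compare by identity.
    static final class Item {
        final int id;
        Item(int id) { this.id = id; }
    }

    // Hypothetical stand-in for a parser with an overridable hook.
    static class BaseParser {
        protected Item parseItem(int id) {
            return new Item(id);
        }

        void parseAll(int n) {
            for (int i = 0; i < n; i++) {
                parseItem(i);
            }
        }
    }

    // Same shape as the subclass in the question: count calls, remember results.
    static class CountingParser extends BaseParser {
        private final Map<Item, String> seen = new ConcurrentHashMap<>();
        private final AtomicLong count = new AtomicLong();

        @Override
        protected Item parseItem(int id) {
            Item item = super.parseItem(id);
            count.incrementAndGet();
            seen.put(item, "");
            return item;
        }

        long calls() { return count.get(); }

        int distinctInstances() { return seen.size(); }
    }

    public static void main(String[] args) {
        CountingParser parser = new CountingParser();
        parser.parseAll(3);
        parser.parseItem(0); // the same logical object parsed a second time
        // Both numbers are 4 although only 3 logical ids were parsed.
        System.out.println(parser.calls() + " calls, "
                + parser.distinctInstances() + " distinct instances");
    }
}
```

In this sketch a repeated parse of the same id is counted again by both the counter and the map, so neither number is the count of logical objects unless each one is parsed exactly once.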


was (Author: JIRAUSER297279):
I have another question: how can I accurately count the COSStream objects? 
If I override the PDFParser method as follows, is the resulting count 
accurate?

public class MonitorPDFParser extends PDFParser {

    private final ConcurrentHashMap<COSStream, String> TASK_MAP = new ConcurrentHashMap<>();

    protected COSStream parseCOSStream(COSDictionary dic) throws IOException
    {
        COSStream stream = super.parseCOSStream(dic);
        TASK_MAP.put(stream, "");
        return stream;
    }

}

> Java heap space
> ---------------
>
>                 Key: PDFBOX-5530
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-5530
>             Project: PDFBox
>          Issue Type: Bug
>    Affects Versions: 2.0.25
>            Reporter: liu
>            Priority: Blocker
>         Attachments: image-2022-10-20-14-30-19-790.png, 
> image-2022-10-20-14-30-57-332.png, image-2022-10-20-14-32-10-258.png, 
> image-2022-10-20-15-01-06-688.png, image-2022-10-20-19-07-42-632.png, 
> image-2022-10-20-19-08-23-932.png, screenshot-1.png, screenshot-2.png, 
> screenshot-3.png, screenshot-4.png, 引起宕机-1.pdf, 引起宕机.pdf
>
>
> Code (only the relevant part):
> PDDocument load = PDDocument.load(file, MemoryUsageSetting.setupTempFileOnly(-1));
>  
> Hi. Why does it still use so much memory when configured like this? What 
> is the effect of using setupTempFileOnly?
> !image-2022-10-20-14-30-19-790.png!
> !image-2022-10-20-14-30-57-332.png!
> !image-2022-10-20-14-32-10-258.png!
> [^引起宕机.pdf]



--
This message was sent by Atlassian Jira
(v8.20.10#820010)
