Richard Stafford created PDFBOX-4943:
----------------------------------------
Summary: PDF Merge of large document, memory usage?
Key: PDFBOX-4943
URL: https://issues.apache.org/jira/browse/PDFBOX-4943
Project: PDFBox
Issue Type: Bug
Components: Utilities
Affects Versions: 2.0.21
Environment: Windows, java x64
Reporter: Richard Stafford
We are trying to use the PDFMergerUtility to merge a number of PDF files, but
are having an issue with exhausting java heap with a specific PDF file. This
file is about 50mb, but merging with it requires a java heap of 8gb. We've
tried using setupTempFileOnly() but that doesn't seem to help.
Looking further, just doing a PDDocument.load() operation for this file uses
3.5gb of heap, regardless of MemoryUsageSetting() values.
For instance, with the following main()
public static void main(String arg[]) public static void main(String arg[])
{ try
{ // load the document
PDDocument sourceDoc = PDDocument.load( new
File("c:\\tmp\\testfile.PDF"),
MemoryUsageSetting.setupTempFileOnly().setTempDir(new
File("c:\\tmp")) );
sourceDoc.close(); LogMessage.log( "Completed");
}
catch (Exception e)
{ LogMessage.log( "Exception in document load: "+e.toString());
}
}
Setting a breakpoint at the sourceDoc.close(), the heap has increased to 3.5gb.
Our test case can be downloaded from:
[https://s3.amazonaws.com/webdl.equorum.com/misc/testfile.pdf]
Thanks,
Rich Stafford
Chief Scientist
eQuorum Corporation
--
This message was sent by Atlassian Jira
(v8.3.4#803005)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]