Richard Stafford created PDFBOX-4943:
----------------------------------------

             Summary: PDF Merge of large document, memory usage?
                 Key: PDFBOX-4943
                 URL: https://issues.apache.org/jira/browse/PDFBOX-4943
             Project: PDFBox
          Issue Type: Bug
          Components: Utilities
    Affects Versions: 2.0.21
         Environment: Windows, java x64
            Reporter: Richard Stafford


We are trying to use the PDFMergerUtility to merge a number of PDF files, but 
are having an issue with exhausting java heap with a specific PDF file.  This 
file is about 50mb, but merging with it requires a java heap of 8gb.  We've 
tried using setupTempFileOnly() but that doesn't seem to help. 

Looking further, just doing a PDDocument.load() operation for this file uses 
3.5gb of heap, regardless of MemoryUsageSetting() values.

For instance, with the following main()

public static void main(String arg[]) public static void main(String arg[])

{ try

    { // load the document

        PDDocument sourceDoc = PDDocument.load( new 
File("c:\\tmp\\testfile.PDF"),

                          MemoryUsageSetting.setupTempFileOnly().setTempDir(new 
File("c:\\tmp")) );

        sourceDoc.close(); LogMessage.log( "Completed");

    }

  catch (Exception e)

  {   LogMessage.log( "Exception in document load: "+e.toString());

  }

}

Setting a breakpoint at the sourceDoc.close(), the heap has increased to 3.5gb.

 

Our test case can be downloaded from:

[https://s3.amazonaws.com/webdl.equorum.com/misc/testfile.pdf]

 

Thanks,

Rich Stafford

Chief Scientist

eQuorum Corporation



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to