[ 
https://issues.apache.org/jira/browse/PDFBOX-5169?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17325612#comment-17325612
 ] 

Jakov Vežić commented on PDFBOX-5169:
-------------------------------------

QPDF, unfortunately, doesn't seem to able to handle this merge. Giving the 
--empty parameter loses basically all bookmarks, links, etc. Without it, it 
fills up whole 8GB of RAM on my VM (and is also single-threaded).

 

We did run tests on multiple software solutions for merging PDFs, and PDFBox 
proved to be the best by far (both with regards to speed and keeping stuff from 
the original PDF), this is basically the first issue someone reported - out of 
some 140k documents.

> PDFMerger produces overly large output PDF
> ------------------------------------------
>
>                 Key: PDFBOX-5169
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-5169
>             Project: PDFBox
>          Issue Type: Bug
>    Affects Versions: 2.0.22, 2.0.23
>         Environment: Debian 10
>            Reporter: Jakov Vežić
>            Priority: Minor
>
> Using PDFMerger to combine
> [https://www.dropbox.com/s/kprk7aeggni420c/1.pdf?dl=1]
> with
> [https://www.dropbox.com/s/0h8bced4tm3gppz/2.pdf?dl=1]
> results in an overly large file. The two input files are 1,25 MB and 16,3 MB 
> large, while the output file is just over 400 MB large. The action also 
> consumes about 1 GB of memory. No errors are produced during the merge that I 
> can tell.
> The command is:
> {code:java}
> java -Xmx2500M -jar pdfbox-app-2.0.23.jar PDFMerger 1.pdf 2.pdf output.pdf
> {code}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to