[
https://issues.apache.org/jira/browse/PDFBOX-3816?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16033281#comment-16033281
]
Tilman Hausherr commented on PDFBOX-3816:
-----------------------------------------
An additional factor is that the original file probably has compressed object
streams. Those saved by PDFBox don't. I tried re-saving p226. The PDFBox
version is 3278KB, the one saved by Adobe Reader has 3194KB.
> PDF split produces inflated file sizes
> --------------------------------------
>
> Key: PDFBOX-3816
> URL: https://issues.apache.org/jira/browse/PDFBOX-3816
> Project: PDFBox
> Issue Type: Bug
> Components: Utilities
> Affects Versions: 2.0.6
> Reporter: Mike Cantrell
> Priority: Minor
> Attachments: high-rez-color-1.pdf, page-files-disk-usage.png,
> screenshot-1.png
>
>
> It seems that there are still issues surrounding inflated PDF page sizes when
> performing split operations. See PDFBOX-1618 for background.
> My test PDF is 97MB and the sum of the resulting pages is 851MB. The
> following demonstrates the issue with the command line tool:
> {code}
> curl https://storage.googleapis.com/pids-share/high-rez-color.pdf -o test.pdf
> java -jar $PDFBOXTOOLS/pdfbox-app-2.0.6.jar PDFSplit test.pdf
> {code}
--
This message was sent by Atlassian JIRA
(v6.3.15#6346)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]