[ 
https://issues.apache.org/jira/browse/PDFBOX-3816?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16033281#comment-16033281
 ] 

Tilman Hausherr commented on PDFBOX-3816:
-----------------------------------------

An additional factor is that the original file probably has compressed object 
streams. Those saved by PDFBox don't. I tried re-saving p226. The PDFBox 
version is 3278KB, the one saved by Adobe Reader has 3194KB.

> PDF split produces inflated file sizes
> --------------------------------------
>
>                 Key: PDFBOX-3816
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-3816
>             Project: PDFBox
>          Issue Type: Bug
>          Components: Utilities
>    Affects Versions: 2.0.6
>            Reporter: Mike Cantrell
>            Priority: Minor
>         Attachments: high-rez-color-1.pdf, page-files-disk-usage.png, 
> screenshot-1.png
>
>
> It seems that there are still issues surrounding inflated PDF page sizes when 
> performing split operations. See PDFBOX-1618 for background.
> My test PDF is 97MB and the sum of the resulting pages is 851MB. The 
> following demonstrates the issue with the command line tool:
> {code}
> curl https://storage.googleapis.com/pids-share/high-rez-color.pdf -o test.pdf
> java -jar $PDFBOXTOOLS/pdfbox-app-2.0.6.jar PDFSplit test.pdf
> {code}



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to