[
https://issues.apache.org/jira/browse/PDFBOX-3816?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16033236#comment-16033236
]
Mike Cantrell commented on PDFBOX-3816:
---------------------------------------
All of the resulting pages are 851MB. It seems that something is being
inflated, or that resources are being duplicated. Inspecting the stream shows
a much larger image size in the PDFBox output, so I thought the images might
have been re-encoded. The size difference in the stream accounts for the gap
between the expected and actual file sizes.
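
To put a number on the inflation, a small check like the one below could be
run after the split. It is only a sketch and uses nothing from PDFBox itself:
it adds up the sizes of the split files and divides by the size of the source.
It assumes the pages were written next to the source as test-1.pdf,
test-2.pdf, and so on (adjust the glob if the output is named differently).
SplitSizeCheck, totalSize and inflation are illustrative names only.

```java
import java.io.IOException;
import java.nio.file.DirectoryStream;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;

// Hypothetical helper, not part of PDFBox: compares the size of a source PDF
// with the combined size of the files produced by splitting it.
class SplitSizeCheck {

    // Sums the sizes (in bytes) of all files in dir whose names match glob.
    public static long totalSize(Path dir, String glob) throws IOException {
        long total = 0;
        try (DirectoryStream<Path> parts = Files.newDirectoryStream(dir, glob)) {
            for (Path part : parts) {
                total += Files.size(part);
            }
        }
        return total;
    }

    // Ratio of combined output size to source size; 1.0 means no inflation.
    public static double inflation(long sourceBytes, long outputBytes) {
        return (double) outputBytes / sourceBytes;
    }

    public static void main(String[] args) throws IOException {
        // Assumed defaults: source is test.pdf, pages match test-*.pdf.
        Path source = Paths.get(args.length > 0 ? args[0] : "test.pdf");
        String glob = args.length > 1 ? args[1] : "test-*.pdf";
        Path dir = source.toAbsolutePath().getParent();

        long sourceBytes = Files.size(source);
        long outputBytes = totalSize(dir, glob);
        System.out.printf("source=%d bytes, split total=%d bytes, ratio=%.2f%n",
                sourceBytes, outputBytes, inflation(sourceBytes, outputBytes));
    }
}
```

If the 851MB figure is the combined size of the pages, the ratio against the
97MB source works out to roughly 8.8.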
> PDF split produces inflated file sizes
> --------------------------------------
>
> Key: PDFBOX-3816
> URL: https://issues.apache.org/jira/browse/PDFBOX-3816
> Project: PDFBox
> Issue Type: Bug
> Components: Utilities
> Affects Versions: 2.0.6
> Reporter: Mike Cantrell
> Priority: Minor
> Attachments: high-rez-color-1.pdf
>
>
> It seems that there are still issues surrounding inflated PDF page sizes when
> performing split operations. See PDFBOX-1618 for background.
> My test PDF is 97MB and the resulting pages are 851MB. The following
> demonstrates the issue with the command line tool:
> {code}
> curl https://storage.googleapis.com/pids-share/high-rez-color.pdf -o test.pdf
> java -jar $PDFBOXTOOLS/pdfbox-app-2.0.6.jar PDFSplit test.pdf
> {code}