[ 
https://issues.apache.org/jira/browse/PDFBOX-3816?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16033311#comment-16033311
 ] 

Tilman Hausherr commented on PDFBOX-3816:
-----------------------------------------

Image and font streams keep exactly the same compression as before. "Compressed 
object streams" means that structures which are normally not compressed (e.g. 
font width lists, font descriptors) are compressed in the original file. PDFBox 
can read these structures but cannot create them. But as you can see from my 
example, the gain is just 2.6%.

> PDF split produces inflated file sizes
> --------------------------------------
>
>                 Key: PDFBOX-3816
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-3816
>             Project: PDFBox
>          Issue Type: Bug
>          Components: Utilities
>    Affects Versions: 2.0.6
>            Reporter: Mike Cantrell
>            Priority: Minor
>         Attachments: high-rez-color-1.pdf, page-files-disk-usage.png, 
> screenshot-1.png
>
>
> It seems that there are still issues surrounding inflated PDF page sizes when 
> performing split operations. See PDFBOX-1618 for background.
> My test PDF is 97MB and the sum of the resulting pages is 851MB. The 
> following demonstrates the issue with the command line tool:
> {code}
> curl https://storage.googleapis.com/pids-share/high-rez-color.pdf -o test.pdf
> java -jar $PDFBOXTOOLS/pdfbox-app-2.0.6.jar PDFSplit test.pdf
> {code}
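
The size comparison in the report (97MB original versus 851MB for the sum of 
the pages, roughly 8.8 times larger) can be checked with a small helper. This 
is only a sketch using the Java standard library, not part of PDFBox: the 
class name SplitSizeCheck and its methods are hypothetical, and the glob for 
the page files (e.g. "test-*.pdf") is an assumption about how the split 
output is named, so it is passed in as an argument.

```java
import java.io.IOException;
import java.nio.file.DirectoryStream;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;

// Hypothetical helper (not a PDFBox class): compares the size of an original
// PDF with the combined size of the page files produced by a split.
class SplitSizeCheck {

    // Sum the sizes, in bytes, of all regular files in dir matching the glob.
    public static long totalSize(Path dir, String glob) throws IOException {
        long total = 0;
        try (DirectoryStream<Path> stream = Files.newDirectoryStream(dir, glob)) {
            for (Path p : stream) {
                if (Files.isRegularFile(p)) {
                    total += Files.size(p);
                }
            }
        }
        return total;
    }

    // Ratio of combined split size to original size; 1.0 means no inflation.
    public static double inflationRatio(long originalBytes, long splitBytes) {
        if (originalBytes <= 0) {
            throw new IllegalArgumentException("original size must be positive");
        }
        return (double) splitBytes / originalBytes;
    }

    public static void main(String[] args) throws IOException {
        if (args.length < 3) {
            System.err.println("usage: SplitSizeCheck <original.pdf> <dir> <glob>");
            return;
        }
        Path original = Paths.get(args[0]);
        // The glob is an assumption about the split output names; adjust it
        // so that it matches the page files but not the original.
        long split = totalSize(Paths.get(args[1]), args[2]);
        long orig = Files.size(original);
        System.out.printf("original: %d bytes, pages: %d bytes, ratio: %.2f%n",
                orig, split, inflationRatio(orig, split));
    }
}
```

It could be run as, for example, `java SplitSizeCheck test.pdf . "test-*.pdf"` 
after the split, assuming the page files were written to the current directory.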



