[ 
https://issues.apache.org/jira/browse/PDFBOX-785?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13818828#comment-13818828
 ] 

MH commented on PDFBOX-785:
---------------------------

My tests with PDFBox 1.8.2:

* addPage(): the resulting PDF is only a few KB smaller than the original (for PDFs of several megabytes)

* importPage(): results in PDFs up to 3 times bigger than the original PDF!

Both remove the outlines from the original PDF, so this alone is a showstopper.

> Splitting a PDF creates unnecessarily large files
> -------------------------------------------------
>
>                 Key: PDFBOX-785
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-785
>             Project: PDFBox
>          Issue Type: Bug
>    Affects Versions: 0.8.0-incubator, 1.1.0, 1.2.1
>         Environment: Windows XP, openOffice3.0.0, pdfsam
>            Reporter: mathieu radiguet
>         Attachments: fileSizeIssue.zip
>
>
> Using PDFBox 0.8.0 (also tried on 1.1.0 and 1.2.1) to split files results in 
> parts that are bigger than the original.
> The affected files were made from OpenOffice .odt documents in version 3.0.0 
> using the OpenOffice PDF export and then merging several copies with pdfsam 
> (http://www.pdfsam.org/).
> In the attached Eclipse project the test file size is 10 712 749 bytes for 
> 2812 pages, and the result file sizes after splitting in two at page 2300 are 
> 8 812 515 bytes and 10 701 142 bytes.
> Using pdfSplit on the command line, every single result file is 
> bigger than the original. An example is also attached. An error says the 
> original file is corrupted, but we tried it on a file (both with and 
> without pdfsam) that gave no error and got similar results, so I think it is 
> not related.
> This issue seems similar to JIRA PDFBOX-28 
> (https://issues.apache.org/jira/browse/PDFBOX-28)
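
To put the sizes in the quoted report in perspective, here is a small sketch of the arithmetic. The byte counts are copied from the report; the class and method names are hypothetical helpers for illustration and are not part of PDFBox.

```java
// Sketch only: byte counts come from the quoted report; SplitSizeCheck and
// its methods are hypothetical names, not a PDFBox API.
final class SplitSizeCheck {

    /** Sum of the sizes of all split parts, in bytes. */
    static long totalSize(long[] partSizes) {
        long total = 0;
        for (long size : partSizes) {
            total += size;
        }
        return total;
    }

    /** Bytes by which the parts together exceed the original file. */
    static long overhead(long originalSize, long[] partSizes) {
        return totalSize(partSizes) - originalSize;
    }

    /** Combined size of the parts relative to the original (1.0 = no growth). */
    static double growthFactor(long originalSize, long[] partSizes) {
        return (double) totalSize(partSizes) / originalSize;
    }

    public static void main(String[] args) {
        long original = 10_712_749L;               // test file, 2812 pages
        long[] parts = {8_812_515L, 10_701_142L};  // two parts after the split

        System.out.println("combined bytes: " + totalSize(parts));          // 19513657
        System.out.println("overhead bytes: " + overhead(original, parts)); // 8800908
        System.out.println("growth factor:  " + growthFactor(original, parts));
    }
}
```

By these figures the two parts together are roughly 1.8 times the size of the original, and the second part alone is almost as large as the whole input file.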



--
This message was sent by Atlassian JIRA
(v6.1#6144)
