[ 
https://issues.apache.org/jira/browse/PDFBOX-1618?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13823593#comment-13823593
 ] 

Ross McPhee commented on PDFBOX-1618:
-------------------------------------

Hi

I was hoping to use the command line tools to split a PDF into two and then 
merge the result around another PDF (in affect, inserting the other PDF 
part-way through the original).

However, the two split PDFs are bascially the same size as the original, even 
if the split is on page 2 of a 119 page document.  This has a dramatic affect 
upon the resultant fie size - doubling it bascially.

Is there anyway of removing the unused references from a PDF from the command 
line?  I was hoping not to have to delve in to the API for this seemingly 
innocuos task (inserting one PDF part-way though another).

> Split PDF file to single page files, some files are inflated in size
> --------------------------------------------------------------------
>
>                 Key: PDFBOX-1618
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-1618
>             Project: PDFBox
>          Issue Type: Bug
>          Components: PDModel
>    Affects Versions: 1.8.1
>         Environment: Windows 7, JVM 1.6.0_29
>            Reporter: Tom Taylor
>         Attachments: 112080-TECHNICAL MANUAL FOR GENERATOR NIR 7194 A-10LW OF 
> 4038 KVA.pdf, Test_PDFs.zip, internalstructure.png
>
>
> A PDF file is split into single pages for inclusion within another document 
> (pdfbox.utils.Splitter within our code but same phenomenon observed when 
> splitting using command line PDFSplit tool). Som of the pages are almost as 
> large as the original file which causes performance problems for our 
> customers.
> Again, I have a sample pdf to attach.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

Reply via email to