[
https://issues.apache.org/jira/browse/PDFBOX-1618?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13823593#comment-13823593
]
Ross McPhee commented on PDFBOX-1618:
-------------------------------------
Hi
I was hoping to use the command line tools to split a PDF into two and then
merge the result around another PDF (in affect, inserting the other PDF
part-way through the original).
However, the two split PDFs are bascially the same size as the original, even
if the split is on page 2 of a 119 page document. This has a dramatic affect
upon the resultant fie size - doubling it bascially.
Is there anyway of removing the unused references from a PDF from the command
line? I was hoping not to have to delve in to the API for this seemingly
innocuos task (inserting one PDF part-way though another).
> Split PDF file to single page files, some files are inflated in size
> --------------------------------------------------------------------
>
> Key: PDFBOX-1618
> URL: https://issues.apache.org/jira/browse/PDFBOX-1618
> Project: PDFBox
> Issue Type: Bug
> Components: PDModel
> Affects Versions: 1.8.1
> Environment: Windows 7, JVM 1.6.0_29
> Reporter: Tom Taylor
> Attachments: 112080-TECHNICAL MANUAL FOR GENERATOR NIR 7194 A-10LW OF
> 4038 KVA.pdf, Test_PDFs.zip, internalstructure.png
>
>
> A PDF file is split into single pages for inclusion within another document
> (pdfbox.utils.Splitter within our code but same phenomenon observed when
> splitting using command line PDFSplit tool). Som of the pages are almost as
> large as the original file which causes performance problems for our
> customers.
> Again, I have a sample pdf to attach.
--
This message was sent by Atlassian JIRA
(v6.1#6144)