[ 
https://issues.apache.org/jira/browse/PDFBOX-1618?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13867714#comment-13867714
 ] 

Thomas Sörensen commented on PDFBOX-1618:
-----------------------------------------

Hi 

So I noticed that each splitted pdf page that was large contained a link to 
another page.
So i tried removing all annotations from each page. 
PDPage.setAnnotations(emptyList).
The total size went down from over 100Mb to 17Mb. The original file is 3MB.
If I then try merge the pages again none of the links work anymore of course.
Can someone give an explanation for this?




> Split PDF file to single page files, some files are inflated in size
> --------------------------------------------------------------------
>
>                 Key: PDFBOX-1618
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-1618
>             Project: PDFBox
>          Issue Type: Bug
>          Components: PDModel
>    Affects Versions: 1.8.1
>         Environment: Windows 7, JVM 1.6.0_29
>            Reporter: Tom Taylor
>         Attachments: 112080-TECHNICAL MANUAL FOR GENERATOR NIR 7194 A-10LW OF 
> 4038 KVA.pdf, Test_PDFs.zip, internalstructure.png
>
>
> A PDF file is split into single pages for inclusion within another document 
> (pdfbox.utils.Splitter within our code but same phenomenon observed when 
> splitting using command line PDFSplit tool). Som of the pages are almost as 
> large as the original file which causes performance problems for our 
> customers.
> Again, I have a sample pdf to attach.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

Reply via email to