[ 
https://issues.apache.org/jira/browse/PDFBOX-5216?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17365244#comment-17365244
 ] 

yoonho edited comment on PDFBOX-5216 at 6/18/21, 5:08 AM:
----------------------------------------------------------

[~tilman]

Hello, I'm also looking at page 26 carefully. There are many duplicate images 
on that page, and I need a way to get rid of duplicate or unused objects.

I found that removing some of the image objects Im0 through Im12 on page 26 
did not affect the output.

Is there any other way to identify obsolete objects?

This problem seems difficult to me. Thank you for your help.
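
One possible way to tell which image objects are obsolete is to compare the 
names in the page's XObject resources against the names the content stream 
actually paints with the Do operator. The sketch below is plain Java and is 
not PDFBox API: the class UnusedXObjects, the method findUnused and the sample 
content string are hypothetical, and the content stream is assumed to be 
already decoded to text. A real implementation would tokenize the stream with 
a proper parser (for example PDFBox's PDFStreamParser) and would also have to 
look inside form XObjects, which carry their own resources.

```java
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Set;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

public class UnusedXObjects {
    // Matches "/Name Do", the operator that paints an XObject.
    // Simplification: a regex is not a real PDF tokenizer.
    private static final Pattern DO_OPERATOR = Pattern.compile("/([^\\s/]+)\\s+Do\\b");

    /** Returns the resource names that are never painted by a Do operator. */
    static Set<String> findUnused(List<String> resourceNames, String contentStream) {
        Set<String> used = new LinkedHashSet<>();
        Matcher m = DO_OPERATOR.matcher(contentStream);
        while (m.find()) {
            used.add(m.group(1));
        }
        // Start from every declared name and drop the ones that are used.
        Set<String> unused = new LinkedHashSet<>(resourceNames);
        unused.removeAll(used);
        return unused;
    }

    public static void main(String[] args) {
        // Hypothetical decoded content stream that paints Im0 and Im2 only.
        String content = "q 100 0 0 100 0 0 cm /Im0 Do Q q 50 0 0 50 10 10 cm /Im2 Do Q";
        Set<String> unused = findUnused(List.of("Im0", "Im1", "Im2"), content);
        System.out.println(unused); // prints [Im1]
    }
}
```

A name reported here is only a candidate for removal. It should be checked 
against every content stream that shares the same resource dictionary before 
the object is dropped.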


> Is there a way to optimize by cleaning up duplicate objects?
> ------------------------------------------------------------
>
>                 Key: PDFBOX-5216
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-5216
>             Project: PDFBox
>          Issue Type: Wish
>            Reporter: yoonho
>            Priority: Major
>         Attachments: samepage.png, 스크린샷 2021-06-15 오후 2.02.21.png
>
>
> Is there a way to clean up duplicate objects using PDFBox?
> [http://gofile.me/4hSqO/Cis33w0Sa] - Original
> [http://gofile.me/4hSqO/7XKmWqUBB]  - Clean version
> I applied Adobe DC's Optimize option (the relevant settings are in the 
> attached file). As a result, a 48 MB PDF file was reduced to 19 MB. I think 
> this is due to cleaning up duplicate objects in the PDF.
> Am I right? I would like to implement this process with PDFBox. How should I 
> approach it?
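
As a rough illustration of what "cleaning up duplicate objects" could mean, 
the sketch below groups streams by a SHA-256 digest of their bytes, so that 
byte-identical streams end up in the same group. It is plain Java and not 
PDFBox API: the class DuplicateStreams, the method findDuplicates and the 
sample data are hypothetical. With PDFBox, the bytes would have to be read 
from each image XObject's stream, and the references to the duplicates would 
then have to be redirected to a single remaining object. Neither step is 
shown here, and nothing in this issue confirms that Adobe's optimizer works 
this way.

```java
import java.nio.charset.StandardCharsets;
import java.security.MessageDigest;
import java.security.NoSuchAlgorithmException;
import java.util.ArrayList;
import java.util.Base64;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

public class DuplicateStreams {
    /** Returns groups of names whose stream bytes are identical. */
    static List<List<String>> findDuplicates(Map<String, byte[]> streams) {
        MessageDigest sha256;
        try {
            sha256 = MessageDigest.getInstance("SHA-256");
        } catch (NoSuchAlgorithmException e) {
            // SHA-256 is required on every Java platform, so this is unexpected.
            throw new IllegalStateException(e);
        }
        // Digest (Base64 text) -> names of all streams with that digest.
        Map<String, List<String>> groups = new LinkedHashMap<>();
        for (Map.Entry<String, byte[]> e : streams.entrySet()) {
            // digest(byte[]) hashes the input and resets the instance for reuse.
            String key = Base64.getEncoder().encodeToString(sha256.digest(e.getValue()));
            groups.computeIfAbsent(key, k -> new ArrayList<>()).add(e.getKey());
        }
        // Keep only the groups that contain more than one name.
        List<List<String>> duplicates = new ArrayList<>();
        for (List<String> names : groups.values()) {
            if (names.size() > 1) {
                duplicates.add(names);
            }
        }
        return duplicates;
    }

    public static void main(String[] args) {
        // Hypothetical stand-ins for the raw bytes of three image streams.
        Map<String, byte[]> streams = new LinkedHashMap<>();
        streams.put("Im0", "same-bytes".getBytes(StandardCharsets.US_ASCII));
        streams.put("Im1", "other-bytes".getBytes(StandardCharsets.US_ASCII));
        streams.put("Im2", "same-bytes".getBytes(StandardCharsets.US_ASCII));
        System.out.println(findDuplicates(streams)); // prints [[Im0, Im2]]
    }
}
```

Equal bytes alone are not sufficient for a safe merge. Two image streams can 
share the same data but differ in their dictionary entries (size, color space, 
filter), so those would need to be compared as well.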



--
This message was sent by Atlassian Jira
(v8.3.4#803005)