[ 
https://issues.apache.org/jira/browse/PDFBOX-2742?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14394478#comment-14394478
 ] 

Tilman Hausherr commented on PDFBOX-2742:
-----------------------------------------

We have a choice between two evils:
1. ignore global resources, as done since PDFBOX-28
2. not ignore global resources, as done in 2.0. File 239665 of PDFBOX-2101 
produces extremely large files because in that file, everything is a global 
resource. The original and each split are 4MB large.

Possible solution for version 2.0: use code like in the ExtractImages class to 
find out what resources are really used. However I don't know if that code 
would also "visit" all nested streams (e.g. in forms, annotations, etc).



> PDFSplit ignores global resources
> ---------------------------------
>
>                 Key: PDFBOX-2742
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-2742
>             Project: PDFBox
>          Issue Type: Bug
>          Components: Utilities, Writing
>    Affects Versions: 1.8.9, 1.8.10
>         Environment: Linux+Java (Oracle)
>            Reporter: Wolfgang S. Kechel
>             Fix For: 1.8.10
>
>         Attachments: original.pdf
>
>
> When using PDFsplit to split the attached one page doc in single pages the 
> result page is incomplete.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to