[ https://issues.apache.org/jira/browse/PDFBOX-1207?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13915128#comment-13915128 ]
Tilman Hausherr edited comment on PDFBOX-1207 at 2/27/14 10:30 PM:
-------------------------------------------------------------------
I can't even test it, because your PDF has incorrect syntax:
{code}
7752 0 obj
<<
....
>>
7754 0 obj
{code}
The "endobj" is missing. PDFBOX-1811 fixed a similar case, but not this one :-(
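For illustration, here is a minimal Java sketch of how an unclosed object like the one above could be spotted. It does not use PDFBox and is not how the PDFBox parser works; the class name MissingEndobjCheck and the method findUnclosedObjects are hypothetical. It treats the file content as plain text and reports every "N G obj" header that is followed by another header (or the end of input) before an "endobj" appears. It also ignores binary stream data and object streams, so on a real file it can give false hits.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper for illustration only; not part of PDFBox.
public class MissingEndobjCheck {
    // Matches either an object header such as "7752 0 obj" or the keyword "endobj".
    private static final Pattern TOKEN =
            Pattern.compile("(?<![A-Za-z0-9])(\\d+)\\s+(\\d+)\\s+obj\\b|\\bendobj\\b");

    /**
     * Returns "number generation" for every object that is not closed by
     * "endobj" before the next object header or the end of the input.
     */
    public static List<String> findUnclosedObjects(String pdfText) {
        List<String> unclosed = new ArrayList<>();
        String open = null; // header of the object currently open, if any
        Matcher m = TOKEN.matcher(pdfText);
        while (m.find()) {
            if (m.group(1) != null) {
                // New object header: the previous object was never closed.
                if (open != null) {
                    unclosed.add(open);
                }
                open = m.group(1) + " " + m.group(2);
            } else {
                // "endobj" closes the currently open object.
                open = null;
            }
        }
        if (open != null) {
            unclosed.add(open);
        }
        return unclosed;
    }

    public static void main(String[] args) {
        // Same shape as the excerpt above, with the dictionary content left out.
        String sample = "7752 0 obj\n<<\n>>\n7754 0 obj\n<<\n>>\nendobj\n";
        System.out.println(findUnclosedObjects(sample)); // prints [7752 0]
    }
}
```

To try this on a real file, the bytes would have to be decoded with a single-byte charset such as ISO-8859-1 so that binary data does not break the decoding.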
was (Author: tilman):
I can't even test it, due to your PDF having an incorrect syntax:
> PDFPageProcessor.processStream() takes 10 minutes to return
> -----------------------------------------------------------
>
> Key: PDFBOX-1207
> URL: https://issues.apache.org/jira/browse/PDFBOX-1207
> Project: PDFBox
> Issue Type: Bug
> Affects Versions: 1.6.0
> Environment: Seen on multiple platforms
> Reporter: Dan Krause
>
> Attempting to extract images and text from each page. Long processing time is
> specific to this file:
> http://docs.redhat.com/docs/en-US/Red_Hat_Enterprise_Linux/6/pdf/Installation_Guide/Red_Hat_Enterprise_Linux-6-Installation_Guide-en-US.pdf