[
https://issues.apache.org/jira/browse/PDFBOX-1787?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13836451#comment-13836451
]
Maruan Sahyoun commented on PDFBOX-1787:
----------------------------------------
Could you be a little more specific about what you are doing. How do you
extract the text? Which command line are you talking about. If you are using
the ExtractText command you can use the -nonSeq option. Using you pdf this will
not extract any text (as the file is corrupt) but will also not hang and report
an error.
BR
Maruan
> pdfbox hangs on a corrupt PDF file
> ----------------------------------
>
> Key: PDFBOX-1787
> URL: https://issues.apache.org/jira/browse/PDFBOX-1787
> Project: PDFBox
> Issue Type: Bug
> Components: Text extraction
> Affects Versions: 1.8.3
> Environment: all
> Reporter: Hong-Thai Nguyen
> Priority: Critical
> Fix For: 1.8.4
>
> Attachments: corrupt_file.pdf
>
>
> pdfbox hangs on command line on attached file.
--
This message was sent by Atlassian JIRA
(v6.1#6144)