[ 
https://issues.apache.org/jira/browse/PDFBOX-1787?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13836451#comment-13836451
 ] 

Maruan Sahyoun commented on PDFBOX-1787:
----------------------------------------

Could you be a little more specific about what you are doing. How do you 
extract the text? Which command line are you talking about. If you are using 
the ExtractText command you can use the -nonSeq option. Using you pdf this will 
not extract any text (as the file is corrupt) but will also not hang and report 
an error.

BR
Maruan

> pdfbox hangs on a corrupt PDF file
> ----------------------------------
>
>                 Key: PDFBOX-1787
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-1787
>             Project: PDFBox
>          Issue Type: Bug
>          Components: Text extraction
>    Affects Versions: 1.8.3
>         Environment: all
>            Reporter: Hong-Thai Nguyen
>            Priority: Critical
>             Fix For: 1.8.4
>
>         Attachments: corrupt_file.pdf
>
>
> pdfbox hangs on command line on attached file.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

Reply via email to