[ 
https://issues.apache.org/jira/browse/TIKA-1482?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14217800#comment-14217800
 ] 

Tim Allison commented on TIKA-1482:
-----------------------------------

To add to [~gagravarr]'s comment: if [~lfcnassif]'s recommendation doesn't 
work, then before opening an issue on PDFBox's Jira, try downloading PDFBox 
1.8.7 [here|https://pdfbox.apache.org/downloads.html] and running the 
standalone app on the file to see whether the problem reproduces outside Tika:
{noformat}
java -jar pdfbox-app-x.y.z.jar ExtractText [OPTIONS] <inputfile> [Text file]
{noformat}

> ForkParser throws exceptions when process some large pdf files
> --------------------------------------------------------------
>
>                 Key: TIKA-1482
>                 URL: https://issues.apache.org/jira/browse/TIKA-1482
>             Project: Tika
>          Issue Type: Bug
>          Components: parser
>    Affects Versions: 1.6
>         Environment: Windows 7_x64 / JDK 1.7.0_17
>            Reporter: Sean Zhao
>            Priority: Critical
>             Fix For: 1.6
>
>         Attachments: SRCH-13412.pdf
>
>
> In Tika 1.6, ForkParser throws org.apache.tika.exception.TikaException 
> with the message "Unexpected error in forked server process" when parsing 
> some large PDF files. Tika 1.3 does not.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
