[
https://issues.apache.org/jira/browse/TIKA-1482?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14217800#comment-14217800
]
Tim Allison commented on TIKA-1482:
-----------------------------------
To add to [~gagravarr]'s comment, if [~lfcnassif]'s recommendation doesn't
work, before opening an issue on PDFBox's Jira, try grabbing PDFBox 1.8.7
[here|https://pdfbox.apache.org/downloads.html] and run the app:
{noformat}
java -jar pdfbox-app-x.y.z.jar ExtractText [OPTIONS] <inputfile> [Text file]
{noformat}
> ForkParser throws exceptions when process some large pdf files
> --------------------------------------------------------------
>
> Key: TIKA-1482
> URL: https://issues.apache.org/jira/browse/TIKA-1482
> Project: Tika
> Issue Type: Bug
> Components: parser
> Affects Versions: 1.6
> Environment: Windows 7_x64 / JDK 1.7.0_17
> Reporter: Sean Zhao
> Priority: Critical
> Fix For: 1.6
>
> Attachments: SRCH-13412.pdf
>
>
> In Tika 1.6, ForkParser throws org.apache.tika.exception.TikaException ,
> message:Unexpected error in forked server process, when parsing some large
> pdf files. While tika 1.3 won't.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)