[
https://issues.apache.org/jira/browse/PDFBOX-2445?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14180390#comment-14180390
]
John Hewson commented on PDFBOX-2445:
-------------------------------------
{quote}
images are decoded even when only text extraction is done. This happens when
the resources map is built.
{quote}
This is indeed no longer the case in 2.0 since my last few commits: images are no longer decoded while the resources map is built.
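
For illustration only, here is a minimal sketch of the general idea of deferring image decoding until an image is actually requested. Every name in it (`LazyResourcesSketch`, `LazyResource`, `decodeImage`, `buildResources`) is hypothetical and is not PDFBox's API or its actual implementation; it only assumes that building the resources map can store a cheap handle per image instead of the decoded data.

```java
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.function.Supplier;

// Hypothetical sketch, NOT PDFBox code: the resources map stores lazy handles.
public class LazyResourcesSketch {
    /** Counts how many times the (expensive) decode step actually ran. */
    static int decodeCount = 0;

    /** Stand-in for decoding an image stream into pixel data. */
    static byte[] decodeImage(String name) {
        decodeCount++;
        return new byte[16]; // placeholder for decoded pixels
    }

    /** Resource entry that postpones decoding until get() is first called. */
    static final class LazyResource implements Supplier<byte[]> {
        private final String name;
        private byte[] decoded; // stays null until first access

        LazyResource(String name) {
            this.name = name;
        }

        @Override
        public byte[] get() {
            if (decoded == null) {
                decoded = decodeImage(name);
            }
            return decoded;
        }
    }

    /** Builds the resources map without decoding any image. */
    static Map<String, LazyResource> buildResources(String... imageNames) {
        Map<String, LazyResource> resources = new LinkedHashMap<>();
        for (String imageName : imageNames) {
            resources.put(imageName, new LazyResource(imageName));
        }
        return resources;
    }

    public static void main(String[] args) {
        Map<String, LazyResource> resources = buildResources("Im0", "Im1", "Im2");
        // A text-only pass never calls get(), so nothing has been decoded yet.
        System.out.println("decodes after building map: " + decodeCount);
        // A consumer that needs one image triggers exactly one decode.
        resources.get("Im1").get();
        System.out.println("decodes after using Im1: " + decodeCount);
    }
}
```

With this shape, a text-only pass pays nothing for images it never touches, and repeated access to the same entry decodes it only once.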
> Out of Memory - Extract text for Apache_Solr_4.7_Ref_Guide.pdf
> --------------------------------------------------------------
>
> Key: PDFBOX-2445
> URL: https://issues.apache.org/jira/browse/PDFBOX-2445
> Project: PDFBox
> Issue Type: Bug
> Components: Parsing, PDModel
> Affects Versions: 1.8.7, 2.0.0
> Reporter: Maruan Sahyoun
>