[ https://issues.apache.org/jira/browse/PDFBOX-1076?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
John Hewson updated PDFBOX-1076:
--------------------------------
Component/s: Text extraction
> PDF Text Extraction takes 5x longer with some files
> ---------------------------------------------------
>
> Key: PDFBOX-1076
> URL: https://issues.apache.org/jira/browse/PDFBOX-1076
> Project: PDFBox
> Issue Type: Bug
> Components: Text extraction
> Affects Versions: 1.3.1, 1.4.0, 1.5.0, 1.6.0
> Reporter: Moshe Immerman
>
> Text extraction on one particular file takes about 2 seconds with 1.2.1, but
> from 1.3.1 onward it takes about 11 seconds.
> The sample file is confidential. Please provide an email address I can send
> it to, so that it is not made publicly available.
--
This message was sent by Atlassian JIRA (v6.1.5#6160)