[
https://issues.apache.org/jira/browse/PDFBOX-4805?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Andreas Lehmkühler resolved PDFBOX-4805.
----------------------------------------
Resolution: Fixed
> Regression in 2.0.19
> --------------------
>
> Key: PDFBOX-4805
> URL: https://issues.apache.org/jira/browse/PDFBOX-4805
> Project: PDFBox
> Issue Type: Bug
> Components: Text extraction
> Affects Versions: 2.0.19
> Reporter: Andreas Lehmkühler
> Assignee: Andreas Lehmkühler
> Priority: Major
> Fix For: 2.0.20
>
>
> Joel Hirsh reported a regression with PDFTextStripper which was introduced
> with 2.0.19, see his post on
> [users@|https://lists.apache.org/thread.html/r35b50f5b00a39dcf6e77637e2ff2e097f26c395628ae476ab37b344a%40%3Cusers.pdfbox.apache.org%3E]
> for details.
> He can't share the pdf in questions due to privacy but did some debugging and
> found out that PDFBOX-4760 is the case for that regression. I accidentally
> committed some [unrelated code|https://svn.apache.org/r1873653] which leads
> to bad text extraction results. As the code targets some corner cases it
> didn't came up as an issue when running our pre release tests. The issue is
> limited to the 2.0 trunk.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]