Tim Allison created PDFBOX-2376:
-----------------------------------
Summary: Small regression in text extraction with PDFBox 1.8.7 vs.
1.8.6
Key: PDFBOX-2376
URL: https://issues.apache.org/jira/browse/PDFBOX-2376
Project: PDFBox
Issue Type: Bug
Reporter: Tim Allison
Priority: Minor
On at least one file in govdocs1, less text is being extracted with PDFBox
1.8.7 than was extracted with 1.8.6. When running the app.jar with
ExtractText, 1.8.7 is not extracting:
{noformat}
Designated Counties
No Designation
Individual Assistance
All counties are eligible
ITS Mapping & Analysis CenterWashington, DC
05/09/08 -- 09:36 AM EDT
Source: Disaster Federal Registry Notice05/08/2008
Location Map
MapID 196d109cd27
for Hazard Mitigation
{noformat}
from govdocs1's 894770.pdf.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)