Evgeny Chesnokov created PDFBOX-2800:
----------------------------------------
Summary: PDFTextStripper calculates the character bounding boxes
incorrectly
Key: PDFBOX-2800
URL: https://issues.apache.org/jira/browse/PDFBOX-2800
Project: PDFBox
Issue Type: Bug
Components: Utilities
Affects Versions: 1.8.9
Environment: java version "1.6.0_35"
Java(TM) SE Runtime Environment (build 1.6.0_35-b10)
Java HotSpot(TM) 64-Bit Server VM (build 20.10-b01, mixed mode)
Reporter: Evgeny Chesnokov
For a specific file, the coordinates extracted from the TextPosition objects
stored in the charactersByArticle variable do not match the actual positions of
the characters on the page. Some of the rectangles are returned with zero
height, and others appear shifted along the vertical axis. I am attaching files
that illustrate the issue: the sample file itself and a file highlighting the
mismatched bounding rectangles on the 2nd page.