Maxime Veron created PDFBOX-3841:
------------------------------------
Summary: OpenText - Exstream PDFs textual informations are
recognized with erroneous font size
Key: PDFBOX-3841
URL: https://issues.apache.org/jira/browse/PDFBOX-3841
Project: PDFBox
Issue Type: Bug
Affects Versions: 1.8.13, 1.8.12
Environment: Windows/Linux
Reporter: Maxime Veron
Attachments: 436bc378-aa42-43e5-a4de-a0bbaf233f79-4.PDF vII.PDF
I'm creating this bug report because I observed a particular behavior in PDFBox
parsing of text positions.
I used the command line tool (PrintTextPositions) to generate the text
positions and remove any code logic our application could have over PDFBox and
here is an extract of the result :
String[17.279999,18.800049 fs=*50.0 *xscale=12.0 height=6.918 space=3.0
width=3.9960938]-
String[21.6,18.800049 fs=*50.0 *xscale=12.0 height=6.918 space=3.0
width=5.999998]*
String[28.08,18.800049 fs=*50.0 *xscale=12.0 height=6.918 space=3.0
width=3.9960918]-
String[31.679998,18.800049 fs=*50.0 *xscale=12.0 height=6.918 space=3.0
width=3.000002]
The font size is reported as 50.0 (which is indeed contained in the PDF as 50)
but Adobe and other PDF viewers do not seem to take into account this font of
size 50 to render the text selection. Should we rely on another metric from the
text position, or could this font size be reported relative to the correct
appearance the text should have?
Best regards
--
This message was sent by Atlassian JIRA
(v6.4.14#64029)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]