Re: Translation of combining diacritics Type 1 font

Tilman Hausherr Wed, 19 Feb 2020 19:19:36 -0800

Hi,
Please retry with the current version, which is PDFBox 2.0.18, soon 2.0.19.

Then use the DrawPrintTextLocations.java example to see if the cyanbounds are correct. If not, please open an issue for that one. Don'treuse font subsets.

Tilman


Am 20.02.2020 um 04:00 schrieb jd9...@rit.edu:

Hello,
I am currently a researcher at RIT's DPRL, using PDFBox 2.0.7 withMHVHUS+CMR10 Type 1 font and PDFTextStripper. I am interested infinding the matrix (or values) used to translate diacritic elements,or a similar way to find the positioning of diacritic elements.
In my example, the Type 1 font is an embedded subset within the pdfdocument using Type1Encoding. When I access the glyph for thediacritic element eg. dieresis, through getPath, the position of thepath is above the lowercase characters. For uppercase characters, Ican get the diacritic, however the position of the path is the same aslowercase characters, as opposed to placed above the uppercasecharacter. In addition, the name is the combining diacritic. E.G.dieresiscmb, which isn't available in getCharStringsDict or getCharSet.
On a side note, combining diacritical names cause problems when usingthe PDPageContentStream class to showText of the unicode; resulting inan IllegalArgumentException that the combining diacritic does notexist in the font, even when the character's TextPosition and fontwere parsed using PDFTextStripper. Let me know if I should open aticket for this issue.
How are the diacritical accents for Type 1 fonts translated from theirstored location into place?
diacriticdieresis.png
(I have cc'd my advisor)

Thank you,
Jessica Diehl

---------------------------------------------------------------------
To unsubscribe, e-mail: users-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: users-h...@pdfbox.apache.org

Re: Translation of combining diacritics Type 1 font

Reply via email to