[
https://issues.apache.org/jira/browse/PDFBOX-2923?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14663000#comment-14663000
]
Tilman Hausherr commented on PDFBOX-2923:
-----------------------------------------
Should your patch have an influence on rendering or on text extraction? I
tested it on rendering and found no difference. If on text extraction, I'd like
to have a sample file, alternatively a way to find one, I have 250000 sample
PDFs. Or does it help when creating PDFs with that attached font?
Btw, has this alleged bug been fixed in the 2.0 version? (It seems so to me)
https://svn.apache.org/viewvc/pdfbox/trunk/fontbox/src/main/java/org/apache/fontbox/cff/CFFParser.java?view=markup&sortby=date
> CFFParser parser treats CIDFont's charset data as SID
> -----------------------------------------------------
>
> Key: PDFBOX-2923
> URL: https://issues.apache.org/jira/browse/PDFBOX-2923
> Project: PDFBox
> Issue Type: Bug
> Components: FontBox
> Affects Versions: 1.8.10
> Reporter: Petras
> Attachments: MyriadPro-Regular.cff, Patch_to_fix_PDFBOX-2923.patch
>
>
> As stated in Compact Font File specification:
> {quote}
> The charset data, although in the same format as non-CIDFonts, will represent
> CIDs rather than SIDs, i.e. charstrings are “named” by CIDs in a CIDFont.
> {quote}
> Unfortunately, {{CFFParser}} does not consider this specific and always treat
> charset data as SID: is looking for SID referenced text in _String INDEX_
> structure. Since such SID-indexed string does not exist there, it sets the
> name of the glyph to "{{.ndef}}".
> Consequently, {{CFFParser}} fails to register correct charstrings, as it
> associates glyph names to them using a map. As there can be several
> charstrings, only the last charstring entry is retained.
> Then also {{CFFFont.getWidth()}} method fails to return correct width of the
> given CID as correct link between CID and charstring is lost.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]