[ https://issues.apache.org/jira/browse/PDFBOX-5115?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Tilman Hausherr updated PDFBOX-5115: ------------------------------------ Component/s: Text extraction Rendering FontBox > U+00AD ('sfthyphen') is not available in this font Times-Roman encoding: > WinAnsiEncoding > ---------------------------------------------------------------------------------------- > > Key: PDFBOX-5115 > URL: https://issues.apache.org/jira/browse/PDFBOX-5115 > Project: PDFBox > Issue Type: Bug > Components: FontBox, Rendering, Text extraction > Affects Versions: 2.0.22 > Reporter: Andriy > Priority: Minor > Fix For: 2.0.23, 3.0.0 PDFBox > > Attachments: > PDFBOX-5115-CSIF66CF7JU7CBKTIEEW7QBGWIONBFPG-p1-soft-hyphen.pdf > > > U+00AD ('sfthyphen') is not available in this font Times-Roman encoding: > WinAnsiEncoding > > this symbol U+00AD are in WinAnsiEncoding by the code but the slightly > different name > > {quote}private static final Object[][] WIN_ANSI_ENCODING_TABLE = { > // adding some additional mappings as defined in Appendix D of the pdf spec > ... > \{0255, "hyphen"} > {quote} > > it is right that both code and name must be equal ? -- This message was sent by Atlassian Jira (v8.3.4#803005) --------------------------------------------------------------------- To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org For additional commands, e-mail: dev-h...@pdfbox.apache.org