[
https://issues.apache.org/jira/browse/PDFBOX-5115?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17296522#comment-17296522
]
Tilman Hausherr commented on PDFBOX-5115:
-----------------------------------------
Now the soft hyphen is supported when creating PDFs. The other thing isn't
supported. My understanding is that 7F is a bullet when 7F is in a PDF, not
that 7F in a showText string appears as a bullet in a PDF.
Text extraction works that the soft hyphen reappears as such. Adobe Reader
makes it disappear. Maybe we could do something, but I'd like to see real
documents where this is useful.
There will be a SNAPSHOT in a few hours at
https://repository.apache.org/content/groups/snapshots/org/apache/pdfbox/pdfbox-app/2.0.23-SNAPSHOT/
or build yourself (Jenkins is currently down, so this may take some time)
> U+00AD ('sfthyphen') is not available in this font Times-Roman encoding:
> WinAnsiEncoding
> ----------------------------------------------------------------------------------------
>
> Key: PDFBOX-5115
> URL: https://issues.apache.org/jira/browse/PDFBOX-5115
> Project: PDFBox
> Issue Type: Bug
> Affects Versions: 2.0.22
> Reporter: Andriy
> Priority: Minor
>
> U+00AD ('sfthyphen') is not available in this font Times-Roman encoding:
> WinAnsiEncoding
>
> this symbol U+00AD are in WinAnsiEncoding by the code but the slightly
> different name
>
> {quote}private static final Object[][] WIN_ANSI_ENCODING_TABLE = {
> // adding some additional mappings as defined in Appendix D of the pdf spec
> ...
> \{0255, "hyphen"}
> {quote}
>
> it is right that both code and name must be equal ?
--
This message was sent by Atlassian Jira
(v8.3.4#803005)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]